[ https://issues.apache.org/jira/browse/TIKA-2613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ewan Mellor updated TIKA-2613:
------------------------------
Description:
Tesseract 4.0 (currently in beta-1) has removed the {{\-psm}} flag, in favor of
{{\-\-psm}} (with two dashes).
The {{\-\-psm}} variant was introduced in Nov 2016, so it should be safe to
simply switch Tika to use the two-dash variant, even for people still using
Tesseract 3.05.
For reference, the Tesseract cset is:
{code}
commit ee201e1f4fa277a4b2ecd751a45d3bf1eba6dfdb
Author: Stefan Weil <[email protected]>
Date:   Sun Mar 25 17:28:33 2018 +0200

    Remove deprecated support for -psm argument (#1419)

    It was replaced by --psm and deprecated in commit 92d981b93.

    Signed-off-by: Stefan Weil <[email protected]>
{code}
was:
Tesseract 4.0 (currently in beta-1) has removed the {-psm} flag, in favor of
{--psm} (with two dashes).
The {--psm} variant was introduced in Nov 2016, so it should be safe to simply
switch Tika to use the two-dash variant, even for people still using Tesseract
3.05.
For reference, the Tesseract cset is:
{code}
commit ee201e1f4fa277a4b2ecd751a45d3bf1eba6dfdb
Author: Stefan Weil <[email protected]>
Date:   Sun Mar 25 17:28:33 2018 +0200

    Remove deprecated support for -psm argument (#1419)

    It was replaced by --psm and deprecated in commit 92d981b93.

    Signed-off-by: Stefan Weil <[email protected]>
{code}
> Tesseract 4.0 has removed -psm, so Tika must update
> ---------------------------------------------------
>
> Key: TIKA-2613
> URL: https://issues.apache.org/jira/browse/TIKA-2613
> Project: Tika
> Issue Type: Improvement
> Components: parser
> Affects Versions: 1.17
> Reporter: Ewan Mellor
> Priority: Major
>
> Tesseract 4.0 (currently in beta-1) has removed the {{\-psm}} flag, in favor
> of {{\-\-psm}} (with two dashes).
> The {{\-\-psm}} variant was introduced in Nov 2016, so it should be safe to
> simply switch Tika to use the two-dash variant, even for people still using
> Tesseract 3.05.
> For reference, the Tesseract cset is:
> {code}
> commit ee201e1f4fa277a4b2ecd751a45d3bf1eba6dfdb
> Author: Stefan Weil <[email protected]>
> Date:   Sun Mar 25 17:28:33 2018 +0200
>
>     Remove deprecated support for -psm argument (#1419)
>
>     It was replaced by --psm and deprecated in commit 92d981b93.
>
>     Signed-off-by: Stefan Weil <[email protected]>
> {code}
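
The proposed switch could look roughly like the sketch below. This is only an illustration, not Tika's actual parser code: the class name PsmFlagSketch, the method buildCommand, and its parameters are all hypothetical. It assumes the usual Tesseract command-line layout (image, output base, then options) and always emits the two-dash flag, which per the description above should work on both 3.05 and 4.0.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration only; not Tika's actual code.
final class PsmFlagSketch {

    private PsmFlagSketch() {
    }

    /**
     * Builds a Tesseract argument list that uses the two-dash --psm flag.
     * All parameter names are assumptions made for this sketch.
     */
    static List<String> buildCommand(String tesseractPath, String inputImage,
                                     String outputBase, String language,
                                     String pageSegMode) {
        List<String> cmd = new ArrayList<>();
        cmd.add(tesseractPath);
        cmd.add(inputImage);
        cmd.add(outputBase);
        cmd.add("-l");
        cmd.add(language);
        // Two dashes: the single-dash -psm form was removed in Tesseract 4.0.
        cmd.add("--psm");
        cmd.add(pageSegMode);
        return cmd;
    }
}
```

The resulting list can be passed straight to java.lang.ProcessBuilder, for example new ProcessBuilder(PsmFlagSketch.buildCommand("tesseract", "page.png", "page", "eng", "1")).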