[ https://issues.apache.org/jira/browse/TIKA-3347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17764275#comment-17764275 ]
Tilman Hausherr commented on TIKA-3347:
---------------------------------------
Three files:
bug_trackers/poppler/poppler-58785-0.zip-7.pdf: Adobe doesn't handle this file well, so it doesn't matter.
commoncrawl3_refetched/HQ/HQXZGM6CGDEGMIWX5PDFEGN7MLPYWROP: might be a real difference; needs more investigation.
govdocs1/372/372582.pdf: it's true that 3.0 is losing a bit, but the file is also a mess with 2.0.29.
> Upgrade to PDFBox 3.x when available
> ------------------------------------
>
> Key: TIKA-3347
> URL: https://issues.apache.org/jira/browse/TIKA-3347
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Priority: Major
>
> 3.0.0-RC1 was recently released. We should integrate it on a dev branch asap
> so that we can help with regression testing...
--
This message was sent by Atlassian Jira
(v8.20.10#820010)