Ahh, makes sense. I did have a feeling I was barking up the wrong tree
since it's an Extraction issue, but I thought I'd throw it out there,
anyway.
Thanks so much for the information!
On Wed, Feb 17, 2016 at 4:49 PM, Rachel Lynn Underwood <
r.lynn.underw...@gmail.com> wrote:
This is an error being thrown by Apache PDFBox/Tika. You're seeing it now
because Solr 4.x uses a different Tika version than Solr 3.x.
It looks like this error is thrown when you parse a PDF with Tika, and a
font in that PDF doesn't have a ToUnicode mapping.
https://issues.apache.org/jira/browse/
Does literally nobody else see this error in their logs? I see this error
hundreds of times per day, in occasional bursts. Should I file this as a
bug?
On Mon, Feb 15, 2016 at 4:56 PM, Joseph Hagerty wrote:
After migrating from 3.5 to 4.10.3, I'm seeing the following error with
alarming regularity in the master's error log:
2/15/2016, 4:32:22 PM ERROR PDSimpleFont Can't determine the width of the
space character using 250 as default
I can't seem to glean much information about this one from the web.
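To put a number on "hundreds of times per day, in occasional bursts", here is a minimal sketch that tallies this message per hour. It assumes every log entry sits on one line and follows the shape of the single sample quoted in this thread; the function name count_per_hour and the log path passed on the command line are illustrative, not part of Solr, Tika, or PDFBox.

```python
import re
import sys
from collections import Counter
from datetime import datetime

# Assumed line shape, based only on the one sample in this thread:
#   2/15/2016, 4:32:22 PM ERROR PDSimpleFont Can't determine the width ...
LINE_RE = re.compile(
    r"^(\d{1,2}/\d{1,2}/\d{4}, \d{1,2}:\d{2}:\d{2} [AP]M) ERROR PDSimpleFont "
    r"Can't determine the width"
)


def count_per_hour(lines):
    """Return a Counter mapping 'YYYY-MM-DD HH:00' to the number of matches."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.match(line)
        if match is None:
            continue  # some other log entry
        stamp = datetime.strptime(match.group(1), "%m/%d/%Y, %I:%M:%S %p")
        counts[stamp.strftime("%Y-%m-%d %H:00")] += 1
    return counts


if __name__ == "__main__" and len(sys.argv) > 1:
    # Hypothetical usage: python count_errors.py /path/to/solr.log
    with open(sys.argv[1], encoding="utf-8", errors="replace") as handle:
        for hour, total in sorted(count_per_hour(handle).items()):
            print(hour, total)
```

Hours with unusually high totals would correspond to the bursts, which can then be lined up against when PDFs were being indexed.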