Hi, I'm facing the issue of that the Tesseract OCR is not able to extract the words in a PDF file in an attachment in EMLfile and index it into Solr occasionally? However, most of the time it can be extracted.
What could be the reason that causes the file in the email attachment to be failed to extracted using OCR? I'm using Solr 6.4.2. Regards, Edwin