Good day,

We solved the situation. Here is what was used and what was changed.

Our installation uses Tesseract 3.05, Tika 1.17, and SOLR 7.4. (We actually had Tika 1.17, not 1.18.)

1. Changed the output type from HOCR to TXT in the file parseContext.xml:
   <property name="outputType" value="TXT"/>
2. Had to start SOLR as the root user.

Tesseract 4.1.1 is not compatible with Tika 1.17, so we will upgrade SOLR to version 7.7 and Tika to version 1.19, and then try to install Tesseract 4.1.1.

<https://lucene.472066.n3.nabble.com/file/t495209/Capture.png>
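
For anyone wondering where the property from step 1 goes, here is a minimal sketch of a parseContext.xml. Only the <property> line is taken from our setup. The surrounding <entries>/<entry> wrapper and the TesseractOCRConfig class name are assumptions based on the usual layout for the Solr extracting request handler, so check them against your own file.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<!-- Sketch only: the wrapper elements and class name below are assumed,
     not copied from our installation. -->
<entries>
  <entry class="org.apache.tika.parser.ocr.TesseractOCRConfig"
         impl="org.apache.tika.parser.ocr.TesseractOCRConfig">
    <!-- This is the line we changed: plain text output instead of HOCR. -->
    <property name="outputType" value="TXT"/>
  </entry>
</entries>
```

The file is normally referenced from the extracting request handler in solrconfig.xml through the parseContext.config parameter.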