Re: Tesseract command-line OCR engine has stopped working

2016-02-10 Thread Jan Høydahl
You do not tell us much about how Solr is set up. I also found your Stack Overflow question at http://stackoverflow.com/questions/35220443/tesseract-command-line-ocr-engine-has-stopped-working, which includes a screenshot. That suggests that you have set up Tika with OCR for images, and emails with images are

Re: Tesseract command-line OCR engine has stopped working

2016-02-08 Thread Zheng Lin Edwin Yeo
Has anyone experienced this before during indexing of EML files?

Regards,
Edwin

On 5 February 2016 at 17:30, Zheng Lin Edwin Yeo wrote:
> Hi,
>
> I am indexing EML files (emails) into Solr, and some of those emails have
> attachments.
>
> During the indexing, I encountered this "*Tesseract comman