Fwd: configuring Solr with Tesseract

2017-11-03 Thread Admin eLawJournal
Hi, I have read that we can use tesseract with solr to index image files. I would like some guidance on setting this up. Currently, I am using solr for searching my wordpress installation via the WPSOLR plugin. I have Solr 6.6 installed on ubuntu 14.04 which is working fine with wordpress. I hav

Re: Fwd: configuring Solr with Tesseract

2017-11-06 Thread Admin eLawJournal
I also noticed that Tika 1.14 is capable of ocr by itself. I would be okay with a setup of solr using Tika 1.14 to ocr the PDF if that is possible. Best regards, Anand On Nov 6, 2017 5:05 PM, "Charlie Hull" wrote: On 03/11/2017 15:32, Admin eLawJournal wrote: > Hi, > I have

Re: Fwd: configuring Solr with Tesseract

2017-11-06 Thread Admin eLawJournal
R output in a DB or the filesystem. > Then you will want to be able to re-index Solr easily as you fine tune Solr. > > Yes, use Python or your preferred Scripting language. > Cheers -- Rick > > On November 6, 2017 4:05:42 AM EST, Charlie Hull > wrote: > >On 03/11/2017 15:32,