Hi, can you please point me out to "the discussion about how OCR can take
minutes of CPU per page", I really need to understand more the Tika OCR
behavior with solr.
--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html
Thanks Rick, minutes of CPU is definitely going to break my site. I'm
looking for someone to hire as I have no coding knowledge. Please let me
know if you are up for it.
On Mon, Nov 6, 2017 at 8:05 PM, Rick Leir wrote:
> Anand,
> As Charlie says you should have a separate process for this. Also,
Anand,
As Charlie says you should have a separate process for this. Also, if you go
back about ten months in this mailing list you will see some discussion about
how OCR can take minutes of CPU per page, and needs some preprocessing with
Imagemagick or Graphicsmagick. You will want to do some fi
Hi Charlie,
Thanks for the reply. You're right. I haven't got my hands dirty with solr
yet. I am not from an IT background and learnt everything I know through
lots of reading online. However, all the documentation on solr assumes that
the reader has advanced IT knowledge. In fact, it took me a we
On 03/11/2017 15:32, Admin eLawJournal wrote:
Hi,
I have read that we can use tesseract with solr to index image files. I
would like some guidance on setting this up.
Currently, I am using solr for searching my wordpress installation via the
WPSOLR plugin.
I have Solr 6.6 installed on ubuntu 14
Hi,
I have read that we can use tesseract with solr to index image files. I
would like some guidance on setting this up.
Currently, I am using solr for searching my wordpress installation via the
WPSOLR plugin.
I have Solr 6.6 installed on ubuntu 14.04 which is working fine with
wordpress.
I hav