Re: regarding Extracting text from Images

2019-10-26 Thread Edward Ribeiro
No. You should install tesseract-ocr on the same box as your Solr instance, and configure Solr so that the embedded Tika can use Tesseract to do the OCR of images. Best, Edward On Wed, 23 Oct 2019 at 20:08, suresh pendap wrote: > Hi Alex, > Thanks for your reply. How do we integrate te

Re: NRT vs TLOG bulk indexing performances

2019-10-26 Thread Erick Erickson
"I understand that while non leader TLOG is copying the index from leader, the leader stop indexing” This _better_ not be happening. If you can demonstrate this let’s open a JIRA. > On Oct 25, 2019, at 8:28 AM, Dominique Bejean > wrote: > > I understand that while non leader TLOG is copying th

Re: Dynamic facet limits using Solr

2019-10-26 Thread Erick Erickson
How are you getting the word counts? Do you have a field in the doc where you store it? If so, use either “interval facets” or “facet queries”; either one should give you what you want. Your particular example would also work with “range faceting”, since the buckets are identically sized. See:
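
A minimal sketch of how the three options named above could be expressed as Solr request parameters, assuming the word count is stored in a numeric field. The field name `word_count`, the bucket boundaries, and the helper function names are hypothetical and do not come from the thread; only the `facet.*` parameter names are standard Solr faceting parameters.

```python
from urllib.parse import urlencode

# Hypothetical numeric field holding each document's word count.
FIELD = "word_count"

# Parameters shared by all three variants: match everything, return no rows.
BASE = [("q", "*:*"), ("rows", "0"), ("facet", "true")]


def facet_query_params(field, buckets):
    """One facet.query per bucket; buckets may have arbitrary sizes."""
    params = list(BASE)
    for low, high in buckets:
        # Square brackets make both bounds inclusive.
        params.append(("facet.query", f"{field}:[{low} TO {high}]"))
    return params


def interval_facet_params(field, buckets):
    """Interval faceting: one facet.interval.set per bucket."""
    params = list(BASE) + [("facet.interval", field)]
    for low, high in buckets:
        params.append((f"f.{field}.facet.interval.set", f"[{low},{high}]"))
    return params


def range_facet_params(field, start, end, gap):
    """Range faceting: only suitable when every bucket has the same width."""
    return list(BASE) + [
        ("facet.range", field),
        (f"f.{field}.facet.range.start", str(start)),
        (f"f.{field}.facet.range.end", str(end)),
        (f"f.{field}.facet.range.gap", str(gap)),
    ]


if __name__ == "__main__":
    buckets = [(0, 99), (100, 199), (200, 299)]
    # Each line is a query string to append to a /select request.
    print(urlencode(facet_query_params(FIELD, buckets)))
    print(urlencode(interval_facet_params(FIELD, buckets)))
    print(urlencode(range_facet_params(FIELD, 0, 300, 100)))
```

Facet queries and interval facets accept uneven buckets, while range faceting needs only a start, an end, and a gap, which is why it fits the case of identically sized buckets.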