Re: SolrCloud scaling/optimization for high request rate

2018-10-27 Thread Deepak Goel
On Fri, Oct 26, 2018 at 9:25 PM Sofiya Strochyk wrote: > Hi everyone, > > We have a SolrCloud setup with the following configuration: > >- 4 nodes (3x128GB RAM Intel Xeon E5-1650v2, 1x64GB RAM Intel Xeon >E5-1650v2, 12 cores, with SSDs) >- One collection, 4 shards, each has only a sin

RE: Tesseract language

2018-10-27 Thread Martin Frank Hansen (MHQ)
Hi Rohan, Thanks for your reply, are you using tess4j with Tika or on its own? I will take a look at tess4j if I can't make it work with Tika alone. Best regards Martin -----Original Message----- From: Rohan Kasat Sent: 26 October 2018 21:45 To: solr-user@lucene.apache.org Subject: Re: Tess

Re: Tesseract language

2018-10-27 Thread Rohan Kasat
I used tess4j for image formats and Tika for scanned PDFs and images within PDFs. Regards, Rohan Kasat On Sat, Oct 27, 2018 at 12:39 AM Martin Frank Hansen (MHQ) wrote: > Hi Rohan, > > Thanks for your reply, are you using tess4j with Tika or on its own? I > will take a look at tess4j if I can'

How to get spellcheck results per field in Solr?

2018-10-27 Thread govind nitk
Hi, I have implemented suggestions using the suggest component, and the results are returned in this format: suggest: { "cityname_suggest": { }, "location_suggest": {}, "area_suggest":{} } where cityname_suggest, location_suggest, and area_suggest are different dictionary names. Now comparing this re

Re: Tesseract language

2018-10-27 Thread Tim Allison
Martin, Let’s move this over to user@tika. Rohan, Is there something about Tika’s use of tesseract for image files that can be improved? Best, Tim On Sat, Oct 27, 2018 at 3:40 AM Rohan Kasat wrote: > I used tess4j for image formats and Tika for scanned PDFs and images within > P