Re: Multiple Queries per request

2018-10-02 Thread Shamik Sinha
The Solr uses REST based calls which is done over http or https which cannot handle multiple requests at one shot. However what you can do is return all the necessary data at one shot and group them according to your needs. Thanks and regards, Shamik On 02-Oct-2018 8:11 PM, "Greenhorn Techie" wr

Re: Error when indexing with SolrJ HTTP ERROR 405

2018-03-19 Thread Shamik Sinha
You need to send binary content instead of html. Atleast that is what the error shows. I also think the url is wrong. The correct url should have http://localhost:8983/solr/core/update Check first whether indexing is working on the same data that you are trying to or not using the browser based

Re: Text in images are not extracted and indexed to content

2018-04-10 Thread Shamik Sinha
To index text in images the image needs to be searchable i. e. text needs to be overlayed on the image like a searchable pdf. You can do this using ocr but it is a bit unreliable if the images are scanned copies of written text. On 10-Apr-2018 4:12 PM, "Rahul Singh" wrote: May need to extract ou

Re: Regarding pdf indexing issue

2018-07-11 Thread Shamik Sinha
You may try to use tesseract tool to check data extraction from pdf or images and then go forward accordingly. As far as I understand the PDF is an image and not data. The searchable PDF actually overlays the selectable text as hidden text over the PDF image. These PDFs can be indexed and extracted