Re: Indexing PDF file in Apache SOLR via Apache TIKA

2018-10-30 Thread R Nair
I have done a production implementation of this, running for the last four months without any issue. It just needs a restart of all components every week. http://blog.cloudera.com/blog/2015/10/how-to-index-scanned-pdfs-at-scale-using-fewer-than-50-lines-of-code/ Best, Ravion On Tue, Oct 30, 2018, 1:00 PM Er

Re: Indexing documents from S3 bucket

2018-10-08 Thread R Nair
S3 gives listeners. So tap those listeners when objects are added, updated or deleted and use Solr API to push. That's high level, but I believe doable. I worked on Minio, an open source object storage supporting S3 and could do this because Minio gives me good and stable listeners. Best, Ravion
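
The listener idea above can be sketched roughly as follows. This is only an illustration under assumptions, not the poster's actual implementation: the class name `BucketEventToSolr`, the choice of `bucket/key` as the Solr document id, and the `bucket_s` field are all hypothetical. It maps a bucket event name to the JSON command that Solr's `/update` handler accepts (`add` for created or overwritten objects, `delete` for removed ones); receiving the event and sending the command are left out.

```java
public class BucketEventToSolr {

    // Wraps a value in double quotes and escapes the characters
    // that are not allowed to appear raw inside a JSON string.
    static String jsonString(String s) {
        StringBuilder sb = new StringBuilder("\"");
        for (char c : s.toCharArray()) {
            if (c == '"' || c == '\\') {
                sb.append('\\').append(c);
            } else if (c < 0x20) {
                sb.append(String.format("\\u%04x", (int) c));
            } else {
                sb.append(c);
            }
        }
        return sb.append('"').toString();
    }

    // Translates one bucket event into a Solr JSON update command.
    // Assumption: event names contain "ObjectCreated" or "ObjectRemoved",
    // and an overwrite of an existing object arrives as a "created" event.
    static String toSolrCommand(String eventName, String bucket, String key) {
        String id = bucket + "/" + key; // hypothetical id scheme
        if (eventName.contains("ObjectRemoved")) {
            return "{\"delete\":{\"id\":" + jsonString(id) + "}}";
        }
        if (eventName.contains("ObjectCreated")) {
            // "bucket_s" is a hypothetical field name, not from the thread.
            return "{\"add\":{\"doc\":{\"id\":" + jsonString(id)
                    + ",\"bucket_s\":" + jsonString(bucket) + "}}}";
        }
        throw new IllegalArgumentException("Unhandled event: " + eventName);
    }

    public static void main(String[] args) {
        System.out.println(toSolrCommand("s3:ObjectCreated:Put", "docs", "report.pdf"));
        System.out.println(toSolrCommand("s3:ObjectRemoved:Delete", "docs", "report.pdf"));
    }
}
```

Each resulting string would be POSTed to the collection's `/update` handler with `Content-Type: application/json`. A real indexer would also fetch the object and extract its text before adding it.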

Re: SolrJ: build a SolrClient(-connection) with HttpClientUtil

2018-08-13 Thread R Nair
Or HttpSolrClient.Builder as well On Mon, Aug 13, 2018, 2:50 PM ☼ R Nair wrote: > Pls use CloudSolrClient.Builder(...).withHttpClient(...). > Best, Ravion > > On Mon, Aug 13, 2018, 2:40 PM Clemens Wyss DEV > wrote: > >> What is the proposed way to get/build a Sol

Re: SolrJ: build a SolrClient(-connection) with HttpClientUtil

2018-08-13 Thread R Nair
Pls use CloudSolrClient.Builder(...).withHttpClient(...). Best, Ravion On Mon, Aug 13, 2018, 2:40 PM Clemens Wyss DEV wrote: > What is the proposed way to get/build a SolrClient(-connection) via > HttpClientUtil > - respecting a given connection and response (socket) timeout > (PROP_SO_TIMEOUT, P

Re: 4 days and no solution - please help on Solr

2018-08-10 Thread R Nair
Updating a document that does not > exist... will add it. > > -chris > > On 8/10/18 3:01 PM, ☼ R Nair wrote: > > Do you feel that this is only partially complete? > > > > Best, Ravion > > > > On Fri, Aug 10, 2018, 1:37 PM ☼ R Nair > wrote:

Re: 4 days and no solution - please help on Solr

2018-08-10 Thread R Nair
Do you feel that this is only partially complete? Best, Ravion On Fri, Aug 10, 2018, 1:37 PM ☼ R Nair wrote: > I saw this. Please provide an example for add. My issue is with add. There is no > "AddRequest". So how do I do that? Thanks. > > Best, Ravion > > On Fri, Aug 10, 20

Re: 4 days and no solution - please help on Solr

2018-08-10 Thread R Nair
> final List<SolrInputDocument> docsToIndex = new ArrayList<>();
> ...Prepare your docs for indexing
> final UpdateRequest update = new UpdateRequest();
> update.add(docsToIndex);
> update.setBasicAuthCredentials("solr", "solrRocks");
> update.process(client, "te
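
For comparison, here is a rough JDK-only sketch of what an authenticated add looks like at the HTTP level. It is not the SolrJ approach quoted above. It builds, without sending, a POST to a collection's `/update` handler with a Basic `Authorization` header. The class name `SolrBasicAuthUpdate`, the base URL, the collection name `demo`, and the sample document are assumptions for illustration; only the `solr` / `solrRocks` credentials come from the quoted snippet. It needs Java 11+ for `java.net.http`.

```java
import java.net.URI;
import java.net.http.HttpRequest;
import java.nio.charset.StandardCharsets;
import java.time.Duration;
import java.util.Base64;

public class SolrBasicAuthUpdate {

    // Builds the value of the HTTP "Authorization" header for Basic auth:
    // "Basic " followed by base64("user:password").
    static String basicAuthHeader(String user, String password) {
        String token = Base64.getEncoder()
                .encodeToString((user + ":" + password).getBytes(StandardCharsets.UTF_8));
        return "Basic " + token;
    }

    // Builds (but does not send) a POST of JSON documents to the
    // collection's /update handler, asking Solr to commit afterwards.
    static HttpRequest buildUpdateRequest(String baseUrl, String collection,
                                          String jsonDocs, String user, String password) {
        return HttpRequest.newBuilder()
                .uri(URI.create(baseUrl + "/" + collection + "/update?commit=true"))
                .timeout(Duration.ofSeconds(30)) // hypothetical response timeout
                .header("Content-Type", "application/json")
                .header("Authorization", basicAuthHeader(user, password))
                .POST(HttpRequest.BodyPublishers.ofString(jsonDocs))
                .build();
    }

    public static void main(String[] args) {
        // Hypothetical URL, collection and document; nothing is sent here.
        HttpRequest request = buildUpdateRequest(
                "http://localhost:8983/solr", "demo",
                "[{\"id\":\"1\",\"title\":\"hello\"}]", "solr", "solrRocks");
        System.out.println(request.method() + " " + request.uri());
    }
}
```

Sending the request would be a call to `HttpClient.newHttpClient().send(request, HttpResponse.BodyHandlers.ofString())`. As the quoted reply notes, adding a document whose id already exists replaces it, so the same call covers both add and update.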

Re: 4 days and no solution - please help on Solr

2018-08-10 Thread R Nair
/password from Java system properties, and is used to > > > configure the HttpClient that SolrJ creates internally for sending > > > requests. I find this second method a little more complex, and it > > > looks like you're providing your own HttpClient anyways, so for b

4 days and no solution - please help on Solr

2018-08-09 Thread R Nair
Dear all, I have tried my best to do it and searched all over Google, but I am unsuccessful. Kindly help. We have a Solr environment. It's secured with a userid and password. I used the CloudSolrClient.Builder(solrURLs).withHttpClient(mycloseablehttpclient) method to access it. The url is of the form http:/