Re: Async exceptions during distributed update

2018-05-13 Thread Jay Potharaju
Hi, I restarted both my solr servers but I am seeing the async error again. In older 5x version of solrcloud, solr would normally recover gracefully in case of network errors, but solr 6.6.3 does not seem to be doing that. At this time I am not doing only a small percentage of deletebyquery operat

Multi threading indexing

2018-05-13 Thread Raymond Xie
Hello, I have a huge amount of data (TB level) to be indexed, I am wondering if anyone can share your idea/code to do the multithreading indexing? ** *Sincerely yours,* *Raymond*

Re: Solr 6.6.3 won't start - unrecognized vm option "UseParNewGC" - Java 10.0.1

2018-05-13 Thread turpentyne
I lucked out and found a forum note at Drupal.org regarding a line of code I can change in the Drupal 8 Solr Search API Schema.xml, and 7.3 will work, despite their module not technically being set up to work with any 7.x versions. I'll stick with that and hope for the best since I'm just testing

Re: Solr 6.6.3 won't start - unrecognized vm option "UseParNewGC" - Java 10.0.1

2018-05-13 Thread Yasufumi Mizoguchi
Instead of setting JAVA_HOME variables, if you want to use Java8 only with Solr, you can use SOLR_JAVA_HOME variable in Solr's bin/solr.in.sh script( or bin\solr.in.cmd if you use Windows.) e.g. ) SOLR_JAVA_HOME="/home/ubuntu/jdk1.8.0_171" Regards, Yasufumi 2018年5月13日(日) 11:17 Alexandre Rafalovi

Re: Apache SOLR Design Query

2018-05-13 Thread Rahul Singh
This is a good start. Few things to consider. 1. Extract the contents via Tika externally or via Tika Server. 2. Create a canonical “Item” document schema which would have title, metadata, contents, imagePreview (something to consider) , etc. 3. Use the extracted Tika data to populate your index.