Hi,
I restarted both my solr servers but I am seeing the async error again. In
older 5x version of solrcloud, solr would normally recover gracefully in
case of network errors, but solr 6.6.3 does not seem to be doing that. At
this time I am not doing only a small percentage of deletebyquery
operat
Hello,
I have a huge amount of data (TB level) to be indexed, I am wondering if
anyone can share your idea/code to do the multithreading indexing?
**
*Sincerely yours,*
*Raymond*
I lucked out and found a forum note at Drupal.org regarding a line of code
I can change in the Drupal 8 Solr Search API Schema.xml, and 7.3 will work,
despite their module not technically being set up to work with any 7.x
versions.
I'll stick with that and hope for the best since I'm just testing
Instead of setting JAVA_HOME variables, if you want to use Java8 only with
Solr, you can
use SOLR_JAVA_HOME variable in Solr's bin/solr.in.sh script( or
bin\solr.in.cmd if you use Windows.)
e.g. )
SOLR_JAVA_HOME="/home/ubuntu/jdk1.8.0_171"
Regards,
Yasufumi
2018年5月13日(日) 11:17 Alexandre Rafalovi
This is a good start. Few things to consider.
1. Extract the contents via Tika externally or via Tika Server.
2. Create a canonical “Item” document schema which would have title, metadata,
contents, imagePreview (something to consider) , etc.
3. Use the extracted Tika data to populate your index.