Hi all,
As a bit of background, we're trying to run a map-reduce job on a Hadoop
cluster (CDH version 5.4.5) which involves writing from Solr during both
the Map phase. To accomplish
this, we are using the Solrj library with version 4.10.3-cdh5.4.5. In the
driver class which launches the MR Job, w
Hello,
I am indexing lots of big documents with SolrCloud in a map-reduce job:
so every day it is 1 - 2 documents (avg: 8 MB, max: 100 MB, total: ~100
GB). This is done in 20 minutes. We have 5 nodes, and the Solr server is
launched with 20 GB of RAM (and G1 GC). We add in parallel around 200
S