from:"Bertrand Venzal"

Solr Cloud: Indexing in a Map reduce Job with Kerberos

2015-09-24 Thread Bertrand Venzal

Hi all, As a bit of background, we're trying to run a map-reduce job on a Hadoop cluster (CDH version 5.4.5) which involved writing from Solr during both the Map phase. To accomplish this, we are using the Solrj library with version 4.10.3-cdh5.4.5. In the driver class which launch the MR Job, w

Solr Cloud: Massive indexing

2015-09-08 Thread Bertrand Venzal

Hello, I am indexing lots of big documents thanks to Solr Cloud in a map reduce job: so every day it is 1 - 2 documents (avg:8Mb, max 100Mb, total ~ 100 Gb). This is done is 20 minutes. We have 5 nodes, Solr server is launched with 20 Gb of Ram (and GC1). We add in parallel around 200 S