Hi Sisheel! We are using Solr 6.5.
We've already looked at Issue 7333, but none of the params seem to change the behaviour. Also, I'm not sure having more parallelism will improve performance since the problem seems to be related to replication. It looks like the writes need to get to all the replicas before the indexing can continue with the next batch Isart On Tue, Jun 6, 2017 at 2:31 PM, Susheel Kumar <susheel2...@gmail.com> wrote: > Which version of Solr are you using. See > > https://lucidworks.com/2015/06/10/indexing-performance-solr- > 5-2-now-twice-fast/ > > > https://issues.apache.org/jira/browse/SOLR-7333 > > Also would suggest to index using SolrJ with parallelism (multiple threads > and/or machines) to increase indexing thru-put further. > > On Tue, Jun 6, 2017 at 4:51 AM, Isart Montane <isart.mont...@gmail.com> > wrote: > > > Hello, > > > > We are using SolrCloud with 5 nodes, 2 collections, 2 shards each. The > > problem we are seeing is a huge drop on writes when the number of > replicas > > increase. > > > > When we index (using DIH and batches) a collection with no replicas, we > are > > able to index at 1800 inserts/sec. That number decreases to 1200 with 1 > > replica, 800 with 2 replicas and 400 with 3 replicas and it keeps getting > > worst when more replicas are added. > > > > We've been reading about it and it seems that the `replicationFactor` > plays > > a big role on that, but we've got it set to 1, so I'm not sure why it > keeps > > decreasing when more replicas are added. In fact, we don't need the data > to > > be replicated in real time (we can even afford minutes of delay), but > I've > > been unable to find how to tune that. > > > > Has anyone experienced a similar behaviour? is there any way to increase > > the indexing performance when using SolrCloud? > > > > We've seen posts about people having +100 replicas, so my feeling is that > > there's something to tune that we are not doing. > > > > Thanks > > > > > > Isart Montane Mogas > > >