There is some indexing performance penalty is by adding a replica but then onwards adding more replica(s) shouldn't have much difference. So something very wrong. Can you watch your logs when indexing happening and see what's going when gets replicated etc.
Thanks, Susheel On Tue, Jun 6, 2017 at 9:28 AM, Isart Montane <isart.mont...@gmail.com> wrote: > Hi Sisheel! > > We are using Solr 6.5. > > We've already looked at Issue 7333, but none of the params seem to change > the behaviour. > > Also, I'm not sure having more parallelism will improve performance since > the problem seems to be related to replication. It looks like the writes > need to get to all the replicas before the indexing can continue with the > next batch > > Isart > > > > On Tue, Jun 6, 2017 at 2:31 PM, Susheel Kumar <susheel2...@gmail.com> > wrote: > > > Which version of Solr are you using. See > > > > https://lucidworks.com/2015/06/10/indexing-performance-solr- > > 5-2-now-twice-fast/ > > > > > > https://issues.apache.org/jira/browse/SOLR-7333 > > > > Also would suggest to index using SolrJ with parallelism (multiple > threads > > and/or machines) to increase indexing thru-put further. > > > > On Tue, Jun 6, 2017 at 4:51 AM, Isart Montane <isart.mont...@gmail.com> > > wrote: > > > > > Hello, > > > > > > We are using SolrCloud with 5 nodes, 2 collections, 2 shards each. The > > > problem we are seeing is a huge drop on writes when the number of > > replicas > > > increase. > > > > > > When we index (using DIH and batches) a collection with no replicas, we > > are > > > able to index at 1800 inserts/sec. That number decreases to 1200 with 1 > > > replica, 800 with 2 replicas and 400 with 3 replicas and it keeps > getting > > > worst when more replicas are added. > > > > > > We've been reading about it and it seems that the `replicationFactor` > > plays > > > a big role on that, but we've got it set to 1, so I'm not sure why it > > keeps > > > decreasing when more replicas are added. In fact, we don't need the > data > > to > > > be replicated in real time (we can even afford minutes of delay), but > > I've > > > been unable to find how to tune that. > > > > > > Has anyone experienced a similar behaviour? is there any way to > increase > > > the indexing performance when using SolrCloud? > > > > > > We've seen posts about people having +100 replicas, so my feeling is > that > > > there's something to tune that we are not doing. > > > > > > Thanks > > > > > > > > > Isart Montane Mogas > > > > > >