Re: Slow inserting with SolrCloud when increasing replicas

2017-06-07 Thread Susheel Kumar
Does 50K batch size is what are you using to ingest into Solr? If that's the case it may be too high and you may want to start with 100-1000 batch size depending on your document size and gradually increase until it starts degrading the performance. On Wed, Jun 7, 2017 at 5:51 AM, Isart Montane

Re: Slow inserting with SolrCloud when increasing replicas

2017-06-07 Thread Isart Montane
Hi, The cluster is running on EC2 using 5x r3.xlarge instances and disks are 1TB gp2 EBS. I will try to get the logs that Susheel requested but it's not an easy task. When indexing there's very few IO. Solr is started with the following flags: ``` /usr/lib/jvm/java-8-oracle/bin/java -server

Re: Slow inserting with SolrCloud when increasing replicas

2017-06-07 Thread Toke Eskildsen
On Tue, 2017-06-06 at 10:51 +0200, Isart Montane wrote: > We are using SolrCloud with 5 nodes, 2 collections, 2 shards each. > The problem we are seeing is a huge drop on writes when the number of > replicas increase. > > When we index (using DIH and batches) a collection with no replicas, > we ar

Re: Slow inserting with SolrCloud when increasing replicas

2017-06-06 Thread Susheel Kumar
There is some indexing performance penalty is by adding a replica but then onwards adding more replica(s) shouldn't have much difference. So something very wrong. Can you watch your logs when indexing happening and see what's going when gets replicated etc. Thanks, Susheel On Tue, Jun 6, 2017 a

Re: Slow inserting with SolrCloud when increasing replicas

2017-06-06 Thread Isart Montane
Hi Sisheel! We are using Solr 6.5. We've already looked at Issue 7333, but none of the params seem to change the behaviour. Also, I'm not sure having more parallelism will improve performance since the problem seems to be related to replication. It looks like the writes need to get to all the re

Re: Slow inserting with SolrCloud when increasing replicas

2017-06-06 Thread Susheel Kumar
Which version of Solr are you using. See https://lucidworks.com/2015/06/10/indexing-performance-solr-5-2-now-twice-fast/ https://issues.apache.org/jira/browse/SOLR-7333 Also would suggest to index using SolrJ with parallelism (multiple threads and/or machines) to increase indexing thru-put furt

Slow inserting with SolrCloud when increasing replicas

2017-06-06 Thread Isart Montane
Hello, We are using SolrCloud with 5 nodes, 2 collections, 2 shards each. The problem we are seeing is a huge drop on writes when the number of replicas increase. When we index (using DIH and batches) a collection with no replicas, we are able to index at 1800 inserts/sec. That number decreases t