Hi Sisheel!

We are using Solr 6.5.

We've already looked at Issue 7333, but none of the params seem to change
the behaviour.

Also, I'm not sure having more parallelism will improve performance since
the problem seems to be related to replication. It looks like the writes
need to get to all the replicas before the indexing can continue with the
next batch

Isart



On Tue, Jun 6, 2017 at 2:31 PM, Susheel Kumar <susheel2...@gmail.com> wrote:

> Which version of Solr are you using. See
>
> https://lucidworks.com/2015/06/10/indexing-performance-solr-
> 5-2-now-twice-fast/
>
>
> https://issues.apache.org/jira/browse/SOLR-7333
>
> Also would suggest to index using SolrJ with parallelism (multiple threads
> and/or machines) to increase indexing thru-put further.
>
> On Tue, Jun 6, 2017 at 4:51 AM, Isart Montane <isart.mont...@gmail.com>
> wrote:
>
> > Hello,
> >
> > We are using SolrCloud with 5 nodes, 2 collections, 2 shards each. The
> > problem we are seeing is a huge drop on writes when the number of
> replicas
> > increase.
> >
> > When we index (using DIH and batches) a collection with no replicas, we
> are
> > able to index at 1800 inserts/sec. That number decreases to 1200 with 1
> > replica, 800 with 2 replicas and 400 with 3 replicas and it keeps getting
> > worst when more replicas are added.
> >
> > We've been reading about it and it seems that the `replicationFactor`
> plays
> > a big role on that, but we've got it set to 1, so I'm not sure why it
> keeps
> > decreasing when more replicas are added. In fact, we don't need the data
> to
> > be replicated in real time (we can even afford minutes of delay), but
> I've
> > been unable to find how to tune that.
> >
> > Has anyone  experienced a similar behaviour? is there any way to increase
> > the indexing performance when using SolrCloud?
> >
> > We've seen posts about people having +100 replicas, so my feeling is that
> > there's something to tune that we are not doing.
> >
> > Thanks
> >
> >
> > Isart Montane Mogas
> >
>

Reply via email to