On 3/10/2018 9:44 AM, Erick Erickson wrote:
> There are quite a number of reasons you may be seeing this, all having
> to do with trying to put too much stuff in too little hardware.
> <snip>
> At any rate, there's no a-priori limit to the number of
> collections/replicas/whatever that Solr can deal with, the limits are
> your hardware.
Big +1 to everything Erick said. I was working on a response, but I
think the things that Erick said are better than what I was writing. I
did want to add a little bit of detail, though.
Let me throw some numbers at you:
With 900 collections that have 25 shards, the SolrCloud cluster is
handling 22500 individual indexes -- and that's for only one replica.
If I assume that you've only got one copy of all that data in Solr, then
every one of those 49 servers will average over 450 index cores! If you
planned for redundancy and replicationFactor is 2, then each server will
have over 900 index cores on it (45000 total indexes).
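If you want to see that arithmetic spelled out, here's a quick
back-of-the-envelope calculation (plain Python, nothing Solr-specific,
using the numbers from your description):

  collections = 900
  shards_per_collection = 25
  servers = 49

  for replication_factor in (1, 2):
      total_cores = collections * shards_per_collection * replication_factor
      per_server = total_cores / servers
      print(f"replicationFactor={replication_factor}: "
            f"{total_cores} cores total, ~{per_server:.0f} per server")

  # replicationFactor=1: 22500 cores total, ~459 per server
  # replicationFactor=2: 45000 cores total, ~918 per server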
The per-server core counts might look like small numbers to you ... but
once you start to understand some of what Solr and Lucene are doing
under the covers, you realize that those numbers are quite large.
If there's even a hint of significant index/query traffic going to those
collections, then those servers are going to be hard-pressed to keep up,
unless you've spent a LOT more money for hardware than what I see in a
typical server.
Have you ever heard of a phenomenon known as a performance knee? The
following link should open up to page 482 of the associated book.
Scroll up to page 481 and read the two sections named "Hardware
Resource Requirements" and "Peak Load Behavior". Your reading would end
partway through page 482.
https://books.google.com/books?id=ka4QO9kXQFUC&pg=PA482&lpg=PA482&dq=performance+knee+scalability+graph&source=bl&ots=ytNn04PMU8&sig=r8GAGLIaT4mDwjCjYs_waQJbqo8&hl=en&sa=X&ved=0ahUKEwjH6bSdq-HZAhUJslMKHeQhALQQ6AEINDAB#v=onepage&q=performance%20knee%20scalability%20graph&f=false
Computer systems and software commonly show response times that grow
more or less linearly as load increases, until the system hits a
breaking point where even the smallest additional load causes an
enormous jump in response times. That can lead to an outage, where
things take so long that clients give up and go into an error state.
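If you want a toy illustration of that knee, here's a sketch using the
textbook M/M/1 queueing formula (mean response time = 1 / (service rate
- arrival rate)). The 100-requests-per-second capacity is an arbitrary
assumption for illustration, not a Solr measurement:

  service_rate = 100.0  # requests/sec the system can handle (assumed)

  for load in (50.0, 80.0, 90.0, 95.0, 99.0):
      # Mean response time for an M/M/1 queue, converted to milliseconds.
      response_ms = 1000.0 / (service_rate - load)
      print(f"{load:.0f} req/sec -> ~{response_ms:.0f} ms average response")

  # 50 req/sec -> ~20 ms, 90 -> ~100 ms, 99 -> ~1000 ms: response time
  # grows slowly at first, then explodes as load approaches capacity.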
FYI: You're running a "dot zero" release. 6.0 was the first release to
include the MAJOR development changes that had been accumulating and
were never part of any 5.x version. The 6.x timeframe was an absolute
hotbed of innovation on SolrCloud.
There are typically more bugs in a .0 release than in later minor
releases. If there are reasons for you to avoid running 7.x releases,
then you should upgrade to the latest 6.x. Right now that is 6.6.2,
but the 6.6.3 release is in progress and is likely to be announced any
day now.
SolrCloud does not handle large numbers of collections very well,
especially if the shard and/or replica counts are high for each
collection. Up to a point, throwing a LOT of hardware at the problem
helps, but there are scalability issues related to the way SolrCloud
uses the ZooKeeper database. Eventually it just can't handle it. See
this issue for some experiments that I conducted on the problem:
https://issues.apache.org/jira/browse/SOLR-7191
Thanks,
Shawn