On 10/26/2018 9:55 AM, Sofiya Strochyk wrote:
> We have a SolrCloud setup with the following configuration:
I'm late to this party. You've gotten some good replies already. I
hope I can add something useful.
> * 4 nodes (3x128GB RAM Intel Xeon E5-1650v2, 1x64GB RAM Intel Xeon
>   E5-1650v2, 12 cores, with SSDs)
> * One collection, 4 shards, each has only a single replica (so 4
>   replicas in total), using compositeId router
> * Total index size is about 150M documents/320GB, so about 40M/80GB
>   per node
With 80GB of index data on a node that has only 64GB of memory, that
node's shard cannot be fully cached. Assuming nothing besides Solr runs
on these servers and the total Java heap on each system is 8GB, that
leaves roughly 56GB of memory to cache 80GB of index data. Performance
might be good, or it might be terrible; there's no way to predict it
reliably.
> * Heap size is set to 8GB.
I'm not sure that an 8GB heap is large enough, especially given what
you said later about experiencing OOM and seeing a lot of full GCs.
If properly tuned, the G1 collector is overall more efficient than CMS,
but CMS can be quite good. If GC is not working well with CMS, chances
are that switching to G1 will not help. The root problem is likely to
be something that a different collector can't fix -- like the heap being
too small.
I wrote the page you referenced for GC tuning. I have *never* had a
single problem using G1 with Solr.
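
If you do try G1, the place for the settings is the GC_TUNE variable
in solr.in.sh. This is only a sketch of the kind of flags involved,
not necessarily the exact values from that wiki page, and the region
size and pause target have to be tuned against your own GC logs:

  # solr.in.sh -- illustrative G1 settings, test before relying on them
  GC_TUNE="-XX:+UseG1GC \
    -XX:+ParallelRefProcEnabled \
    -XX:G1HeapRegionSize=8m \
    -XX:MaxGCPauseMillis=250 \
    -XX:+AlwaysPreTouch"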
> Target query rate is up to 500 qps, maybe 300, and we need to keep
> response time at <200ms. But at the moment we only see very good
> search performance with up to 100 requests per second. Whenever it
> grows to about 200, average response time abruptly increases to 0.5-1
> second. (Also it seems that request rate reported by SOLR in admin
> metrics is 2x higher than the real one, because for every query, every
> shard receives 2 requests: one to obtain IDs and second one to get
> data by IDs; so target rate for SOLR metrics would be 1000 qps).
Getting 100 requests per second on a single replica is quite good,
especially with a sharded index. I never could get performance like
that. To handle hundreds of requests per second, you need several replicas.
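
Once the extra hardware is registered in the cluster, adding a replica
is a single Collections API call. A hypothetical example (collection,
shard, and node names are placeholders):

  http://anyhost:8983/solr/admin/collections?action=ADDREPLICA
      &collection=mycollection&shard=shard1&node=newhost:8983_solr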
If you can reduce the number of shards, the amount of work involved for
a single request will decrease, which MIGHT increase the queries per
second your hardware can handle. With four shards, one query is
typically nine requests: the original request from the client, plus two
distributed requests to each of the four shards (one to get the
matching IDs, one to fetch the documents).
Unless your clients are all Java-based, you also need a load balancer
to avoid a single point of failure. (The ZooKeeper-aware Java client,
CloudSolrClient, talks to the entire SolrCloud cluster and doesn't need
a load balancer.)
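
A minimal SolrJ sketch of that client, with placeholder ZooKeeper
addresses and collection name:

  import java.util.Arrays;
  import java.util.Optional;
  import org.apache.solr.client.solrj.SolrQuery;
  import org.apache.solr.client.solrj.impl.CloudSolrClient;
  import org.apache.solr.client.solrj.response.QueryResponse;

  public class CloudQueryExample {
    public static void main(String[] args) throws Exception {
      // ZooKeeper ensemble for the SolrCloud cluster (placeholder hosts).
      try (CloudSolrClient client = new CloudSolrClient.Builder(
          Arrays.asList("zk1:2181", "zk2:2181", "zk3:2181"),
          Optional.empty()).build()) {
        client.setDefaultCollection("mycollection");
        // The client discovers live nodes from ZooKeeper and routes
        // requests itself, so no external load balancer is needed.
        QueryResponse rsp = client.query(new SolrQuery("*:*"));
        System.out.println("numFound: " + rsp.getResults().getNumFound());
      }
    }
  }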
What you are seeing, a sharp drop in performance from a relatively
modest increase in load, is VERY common. This is the way almost all
software systems behave when faced with extreme load.
Search for "knee" on this page:
https://www.oreilly.com/library/view/the-art-of/9780596155858/ch04.html
> During high request load, CPU usage increases dramatically on the SOLR
> nodes. It doesn't reach 100% but averages at 50-70% on 3 servers and
> about 93% on 1 server (random server each time, not the smallest one).
Very likely the node with the higher load is the one aggregating the
shard responses into the full result.
> The documentation mentions replication to spread the load between the
> servers. We tested replicating to smaller servers (32GB RAM, Intel
> Core i7-4770). However, when we tested it, the replicas were going out
> of sync all the time (possibly during commits) and reported errors
> like "PeerSync Recovery was not successful - trying replication." Then
> they proceed with replication which takes hours and the leader handles
> all requests singlehandedly during that time. Also both leaders and
> replicas started encountering OOM errors (heap space) for unknown reason.
With only 32GB of memory, and 8GB of that allocated to the heap, there's
only about 24GB left to cache the 80GB of index data. That's not enough,
and performance would be MUCH worse than on your 64GB or 128GB machines.
I would suspect extreme GC pauses and/or general performance issues from
not enough cache memory to be the root cause of the sync and recovery
problems.
> Heap dump analysis shows that most of the memory is consumed by [J
> (array of long) type, my best guess would be that it is "_version_"
> field, but it's still unclear why it happens.
I'm not familiar enough with how Lucene allocates memory internally to
have any hope of telling you exactly what that memory structure is.
> Also, even though with replication request rate and CPU usage drop 2
> times, it doesn't seem to affect mean_ms, stddev_ms or p95_ms numbers
> (p75_ms is much smaller on nodes with replication, but still not as
> low as under load of <100 requests/s).
> Garbage collection is much more active during high load as well. Full
> GC happens almost exclusively during those times. We have tried tuning
> GC options like suggested here
> <https://wiki.apache.org/solr/ShawnHeisey#CMS_.28ConcurrentMarkSweep.29_Collector>
> and it didn't change things though.
Symptoms like that generally mean that your heap is too small and needs
to be increased.
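
The heap size itself is set with SOLR_HEAP in solr.in.sh. The value
below is only there to show the mechanism; the right number for your
index has to come from testing and from watching the GC logs:

  # solr.in.sh -- example value, not a recommendation
  SOLR_HEAP="16g"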
> * How do we increase throughput? Is replication the only solution?
Ensuring there's enough memory for caching is the first step. But that
can only take you so far. Dealing with the very high query rate you've
got will require multiple replicas.
> * if yes - then why doesn't it affect response times, considering
>   that CPU is not 100% used and index fits into memory?
Hard to say without an in-depth look. See the end of my reply.
> * How to deal with OOM and replicas going into recovery?
There are precisely two ways to deal with OOM. One is to increase the
size of the resource that's depleted. The other is to change things so
that the program doesn't require as much of that resource. The second
option is frequently not possible.
> * Is memory or CPU the main problem? (When searching on the
>   internet, i never see CPU as main bottleneck for SOLR, but our
>   case might be different)
Most Solr performance issues are memory related. With an extreme query
rate, CPU can also be a bottleneck, but memory will almost always be the
bottleneck you run into first.
> * Or do we need smaller shards? Could segments merging be a problem?
Smaller shards really won't make much difference in segment merging,
unless the size reduction is *EXTREME* -- switching to a VERY large
number of shards.
If you increase the numbers in your merge policy, then merging will
happen less frequently. The config that I chose to use was 35 for
maxMergeAtOnce and segmentsPerTier, with 105 for
maxMergeAtOnceExplicit. The disadvantage to this is that your indexes
will have a LOT more files in them, so it's much easier to run into an
open file limit in the OS.
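
For reference, those settings go in the <indexConfig> section of
solrconfig.xml, something like this (the numbers are the ones I
mentioned above; treat them as a starting point, not a recommendation
for your index):

  <indexConfig>
    <mergePolicyFactory class="org.apache.solr.index.TieredMergePolicyFactory">
      <int name="maxMergeAtOnce">35</int>
      <int name="segmentsPerTier">35</int>
      <int name="maxMergeAtOnceExplicit">105</int>
    </mergePolicyFactory>
  </indexConfig>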
> * How to add faceting without search queries slowing down too much?
As Erick said ... this isn't possible. Handling the query load you've
mentioned *with* facets will require even more replicas. Facets require
more heap memory, more CPU resources, and are likely to access more of
the index data -- which means having plenty of cache memory is even more
important.
> * How to diagnose these problems and narrow down to the real reason
>   in hardware or setup?
A lot of useful information can be obtained from the GC logs that Solr's
built-in scripting creates. Can you share these logs?
The screenshots described here can also be very useful for troubleshooting:
https://wiki.apache.org/solr/SolrPerformanceProblems#Asking_for_help_on_a_memory.2Fperformance_issue
Thanks,
Shawn