On 10/29/2018 7:24 AM, Sofiya Strochyk wrote:
Actually the smallest server doesn't look bad in terms of performance;
it has been consistently better than the other ones (without
replication), which seems a bit strange (it should be about the same or
slightly worse, right?). I guess the memory being smaller than the
index doesn't cause problems because we use SSDs.
SSD, while fast, is nowhere near as fast as main memory. As I said, the
memory numbers might cause performance problems, or they might not.
Glad you're in the latter category.
What if we send requests to a machine which is part of the cluster
but doesn't host any shards? Does it handle the initial request and
the merging of results, or does this have to be handled by one of
the shards anyway?
Also I was thinking "more shards -> each shard searches a smaller set of
documents -> search is faster". Or is the overhead of merging results
bigger than the overhead of searching a larger set of documents?
If every shard is on its own machine, many shards might not be a
performance bottleneck, even with a high query rate. But the more shards
you have, the more work the machine doing the aggregation must do to
produce results.
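To make the fan-out concrete, here is a sketch of what an explicit distributed request looks like. The hostnames and core names below are hypothetical; `shards` is a real Solr request parameter. The node that receives such a request sends a sub-request to every listed shard and then merges the per-shard results into one response:

```shell
# Hypothetical hosts/cores -- the node receiving this request must
# query both shards and merge their results before responding.
SHARDS="host1:8983/solr/coll_shard1,host2:8983/solr/coll_shard2"
URL="http://host1:8983/solr/coll/select?q=*:*&shards=${SHARDS}"
echo "$URL"
```

With SolrCloud you would normally just query the collection and let Solr build this list itself, but the explicit form shows the work the aggregating node is doing under the hood.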
SolrCloud complicates the situation further. It normally does load
balancing of all requests that come in across the cloud. So the machine
handling the request might not be the machine where you SENT the request.
Very likely the one with a higher load is the one that is aggregating
shard requests for a full result.
Is there a way to confirm this? Maybe the aggregating shard is going
to have additional requests in its solr.log?
The logfiles on your servers should be verbose enough to indicate what
machines are handling which parts of the request.
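One way to read those logs (a sketch; the sample lines below are made up to mimic the solr.log request format, so check your own files): the sub-requests Solr sends to each shard carry `isShard=true` in their logged params, while the original top-level request does not. The node logging the request *without* `isShard=true` is the one doing the aggregation.

```shell
# Fabricated sample lines imitating solr.log entries: line 1 is a
# shard sub-request, line 2 is a top-level (aggregating) request.
cat > /tmp/solr_sample.log <<'EOF'
2018-10-29 o.a.s.c.S.Request [c_shard1_replica1] path=/select params={q=*:*&isShard=true} hits=100
2018-10-29 o.a.s.c.S.Request [c_shard1_replica1] path=/select params={q=*:*} hits=200
EOF

# Count shard sub-requests vs. top-level requests.
grep -c 'isShard=true' /tmp/solr_sample.log
grep -vc 'isShard=true' /tmp/solr_sample.log
```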
Most Solr performance issues are memory related. With an extreme
query rate, CPU can also be a bottleneck, but memory will almost
always be the bottleneck you run into first.
This is the advice I've seen often, but how exactly can we run out of
memory if total RAM is 128GB, the heap is 8GB, and the index size is
80GB? Especially since the node with 64GB runs just as well, if not
better.
Even when memory is insufficient, "running out" of memory generally
doesn't happen unless the heap is too small. Java will work within the
limits imposed by the system if it can. For the OS disk cache, the OS
tries to be as smart as it can about which data stays in the cache and
which data is discarded.
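The arithmetic behind that, using the numbers from this thread (a sketch that ignores other processes and JVM off-heap overhead):

```shell
# Numbers from this thread: 128GB RAM, 8GB heap, 80GB index.
TOTAL_RAM_GB=128
HEAP_GB=8
INDEX_GB=80

# What's left for the OS disk cache after the JVM heap.
CACHE_GB=$((TOTAL_RAM_GB - HEAP_GB))
echo "available for disk cache: ${CACHE_GB}GB"   # 120GB

# The whole index fits in the cache, which is why SSD speed rarely
# matters here once the cache is warm.
if [ "$CACHE_GB" -ge "$INDEX_GB" ]; then
  echo "entire index can be cached"
fi
```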
A lot of useful information can be obtained from the GC logs that
Solr's built-in scripting creates. Can you share these logs?
The screenshots described here can also be very useful for
troubleshooting:
https://wiki.apache.org/solr/SolrPerformanceProblems#Asking_for_help_on_a_memory.2Fperformance_issue
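For reference, those GC logs come from settings in Solr's startup config. A sketch of the relevant piece of `solr.in.sh` (the `GC_LOG_OPTS` variable is real in recent Solr versions; the exact flags below are illustrative standard Java 8 GC-logging options, so check your own file):

```shell
# In solr.in.sh -- illustrative Java 8 GC-logging flags.
GC_LOG_OPTS="-verbose:gc -XX:+PrintGCDetails -XX:+PrintGCDateStamps \
  -XX:+PrintGCApplicationStoppedTime"
# The start script rotates the resulting logs into numbered files.
```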
I have attached some GC logs and screenshots; hope these are helpful
(I can only attach small files).
Only one attachment made it to the list. I'm surprised that ANY of them
made it -- usually they don't. Generally you need to use a file sharing
website and provide links. Dropbox is one site that works well. Gist
might also work.
The GC log that made it through (solr_gc.log.7.1) is only two minutes
long; nothing useful can be learned from a log that short. It is also
missing the information at the top about the JVM that created it, so I'm
wondering whether you trimmed the file before attaching it.
Thanks,
Shawn