Re: How to trace one query？the debug/debugQuery info are not enough to find out why a query is slow

Shawn Heisey Thu, 23 Aug 2018 06:06:02 -0700

On 8/23/2018 5:19 AM, zhenyuan wei wrote:

Thanks for your detail answer @Shawn


Yes I run the query in SolrCloud mode, and my collection has 20 shards,
each shard size is 30~50GB。
4 solr server, each solr JVM  use 6GB, HDFS datanode are 4 too, each
datanode JVM use 2.5GB。
Linux server host are 4 node too，each node is 16 core/32GB RAM/1600GB SSD 。

So, in order to  search 2 billion docs fast( HDFS shows 787GB )，I should
turn on autowarm，and   How
much  solr RAM/how many solr node  it should be？
Is there a roughly  formula to budget ？

There are no generic answers, no rough formulas. Every install isdifferent and minimum requirements are dependent on the specifics of theinstall.

How many replicas do you have of each of those 20 shards? Is the 787GBof data the size of *one* replica, or the size of *all* replicas? Basedon the info you shared, I suspect that it's the size of one replica.


Here's a guide I've written:

https://wiki.apache.org/solr/SolrPerformanceProblems

That guide doesn't consider HDFS, so the info about the OS disk cache onthat page is probably not helpful. I really have no idea whatrequirements HDFS has. I *think* that the HDFS client block cache wouldreplace the OS disk cache, and that the Solr heap must be increased toaccommodate that block cache. This might lead to GC issues, though,because ideally the cache would be large enough to cache all of theindex data that the Solr instance is accessing. In your case, that's aLOT of data, far more than you can fit into the 32GB total systemmemory.Solr performance will suffer if you're not able to have thesystem cache Solr's index data. But I will tell you that achieving aQTime of 125 on a wildcard query against a 2 billion document index isimpressive, not something I would expect to happen with the low hardwareresources you're using.

You have 20 shards. If your replicationFactor is 3, then ideally youwould have 60 servers - one for each shard replica. Each server wouldhave enough memory installed that it could cache the 30-50GB of data inthat shard, or at least MOST of it.

IMHO, Solr should be using local storage, not a network filesystem likeHDFS. Things are a lot more straightforward that way.


Thanks,
Shawn

Re: How to trace one query？the debug/debugQuery info are not enough to find out why a query is slow

Reply via email to