On 2/7/2018 5:20 AM, Maulin Rathod wrote:
> Further analyzing issue we found that asking for too many rows (e.g.
> rows=10000000) can cause full GC problem as mentioned in below link.
This is because when you ask for 10 million rows, Solr allocates a
memory structure capable of storing information for each of those 10
million rows, even before it knows how many documents are actually going
to match the query. This problem is discussed in the blog post from Toke
that you linked.
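If the goal really is to pull back a huge result set, it is much kinder
to the heap to page through it in smaller chunks. A rough sketch of
cursor-based paging, with purely illustrative parameter values and
assuming your uniqueKey field is named "id":

  # first request: sort must end on the uniqueKey field, cursorMark starts at *
  q=your_query&rows=500&sort=score desc,id asc&cursorMark=*
  # later requests: pass the nextCursorMark value returned by the previous response
  q=your_query&rows=500&sort=score desc,id asc&cursorMark=<value from previous response>

With cursorMark (or even plain start/rows paging), Solr only sizes its
result structures for the rows you actually request on each call.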
Bare wildcard queries can also lead to big problems with memory churn,
and are not recommended. Your query has a bare "*" included in it
FOURTEEN times, on the summary field. The name of that field suggests
that it will have a very high term count. If it does have a lot of
unique terms, then ONE wildcard is going to be horrifically slow and
consume a ton of memory. Fourteen of them is going to be particularly
insane. You've also got a number of wildcards with text prefixes, which
will not be as bad as the bare wildcard, but can still chew up a lot of
memory and time.
I suspect that the entire "summary" part of your query generation needs
to be reworked.
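If those bare wildcards on summary are only there to express "this field
must have a value", there are cheaper ways to say that. A hedged sketch
of possible substitutions, worth testing against your own data:

  summary:*          <- wildcard, has to enumerate the field's terms
  summary:[* TO *]   <- open-ended range query, often cheaper in practice
  has_summary:true   <- boolean field populated at index time, cheapest of all

The has_summary field is hypothetical -- it is only an illustration of
pushing that work to index time rather than query time.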
You also have wildcards in the part of the query on the "title" field.
The kind of query you are doing with wildcards can often be replaced
completely by ngram or edgengram filtering in the analysis chain, usually
with a big performance advantage.
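As a sketch of what that might look like in the schema (the type name,
field name, tokenizer, and gram sizes below are only placeholders to
adjust for your data):

  <fieldType name="text_edge" class="solr.TextField" positionIncrementGap="100">
    <analyzer type="index">
      <tokenizer class="solr.StandardTokenizerFactory"/>
      <filter class="solr.LowerCaseFilterFactory"/>
      <filter class="solr.EdgeNGramFilterFactory" minGramSize="2" maxGramSize="20"/>
    </analyzer>
    <analyzer type="query">
      <tokenizer class="solr.StandardTokenizerFactory"/>
      <filter class="solr.LowerCaseFilterFactory"/>
    </analyzer>
  </fieldType>

A field using a type like that lets a plain term query such as
title_edge:invoi match what title:invoi* matches today, except that the
expensive expansion happens once at index time instead of on every query,
at the cost of a larger index.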
I suspect that the large number of wildcards is a big part of why your
example query took 83 seconds to execute. There may have also been some
nasty GC pauses during the query.
You still have not answered the questions asked early in this thread
about memory. Is the heap 40GB, or is that the total memory installed
in the server? What is the total size of all Solr heaps on the machine,
how much total memory is in the server, and how much index data (both
document count and disk space size) is being handled by all the Solr
instances on that machine?
The portion of your GC log that you included is too short, and has also
been completely mangled by being pasted into an email. If you want it
analyzed, we will need a full copy of the logfile, without any
modification, which likely means you need to use a file sharing site to
transport it.
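If Solr was started with the scripts included in the download, GC logging
should already be turned on, and the full file normally lives next to
Solr's other logs. On a default install that is usually something like:

  server/logs/solr_gc.log

The exact name and location can vary a little between versions and with
how SOLR_LOGS_DIR is set.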
What I *can* decipher from your GC log suggests that your heap size may
actually be 48GB, not 40GB. After the big GC event, there was a little
over 17GB of heap memory still in use. So my first bit of advice is to
try reducing the heap size. Without a larger GC log, my current thought
is to make it half what it currently is -- 24GB. With a more extensive
GC log, I could make a more accurate recommendation.
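If you are starting Solr with the included scripts, the heap size is
normally controlled from the include file. A minimal sketch, assuming a
solr.in.sh-based install (solr.in.cmd on Windows):

  # solr.in.sh -- sets both -Xms and -Xmx for the Solr JVM
  SOLR_HEAP="24g"

If you set the heap some other way, adjust whatever -Xms/-Xmx values you
are currently passing instead.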
My second bit of advice would be to eliminate as many wildcards from your
query as you can. If your queries are producing the correct results, then
I suspect that the "summary" part of your query example is quite possibly
completely unnecessary, and it is going to require a LOT of memory.
Additional advice, not really related to the main discussion:
Some of the query looks like it is a perfect candidate for extraction
into filter queries. Any portion of the query that is particularly
static is probably going to benefit from being changed into a filter
query. Possible filters you could use based on what I see:
fq=isFolderActive:true
fq=isXref:false
fq=*:* -document_type_id:(3 7)
If your index activity is well-suited for heavy filterCache usage,
filters like this can achieve incredible speedups.
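To illustrate how those fit together with the rest of the request (the q
value here is just a stand-in for your real query):

  q=<the rest of your query>
  &fq=isFolderActive:true
  &fq=isXref:false
  &fq=*:* -document_type_id:(3 7)

Each fq parameter is cached separately in the filterCache, so the speedup
comes from the same filter being reused across many different user
queries.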
A lot of the other things in the query appear to be for ID values that
are likely to change for every user. Query clauses like that are not a
good fit for filter queries.
Thanks,
Shawn