Re: autowarmCount usefulness

Erik Hatcher Tue, 27 Jun 2006 03:45:36 -0700


On Jun 26, 2006, at 10:38 PM, Chris Hostetter wrote:

: I'm trying to fully understand the LRUCache and the autowarmCount
: parameter.   Why does it make sense to auto-warm filters and query
: results?   In my case, if a new document is added it may invalidate
: many filters, and it would require knowing the details of the
: documents added/removed to know which caches could be copied.
:
: Can someone shed light on the scenarios where blindly copying over
: any cached filters (or query results) makes sense?

Autowarming of the filterCache and queryResultCache doesn't justcopy the

cached values -- it reexecutes the queries used as the keys for those

caches and generates new DocSet/DocLists using the *new* searcher,before

that searcher is made available to threads serving queries over HTTP.

Ah, that was the secret sauce I was missing. I'm still making my waythrough the codebase understanding how it is put together, and now Isee the regenerators in the SolrIndexSearcher for these built-in caches.

For named User caches, autowarming doesn't work at all unless you've
specified a regenerator -- which can do whatever it wants using thenew
searcher and the information from the old cache.

How do I use LRUCache as a custom user cache to deal with cachemisses and look up data dynamically then? It seems to me thatLRUCache.get() should deal with misses itself and call theregenerator if the key is not found. But rather SolrIndexSearcherdeals with this. If I define a custom cache as an LRUCache with acustom regenerator, it looks like I have to add a bit more customcode around where I use that cache to deal with misses. Does it makesense that LRUCache would pass through to a regenerator on .get() ifthe key is not found?

The reason autowarming is configured using an autowarmCount is soyou cancontrol just how much effort Solr should put into the autowarmingof thenew cache ... if you've got a limitless supply of RAM, and an indexthat
doesn't change very often, you can make your caches so big that no
DocSet/DocList is ever generated dynamically more then once -- butwhat
happens when your index does finally change? ... if your autowarmCount
is the same as the size of your index, Solr could spend daysautowarmingevery query ever executed against your index, even if it was onlyexecuted
one time 3 weeks ago.  the autowarmCount tells Solr to only warm the N
"best" keys in the cache where "best" is defined by the Cache
implimentation (for an LRUCache, the "best" things are the things most
recently used).

"LRU" abbreviation confuses me.... I see "least recently used" when Isee that, but it really means "last recently used" within Solr. :)

Once upon a time Yonik and I hypothisized that it would be cool tohaveautowarmTimelimit and autowarmPercentage (of current size) paramsand someother things like that so you could have other ways of tweakingjust how
much autowarming is done on your behalf ... but they were never built.

No worries there. The caching is quite nice as it is. As needarises, more bells and whistles can be added, but the currentparameters are sufficient for my needs so far.


        Erik

Re: autowarmCount usefulness

Reply via email to