Re: Cache use

sfox Thu, 06 Dec 2007 13:24:12 -0800

One possible explanation is that the OS's native file system caching isbeing successful at keeping these files mostly in RAM most of the time.And so the performance benefits of 'forcing' the files into RAM byusing tmpfs aren't significant.

So the slowness of the queries is the result of being CPU bound, ratherthan IO bound. The cache within Solr is faster because it is saving andreturning the information for which the CPU-bound work has already beendone.


Just one possible explanation.

Sean Fox

Matthew Phillips wrote:

No one has a suggestion? I must be missing something because as Iunderstand it from Dennis' email, all of queries are very quick (cachedtype response times) whereas mine are not. I can clearly see timedifferences between queries that are cached (things that have been autowarmed) and queries that are not. This seems odd as my whole index isloaded on a tmpfs memory based file system. Thanks for the help.
Matt

On Dec 4, 2007, at 3:55 PM, Matthew Phillips wrote:
Thanks for the suggestion, Dennis. I decided to implement this as youdescribed on my collection of about 400,000 documents, but I did notreceive the results I expected.
Prior to putting the indexes on a tmpfs, I did a bit of benchmarkingand found that it usually takes a little under two seconds for eachfacet query. After moving my indexes from disk to a tmpfs file system,I seem to get about the same result from facet queries: about twoseconds.
Does anyone have any insight into this? Doesn't it seem odd that myresponse times are about the same? Thanks for the help.
Matt Phillips

Dennis Kubes wrote:
One way to do this if you are running on linux is to create a tempfs(which is ram) and then mount the filesystem in the ram. Then yourindex acts normally to the application but is essentially served fromRam. This is how we server the Nutch lucene indexes on our websearch engine (www.visvo.com) which is ~100M pages. Below is how youcan achieve this, assuming your indexes are in /path/to/indexes:
mv /path/to/indexes /path/to/indexes.dist
mkdir /path/to/indexes
cd /path/to
mount -t tmpfs -o size=2684354560 none /path/to/indexes
rsync --progress -aptv indexes.dist/* indexes/
chown -R user:group indexes
This would of course be limited by the amount of RAM you have on themachine. But with this approach most searches are sub-second.
Dennis Kubes
Evgeniy Strokin wrote:
Hello,...
we have 110M records index under Solr. Some queries takes a while,but we need sub-second results. I guess the only solution is cache(something else?)...We use standard LRUCache. In docs it says (as far as I understood)that it loads view of index in to memory and next time works withmemory instead of hard drive.So, my question: hypothetically, we can have all index in memory ifwe'd have enough memory size, right? In this case the result shouldcome up very fast. We have very rear updates. So I think this couldbe a solution.
How should I configure the cache to achieve such approach?
Thanks for any advise.
Gene

Re: Cache use

Reply via email to