On rereading my original post it does sound weird. Let me try again and thanks for bearing with me.

I want to know how long solr will take to process a unique query taking full advantage of OS i/o buffers. I think executing a set of unique queries from a cold start should measure that if I knew that I was making maximal use of OS i/o buffering. That is, that the only variable in my test was the amount of memory devoted to buffering index segments. Is that basically correct?

As I scale up shard size, I just want to know that response time is staying within bounds as I add documents. When response time falls out of bounds, I'll add another shard but I want to know when to do that.

So I guess I'm asking how to make sure that solr is making best use of OS i/o buffers on a dedicated server.

Phil


Otis Gospodnetic wrote:
Hi,

I think you are describing some "weird" unrealistic scenarios.
There is typically no need to test "just solr" without relying on disk caches.  
Not using disk buffers will only work in trivial scenarios, but if you really want to 
test it, run something that hogs memory while running solr perf tests on the same server.

You often can't prewarm caches at the very beginning because you can't predict 
queries (not always true), so yes, initially caches will be empty, but then 
they will get filled and then you will (want to) use them.  I don't think there 
is a way to clear Solr caches.

I can't give more advice at this time, I don't fully understand what you are 
trying to test...


Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



----- Original Message ----
From: Phillip Farber <[EMAIL PROTECTED]>
To: solr-user@lucene.apache.org
Sent: Wednesday, August 20, 2008 1:34:20 PM
Subject: Testing query response time



I would like to test query response time for a set of queries. I'm not interested in capacity Q/sec, just response time. My queries will be against an index of OCR'd books so in the real world every query is probably unique and impossible to predict so I don't see a way to prewarm any of the caches. I'm not sorting. I'm not faceting. I'm querying on a few fields like title, author, subject and date in a range.

Regarding initial conditions, it seems that there's no useful state into which I can put the caches. Would the best approach be to run the queries from a cold solr startup?

What about OS disk caches? I can see two arguments. One, just to test solr the disk caches should be empty. On the other hand, realistically, the disk caches would be full so that argues for executing enough queries to load those and then redo the query set (with empty solr caches).

Speaking of empty solr caches, is there a way to flush those while solr is running?

What other system states do I need to control for to get a handle on response time?

Thanks and regards,

Phil
------------------------------------------
Phillip Farber - http://www.umdl.umich.edu

Reply via email to