Re: What's the bottleneck?

Mike Klaas Thu, 11 Sep 2008 15:43:15 -0700

On 11-Sep-08, at 8:24 AM, Jason Rennie wrote:

We have a 14 million document index that we only use for querying
(optimized, read-only). When we issue queries that have few, relatively
rare words, the query returns quickly. However, when the query is longer
and uses more common words (hitting, say, ~1 million docs), it might take
seconds to return. I'd like to know: what's the bottleneck? It doesn't
seem to be disk---i/o wait times on the machine are much, much lower than
on our database servers (e.g. 3% vs. 50%). Our search server is an 8-core
machine and we do see cpu regularly holding above 100%, so cpu seems
plausible, but would it really take that long to compute scores?
We're using DisMax. There are a number of different fields that we search
over (5 to be exact). We also have an fq on a single-digit status field.
Does it make sense that computation time could easily exceed a second? If
cpu is the bottleneck, is there anything we could do to easily trim down
computation time (besides removing common words from the query)?

Are you using pf? Phrase queries are much more expensive than term queries.
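One way to check whether pf is the cost is to issue the same DisMax query with and without it and compare the QTime Solr reports. The snippet below is only a sketch that builds the two request URLs; the host, the field names, the status value and the helper build_dismax_url are all made up for illustration and are not from the original setup.

```python
from urllib.parse import urlencode

# Hypothetical Solr endpoint; substitute your own host and core.
SOLR_SELECT_URL = "http://localhost:8983/solr/select"


def build_dismax_url(query, qf, fq=None, pf=None):
    """Build a DisMax select URL; pf (phrase fields) is optional."""
    params = [
        ("defType", "dismax"),
        ("q", query),
        ("qf", " ".join(qf)),  # fields searched with term queries
        ("rows", "10"),
    ]
    if fq:
        params.append(("fq", fq))  # filter query, e.g. the status field
    if pf:
        params.append(("pf", " ".join(pf)))  # adds phrase queries on these fields
    return SOLR_SELECT_URL + "?" + urlencode(params)


# Hypothetical field names standing in for the five searched fields.
fields = ["title", "body", "tags", "author", "summary"]
query = "cheap flights to new york"

with_pf = build_dismax_url(query, fields, fq="status:1", pf=["title", "body"])
without_pf = build_dismax_url(query, fields, fq="status:1")

print(with_pf)
print(without_pf)
```

Fetching both URLs a few times and comparing QTime in the response header should show how much of the time goes to the phrase queries. Repeat each request, since the first hit may be skewed by cache warm-up.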

If you have a restrictive fq, you might try an approach similar to the one in https://issues.apache.org/jira/browse/SOLR-407 .


-Mike
