On 10/20/06 10:24 AM, "Mike Klaas" <[EMAIL PROTECTED]> wrote:

> Finally, it can be much faster to search a single field rather than
> multiple fields.  One hacky way of achieving this is to make a field
> which receives a single copy of contents and eight copies of title.
> This is imperfect, as it messes up length normalization and
> summarizing.

Matching a token eight times is probably faster than fetching
a second field. For titles, the normalization probably should
be turned off anyway. Normalization is really there to compare
1000 word docs with 8000 word docs, not 3 word titles with 6 word
titles.

Maybe I'll try one searchable field per weight and check that
for performance. Any rule of thumbs about how the performance
changes when different numbers of fields are searched?

Thanks for all the help. I'm trying to avoid premature optimization,
but I'm starting with a load of 1-2 million queries/day, so I need
to be ready to make it perform.

wunder
-- 
Walter Underwood
Search Guru, Netflix


Reply via email to