On 6-Feb-08, at 11:07 AM, Fuad Efendi wrote:
What I did: single-valued fields for Category and ItemName.
Category field
is tokenized (with custom analyzer), and I updated only 30% of
Lucene index,
but it was more than enough for huge performance improvements.
Before that,
due to some mistakes in parsing HTML and etc., I had *multivalued*
field for
facets; I tried to minimize amount of tokens (sorry for possibly wrong
terminology). Field for facets is still tokenized, but it is single-
value.
Before: after Commit/Optimize and Server Restart first query took 5-7
minutes to execute
After: only 2 seconds!
I was browsing Lucene fieldCache, unfortunately it's not applicable
for
tokenized fields...
filterCache size is almost the same as before, but it works much
faster.
Indeed the field cache method works much better when the values are
single-valued. Unfortunately, there is no way for solr to know that
the analyzer is only outputting a single token per document, else we
could apply this optimization automatically.
-Mike