On 2/7/07, Gunther, Andrew <[EMAIL PROTECTED]> wrote:
Yes most all terms are multi-valued which I can't avoid.
Since the data is coming from a library catalogue I am translating a
subject field to make a subject facet.  That facet alone is the biggest,
hovering near 39k.  If I remove this facet.field things return faster.
So am I to assume that this particular field bogging down operations and
there are no other optimization options besides cutting down this field?

Well, the applicable optimizations probably will be related to how you
use the results.  Surely you are not displaying 39,000 facet counts to
the user?

If you are only displaying the top subjects, one solution is to
collect more documents than you need, enumerate the subjects of the
results, and only facet on that subset.

This could be built into solr eventually (some sort of sampling bitset
intersection size).

-Mike

Reply via email to