On 2/7/07, Gunther, Andrew <[EMAIL PROTECTED]> wrote:
Yes most all terms are multi-valued which I can't avoid. Since the data is coming from a library catalogue I am translating a subject field to make a subject facet. That facet alone is the biggest, hovering near 39k. If I remove this facet.field things return faster. So am I to assume that this particular field bogging down operations and there are no other optimization options besides cutting down this field?
Well, the applicable optimizations probably will be related to how you use the results. Surely you are not displaying 39,000 facet counts to the user? If you are only displaying the top subjects, one solution is to collect more documents than you need, enumerate the subjects of the results, and only facet on that subset. This could be built into solr eventually (some sort of sampling bitset intersection size). -Mike