RE: Getting facet counts for 10,000 most relevant hits

Chris Hostetter Sat, 15 Oct 2011 15:03:13 -0700

: I didn't realize how much more complicated this gets with distributed 
: search. Do you think it's worth opening a JIRA issue for this?


features are always worth opening jiras for if you have ideas about them 
to add as comments (or a patch)

by all means open a jira and put whatever relevant notes you think make 
sense (crib from my email as much as you want)

as i (think i) mentioned: the only feasible way i can think of to 
approach this type of problem in a generalized way at scale is to think 
about the API as a "sampling" API, where instead of specifying absolutes 
(ie: "give me the top 100 constraints from the top 10,000 matches") the 
API works in terms of "goals" (ie: "suggest the top 100 constraints based 
on the top 10% of matches") and then solr has some wiggle room -- it can 
ask each shard for the top 100*N constraints from its top (10*M)% matches, 
then weight all those constraints based on how many matches come from each 
shard to pick the final 100 constraints, then ask each shard for the final 
counts for those constraints (like it already does)
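
a rough sketch of the weighting step described above, in python. 
everything in it is hypothetical: pick_final_constraints, the 
num_matches / sample_size / sample_counts fields, and the scoring formula 
(each shard's sample frequency scaled by that shard's share of the total 
matches) are just one possible reading of "weight based on how many 
matches come from each shard" -- none of it is an existing solr API.

```python
from collections import defaultdict


def pick_final_constraints(shard_samples, limit):
    """Merge per-shard sampled facet counts into one ranked constraint list.

    shard_samples: list of dicts (hypothetical shape), one per shard:
        num_matches   -- total docs on the shard matching the query
        sample_size   -- how many top docs the shard actually sampled
        sample_counts -- {constraint: count within that sample}
    limit: how many constraints to keep (the "top 100" in the goal).
    """
    total_matches = sum(s["num_matches"] for s in shard_samples)
    if total_matches == 0:
        return []

    scores = defaultdict(float)
    for shard in shard_samples:
        if shard["sample_size"] == 0:
            continue
        # Assumption: a shard's influence is its share of all matches.
        weight = shard["num_matches"] / total_matches
        for constraint, count in shard["sample_counts"].items():
            # Frequency of the constraint inside this shard's sample.
            frequency = count / shard["sample_size"]
            scores[constraint] += weight * frequency

    # Highest score first; ties broken by name so the result is stable.
    ranked = sorted(scores, key=lambda c: (-scores[c], c))
    return ranked[:limit]
```

the list this returns would then go back to every shard for exact counts, 
the same way the existing refinement step works.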

: Is there already some ongoing work on the faceting code that this might fit 
: in with?

not that i know of.


-Hoss
