I'm trying to solve a search problem and wondering if facets (or something 
else) might solve the problem.

Let's assume I have a bunch of documents (100 million+).  Each document has a 
category (keyword) assigned to it.  A single document my only have one 
category, but there may be multiple documents with the same category (1 to a 
few hundred documents may be in any one category).  There are several million 
categories.

Supposed I'm doing a search with a page size of 50.  What I want to do is do a 
search (e.g., "dog") and get back the top 50 documents that match the contain 
the word "dog" and are all in different categories.  So, there needs to be one 
document from 50 different categories.

If that's not possible, then is it possible to do it if I know the 50 
categories up-front and hand that off as part of the search (so "find 50 
documents that match the term 'dog' and there is one document from each of 50 
specified categories").

Is there a way to do this?

I'm not extremely knowledgeable about facets, but thought that might be a 
solution.  But, it doesn't have to be facets.

Thanks for any help

Scott


Reply via email to