: My example is just simple, in real life the numbers are a lot bigger. However,
: the amount of unique products vs variations is such that it seems a lot of
: work to iterate over al variations in a DocSet just to get the few unique
: products.
: But, what I understand from you anwser is that the
Hi Thijs.
If you are not concerned with a *EXACT* number there is a paper that was
published in 1990 that discusses this problem.
http://dblab.kaist.ac.kr/Publication/pdf/ACM90_TODS_v15n2.pdf
from the paper (If I understand it correctly)
For 120,000,000 records you can sample 10,112,529 reco
It must be my english.
When I read your comment, I think you could compare it to the category
example...
Maybe with an example I can explain my situation better:
The documents in the index contain variations of different products.
Say for example I have 10 different products. Every product is i
: You are correct I'm looking for the unique values for one field in a DocSet.
: The field is not multivalued. and it contains only 1 long value, the pk of a
: database table
: But you said the counts are stored in the index, I don't see that. Because
there's something very confusing about your qu
You are correct I'm looking for the unique values for one field in a
DocSet. The field is not multivalued. and it contains only 1 long value,
the pk of a database table
But you said the counts are stored in the index, I don't see that.
Because when I debug simplefacet. It always iterates over al
On Apr 27, 2008, at 7:50 AM, Thijs Vonk wrote:
What is the best way to get the unique terms from a field in a result?
I've been using SimpleFacet to do this. However, I don't need the
counts, so it seems overkill to have to iterate over all the result
documents per field to get the unique va