Re: unique values from a field in a result

2008-04-29 Thread Chris Hostetter
: My example is just simple, in real life the numbers are a lot bigger. However, the amount of unique products vs variations is such that it seems a lot of work to iterate over all variations in a DocSet just to get the few unique products. But, what I understand from your answer is that the

Re: unique values from a field in a result

2008-04-29 Thread Ian Holsman
Hi Thijs. If you are not concerned with an *EXACT* number, there is a paper published in 1990 that discusses this problem: http://dblab.kaist.ac.kr/Publication/pdf/ACM90_TODS_v15n2.pdf From the paper (if I understand it correctly): for 120,000,000 records you can sample 10,112,529 reco

Re: unique values from a field in a result

2008-04-29 Thread Thijs
It must be my English. When I read your comment, I think you could compare it to the category example... Maybe I can explain my situation better with an example: The documents in the index contain variations of different products. Say for example I have 10 different products. Every product is i

Re: unique values from a field in a result

2008-04-28 Thread Chris Hostetter
: You are correct, I'm looking for the unique values for one field in a DocSet. The field is not multivalued, and it contains only 1 long value, the pk of a database table. But you said the counts are stored in the index, I don't see that. Because there's something very confusing about your qu

Re: unique values from a field in a result

2008-04-28 Thread Thijs Vonk
You are correct, I'm looking for the unique values for one field in a DocSet. The field is not multivalued, and it contains only 1 long value, the pk of a database table. But you said the counts are stored in the index, I don't see that. Because when I debug SimpleFacet, it always iterates over al

Re: unique values from a field in a result

2008-04-28 Thread Ryan McKinley
On Apr 27, 2008, at 7:50 AM, Thijs Vonk wrote: What is the best way to get the unique terms from a field in a result? I've been using SimpleFacet to do this. However, I don't need the counts, so it seems overkill to have to iterate over all the result documents per field to get the unique va
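
For readers skimming the thread, the question above (unique values of a single-valued field over a set of matching documents, no counts needed) can be illustrated with a minimal sketch. This is not Solr code and not anyone's proposed solution from the thread: `doc_ids` is a hypothetical stand-in for a DocSet, `field_values` is a hypothetical per-document lookup table, and the ids are made up.

```python
def unique_field_values(doc_ids, field_values):
    """Return the distinct values of a single-valued field over a set of documents.

    doc_ids: iterable of integer document ids (hypothetical stand-in for a DocSet).
    field_values: list where field_values[doc_id] is that document's value
                  (hypothetical stand-in for a per-document field lookup).
    """
    seen = set()
    for doc_id in doc_ids:
        # One lookup per matching document; no per-value counts are kept.
        seen.add(field_values[doc_id])
    return seen


# Ten variation documents belonging to three products (made-up ids).
product_of_doc = [101, 101, 101, 102, 102, 103, 103, 103, 103, 101]
matching_docs = [0, 1, 3, 4, 9]
print(sorted(unique_field_values(matching_docs, product_of_doc)))
```

Note that this still visits every matching document once, which is exactly the cost the original question is trying to avoid; it only drops the counting.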