In SimpleFacets.getFacetTermEnumCounts, we seek to the first term
matching the prefix using the index and then for each term after
compare the prefix until it no longer matches.
-Yonik
On Mon, Apr 24, 2017 at 5:04 AM, alessandro.benedetti
wrote:
> Thanks Yonik and Maria.
> It make sense, if we
Thanks Yonik and Maria.
It make sense, if we reduce the number of terms, term enum becomes a very
good solution.
@Yonik : do we still check the prefix on the term dictionary one by one, or
an FST is used to identify the set of candidate terms ?
I will check the code later,
Regards
-
--
I see. Once I specify a prefix the number of terms is MUCH smaller.
Thank you again for all your help.
Maria
On Fri, Apr 21, 2017 at 1:46 PM, Yonik Seeley wrote:
> On Fri, Apr 21, 2017 at 4:25 PM, Maria Muslea
> wrote:
> > The field is:
> >
> >
> >
> > and using unique() I found that it has
On Fri, Apr 21, 2017 at 4:25 PM, Maria Muslea wrote:
> The field is:
>
>
>
> and using unique() I found that it has 700K+ unique values.
>
> The query before (that takes ~10s):
>
> wt=json&indent=true&q=*:*&rows=0&facet=true&facet.field=concept&facet.prefix=A/
>
> the query after (that is almost
The field is:
and using unique() I found that it has 700K+ unique values.
The query before (that takes ~10s):
wt=json&indent=true&q=*:*&rows=0&facet=true&facet.field=concept&facet.prefix=A/
the query after (that is almost instant):
wt=json&indent=true&q=*:*&rows=0&facet=true&facet.field=conc
That is quite interesting !
You can use the stats module ( in association with the Json facets if you
need it) to calculate an accurate approximation of the unique values [1] [2]
.
Good to know it improved your scenario, I may need to update my knowledge of
term enum internals!
Can you describe yo
Actually using facet.method=enum made a HUGE difference even in my case
where I have many unique values. I am happy with the query response time
now.
Is there a way in SOLR to count the unique values for a field? If not, I
could run the reindexing and count the unique values while I add them to
gi
Hi Maria,
If you have 100-500.000 unique values for the field you are interested in,
and the cardinality of your search results is actually quite small in
comparison, I am not that sure term enum will help you that much ...
To simplify, with the term enum approach, you iterate over each unique
val
Hmmm, not sure. Probably in the range of 100K-500K.
Before writing the email I was just looking at:
http://yonik.com/facet-performance/
Wow, using facet.method=enum makes a big difference. I will read on it to
understand what it does.
Thank you so much.
Maria
On Tue, Apr 18, 2017 at 5:21 PM, Y
How many unique values in the index?
You could try facet.method=enum
-Yonik
On Tue, Apr 18, 2017 at 8:16 PM, Maria Muslea wrote:
> Hi,
>
> I have ~40K documents in SOLR (not many) and a multivalued facet field that
> contains at least 2K values per document.
>
> The values of the facet field lo
10 matches
Mail list logo