Re: Solr facets implementation question

2015-09-23 Thread Toke Eskildsen
On Tue, 2015-09-22 at 11:56 -0700, Erick Erickson wrote: > FWIW, there is work being done for "high cardinality faceting" with > some of the recent Streaming Aggregation code. The JIRA I can find that seems relevant is https://issues.apache.org/jira/browse/SOLR-7903 I can see streaming faceting w

Re: Solr facets implementation question

2015-09-22 Thread Erick Erickson
FWIW, there is work being done for "high cardinality faceting" with some of the recent Streaming Aggregation code. So it's at least on the way if not already there. Erick On Tue, Sep 22, 2015 at 11:44 AM, Toke Eskildsen wrote: > adfel70 wrote: >> Hi Toke, Thank you for the detailed explanatio

Re: Solr facets implementation question

2015-09-22 Thread Toke Eskildsen
adfel70 wrote: > Hi Toke, Thank you for the detailed explanation, thats exactly what I was > looking for, except this algorithm fit single index only. could you please > elaborate what adjustments are needed for distributed index? Vanilla Solr requests top-X terms from each shard, with over-provi

Re: Solr facets implementation question

2015-09-17 Thread adfel70
Toke Eskildsen wrote > adfel70 < > adfel70@ > > wrote: >> I am trying to understand why faceting on a field with lots of unique >> values >> has a great impact on query performance. > > Faceting in Solr is performed in different ways. String faceting different > from Numerics faceting, DocValued

Re: Solr facets implementation question

2015-09-08 Thread Toke Eskildsen
adfel70 wrote: > I am trying to understand why faceting on a field with lots of unique values > has a great impact on query performance. Faceting in Solr is performed in different ways. String faceting different from Numerics faceting, DocValued fields different from non-DocValued, fc different

Re: Solr facets implementation question

2015-09-08 Thread Walter Underwood
Every faceting implementation I’ve seen (not just Solr/Lucene) makes big in-memory lists. Lots of values means a bigger list. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) On Sep 8, 2015, at 8:33 AM, Shawn Heisey wrote: > On 9/8/2015 9:10 AM, adfel70

Re: Solr facets implementation question

2015-09-08 Thread Shawn Heisey
On 9/8/2015 9:10 AM, adfel70 wrote: > I am trying to understand why faceting on a field with lots of unique values > has a great impact on query performance. Since Googling for Solr facet > algorithm did not yield anything, I looked how facets are implemented in > Lucene. I found out that there are