On Tue, 2015-09-22 at 11:56 -0700, Erick Erickson wrote:
> FWIW, there is work being done for "high cardinality faceting" with
> some of the recent Streaming Aggregation code.
The JIRA I can find that seems relevant is
https://issues.apache.org/jira/browse/SOLR-7903
I can see streaming faceting w
FWIW, there is work being done for "high cardinality faceting" with
some of the recent Streaming Aggregation code.
So it's at least on the way if not already there.
Erick
On Tue, Sep 22, 2015 at 11:44 AM, Toke Eskildsen
wrote:
> adfel70 wrote:
>> Hi Toke, Thank you for the detailed explanatio
adfel70 wrote:
> Hi Toke, Thank you for the detailed explanation, that's exactly what I was
> looking for, except this algorithm fits a single index only. Could you please
> elaborate on what adjustments are needed for a distributed index?
Vanilla Solr requests top-X terms from each shard, with over-provi
Toke Eskildsen wrote
> adfel70 wrote:
>> I am trying to understand why faceting on a field with lots of unique values
>> has a great impact on query performance.
>
> Faceting in Solr is performed in different ways. String faceting is different
> from numeric faceting, DocValued
adfel70 wrote:
> I am trying to understand why faceting on a field with lots of unique values
> has a great impact on query performance.
Faceting in Solr is performed in different ways. String faceting is different from
numeric faceting, DocValued fields are different from non-DocValued, fc different
Every faceting implementation I’ve seen (not just Solr/Lucene) makes big
in-memory lists. Lots of values means a bigger list.
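
A minimal Python sketch of that idea, not Solr's or Lucene's actual code. It assumes each document stores term ordinals for the facet field, and the names facet_counts, top_terms and doc_ordinals are made up for illustration. The point is only that the counter list has one slot per unique value, so its size follows the field's cardinality rather than the number of hits.

```python
from typing import List, Sequence, Tuple


def facet_counts(doc_ordinals: Sequence[Sequence[int]],
                 matching_docs: Sequence[int],
                 num_unique_terms: int) -> List[int]:
    """Count term occurrences across the documents that matched the query."""
    # One counter per unique term in the field. This is the big in-memory
    # list: it is allocated in full even if only a few documents match.
    counts = [0] * num_unique_terms
    for doc_id in matching_docs:
        for ordinal in doc_ordinals[doc_id]:
            counts[ordinal] += 1
    return counts


def top_terms(counts: Sequence[int],
              terms: Sequence[str],
              limit: int) -> List[Tuple[str, int]]:
    """Return up to `limit` terms with non-zero counts, highest count first."""
    # Ties are broken alphabetically so the result is deterministic.
    ranked = sorted(range(len(counts)), key=lambda o: (-counts[o], terms[o]))
    return [(terms[o], counts[o]) for o in ranked[:limit] if counts[o] > 0]


# Hypothetical toy index: 3 unique terms, 4 documents.
terms = ["apple", "banana", "cherry"]
doc_ordinals = [[0, 1], [1], [1, 2], [0]]
matching_docs = [0, 1, 2]  # documents that matched the query

counts = facet_counts(doc_ordinals, matching_docs, len(terms))
result = top_terms(counts, terms, limit=2)
```

With millions of unique values the counts list has millions of entries per request, and extracting the top terms has to look at all of them.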
wunder
Walter Underwood
wun...@wunderwood.org
http://observer.wunderwood.org/ (my blog)
On Sep 8, 2015, at 8:33 AM, Shawn Heisey wrote:
> On 9/8/2015 9:10 AM, adfel70
On 9/8/2015 9:10 AM, adfel70 wrote:
> I am trying to understand why faceting on a field with lots of unique values
> has a great impact on query performance. Since Googling for Solr facet
> algorithm did not yield anything, I looked how facets are implemented in
> Lucene. I found out that there are