On Tue, 2015-09-22 at 11:56 -0700, Erick Erickson wrote:
> FWIW, there is work being done for "high cardinality faceting" with
> some of the recent Streaming Aggregation code.
The JIRA I can find that seems relevant is
https://issues.apache.org/jira/browse/SOLR-7903
I can see streaming faceting w
FWIW, there is work being done for "high cardinality faceting" with
some of the recent Streaming Aggregation code.
So it's at least on the way if not already there.
Erick
On Tue, Sep 22, 2015 at 11:44 AM, Toke Eskildsen
wrote:
> adfel70 wrote:
>> Hi Toke, Thank you for the detailed explanatio
adfel70 wrote:
> Hi Toke, Thank you for the detailed explanation, that's exactly what I was
> looking for, except this algorithm fits a single index only. Could you please
> elaborate on what adjustments are needed for a distributed index?
Vanilla Solr requests top-X terms from each shard, with over-provi
t top K results from
every shard is not good enough).
Is that correct? I tried to verify this behaviour, but I didn't see that the
process that received the request from the user used more memory than the other
shards.
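
To make the "top K from every shard is not good enough" point concrete, here is a minimal Python sketch. It is not Solr's actual merge logic; the function names, the `per_shard` parameter, and the toy counts are all made up for illustration. It only shows that a term which is never first on any single shard can still have the highest total.

```python
from collections import Counter

def shard_top(counts, n):
    """Return the n highest-count (term, count) pairs for one shard."""
    return Counter(counts).most_common(n)

def merge_top(shards, k, per_shard):
    """Sum the partial lists from every shard and return the merged top k.

    per_shard is a hypothetical knob: how many terms each shard reports.
    """
    merged = Counter()
    for counts in shards:
        for term, n in shard_top(counts, per_shard):
            merged[term] += n
    return merged.most_common(k)

# Toy data: term "c" is last on both shards but has the largest total (16).
shards = [
    {"a": 10, "b": 9, "c": 8},
    {"d": 10, "e": 9, "c": 8},
]

naive = merge_top(shards, k=1, per_shard=1)  # each shard reports only its top 1
wider = merge_top(shards, k=1, per_shard=3)  # each shard reports more than k

print("naive:", naive)
print("wider:", wider)
```

With `per_shard=1` the merged result never sees "c" at all, while asking each shard for more terms than the final K recovers it. How many extra terms are enough depends on the data, so this is only a demonstration of the problem, not a recipe.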
--
View this message in context:
http://lucene.472066.n3.nabble.com/Solr-facets-implementation-question-tp4227604p4229741.html
Sent from the Solr - User mailing list archive at Nabble.com.
adfel70 wrote:
> I am trying to understand why faceting on a field with lots of unique values
> has a great impact on query performance.
Faceting in Solr is performed in different ways. String faceting is different from
numeric faceting, DocValued fields are different from non-DocValued, fc different
Every faceting implementation I’ve seen (not just Solr/Lucene) makes big
in-memory lists. Lots of values means a bigger list.
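
A rough Python sketch of the "big in-memory list" idea, for illustration only. This is not how Solr or Lucene is implemented; the function names and the toy data are assumptions. It just shows a counter structure with one slot per unique value, so its size follows the number of unique values rather than the number of matching documents.

```python
def facet_counts(doc_ordinals, num_unique_terms, matching_docs):
    """Count term occurrences among the matching documents.

    doc_ordinals[doc] is the list of term ordinals present in that document.
    """
    counts = [0] * num_unique_terms  # one slot per unique term in the field
    for doc in matching_docs:
        for ordinal in doc_ordinals[doc]:
            counts[ordinal] += 1
    return counts

def top_terms(counts, terms, k):
    """Return up to k (term, count) pairs with non-zero counts, highest first."""
    ranked = sorted(range(len(counts)), key=lambda o: counts[o], reverse=True)
    return [(terms[o], counts[o]) for o in ranked[:k] if counts[o] > 0]

# Toy index: three unique terms, five documents.
terms = ["red", "green", "blue"]
doc_ordinals = [[0], [0, 2], [1], [2], [0]]
matching_docs = [0, 1, 4]  # documents that matched the query

counts = facet_counts(doc_ordinals, len(terms), matching_docs)
print(top_terms(counts, terms, 2))
```

Even though only three documents match, the list is allocated for every unique term, and finding the top entries means looking at every slot. With millions of unique values both steps get correspondingly larger.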
wunder
Walter Underwood
wun...@wunderwood.org
http://observer.wunderwood.org/ (my blog)
On Sep 8, 2015, at 8:33 AM, Shawn Heisey wrote:
> On 9/8/2015 9:10 AM, adfel70
On 9/8/2015 9:10 AM, adfel70 wrote:
> I am trying to understand why faceting on a field with lots of unique values
> has a great impact on query performance. Since Googling for Solr facet
> algorithm did not yield anything, I looked at how facets are implemented in
> Lucene. I found out that there are
SortedSetDocValues-based. Are Solr's facet capabilities based on one of
those methods? If so, I still can't understand why unique values impact
query performance...
--
View this message in context:
http://lucene.472066.n3.nabble.com/Solr-facets-implementation-question-tp4227604.html