On Tue, Oct 24, 2017 at 8:37 AM, Erick Erickson <erickerick...@gmail.com>
wrote:

> bq:  It is a bit surprising why facet computation
>  is so slow even when the query matches hundreds of docs.
>
> The number of terms in the field over all docs also comes into play.
> Say you're faceting over a field that has 100,000,000 unique values
> across all docs, that's a lot of bookkeeping.
>
>
The 100M unique values may be spread across all docs, but unless the faceting
implementation is really naive, I cannot see how that comes into play
when the query matches only a small fraction of them.
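
To make the two positions concrete, here is a minimal sketch in Python. It is
not Solr's actual code; the function names, the doc_ordinals mapping, and the
idea of numbering terms with integer ordinals are assumptions for illustration
only. The first function keeps one counter slot per unique term, so its cost
grows with the total number of terms even when few docs match. The second
counts only what the matched docs contain.

```python
from collections import Counter


def facet_counts_array(doc_ordinals, matched_docs, num_unique_terms, top_k):
    """One counter slot per unique term in the field (hypothetical layout)."""
    # Allocation cost scales with all unique terms, not with the matches.
    counts = [0] * num_unique_terms
    for doc in matched_docs:
        for term_ord in doc_ordinals.get(doc, ()):
            counts[term_ord] += 1
    # Finding the top entries scans every slot, again O(num_unique_terms).
    nonzero = [(term_ord, c) for term_ord, c in enumerate(counts) if c]
    return sorted(nonzero, key=lambda kv: (-kv[1], kv[0]))[:top_k]


def facet_counts_hash(doc_ordinals, matched_docs, top_k):
    """Count only the terms that occur in the matched docs."""
    counts = Counter()
    for doc in matched_docs:
        counts.update(doc_ordinals.get(doc, ()))
    # Work here is proportional to the values in the matched docs.
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))[:top_k]


# Tiny example: doc id -> term ordinals for a multivalued field.
doc_ordinals = {0: [1, 2], 1: [2], 2: [3]}
matched_docs = [0, 1]

array_result = facet_counts_array(doc_ordinals, matched_docs, 5, 2)
hash_result = facet_counts_hash(doc_ordinals, matched_docs, 2)
print(array_result, hash_result)
```

Both functions return the same counts; they differ only in how the work
scales. With 100M unique terms and a few hundred matches, the first pays for
100M slots on every request, while the second pays only for the matched docs.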



> Best,
> Erick
>
>
> On Tue, Oct 24, 2017 at 1:08 AM, Emir Arnautović
> <emir.arnauto...@sematext.com> wrote:
> > Hi John,
> > Did you mean “docValues don’t work for analysed fields”? They do work for
> > multivalued string fields (and other supported types). What you need to
> > do is convert your analysed field to a multivalued string field, which
> > requires changes in the indexing flow.
> >
> > HTH,
> > Emir
> > --
> > Monitoring - Log Management - Alerting - Anomaly Detection
> > Solr & Elasticsearch Consulting Support Training - http://sematext.com/
> >
> >
> >
> >> On 23 Oct 2017, at 21:08, John Davis <johndavis925...@gmail.com> wrote:
> >>
> >> Docvalues don't work for multivalued fields. I just started a separate
> >> thread with more debug info. It is a bit surprising why facet computation
> >> is so slow even when the query matches hundreds of docs.
> >>
> >> On Mon, Oct 23, 2017 at 6:53 AM, alessandro.benedetti <a.benede...@sease.io>
> >> wrote:
> >>
> >>> Hi John,
> >>> first of all, I may be stating the obvious, but have you tried docValues?
> >>>
> >>> Apart from that, a friend of mine (Diego Ceccarelli) was discussing a
> >>> probabilistic implementation, similar to HyperLogLog [1], to approximate
> >>> facet counts.
> >>> I haven't had time to look at it in detail or implement anything yet.
> >>> But it is on our To Do list :)
> >>> He may add some info here.
> >>>
> >>> Cheers
> >>>
> >>>
> >>>
> >>>
> >>> [1]
> >>> https://blog.yld.io/2017/04/19/hyperloglog-a-probabilistic-data-structure/
> >>>
> >>>
> >>>
> >>> -----
> >>> ---------------
> >>> Alessandro Benedetti
> >>> Search Consultant, R&D Software Engineer, Director
> >>> Sease Ltd. - www.sease.io
> >>> --
> >>> Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html
> >>>
> >
>
