On Tue, Oct 24, 2017 at 8:37 AM, Erick Erickson <erickerick...@gmail.com> wrote:
> bq: It is a bit surprising why facet computation > is so slow even when the query matches hundreds of docs. > > The number of terms in the field over all docs also comes into play. > Say you're faceting over a field that has 100,000,000 unique values > across all docs, that's a lot of bookkeeping. > > 100M unique values might be across all docs, and unless the faceting implementation is really naive I cannot see how that can come into play when the query matches a fraction of those. > Best, > Erick > > > On Tue, Oct 24, 2017 at 1:08 AM, Emir Arnautović > <emir.arnauto...@sematext.com> wrote: > > Hi John, > > Did you mean “docValues don’t work for analysed fields” since it works > for multivalue string (or other supported types) fields. What you need to > do is to convert your analysed field to multivalue string field - that > requires changes in indexing flow. > > > > HTH, > > Emir > > -- > > Monitoring - Log Management - Alerting - Anomaly Detection > > Solr & Elasticsearch Consulting Support Training - http://sematext.com/ > > > > > > > >> On 23 Oct 2017, at 21:08, John Davis <johndavis925...@gmail.com> wrote: > >> > >> Docvalues don't work for multivalued fields. I just started a separate > >> thread with more debug info. It is a bit surprising why facet > computation > >> is so slow even when the query matches hundreds of docs. > >> > >> On Mon, Oct 23, 2017 at 6:53 AM, alessandro.benedetti < > a.benede...@sease.io> > >> wrote: > >> > >>> Hi John, > >>> first of all, I may state the obvious, but have you tried docValues ? > >>> > >>> Apart from that a friend of mine ( Diego Ceccarelli) was discussing a > >>> probabilistic implementation similar to the hyperloglog[1] to > approximate > >>> facets counting. > >>> I didn't have time to take a look in details / implement anything yet. > >>> But it is on our To Do list :) > >>> He may add some info here. > >>> > >>> Cheers > >>> > >>> > >>> > >>> > >>> [1] > >>> https://blog.yld.io/2017/04/19/hyperloglog-a- > probabilistic-data-structure/ > >>> > >>> > >>> > >>> ----- > >>> --------------- > >>> Alessandro Benedetti > >>> Search Consultant, R&D Software Engineer, Director > >>> Sease Ltd. - www.sease.io > >>> -- > >>> Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html > >>> > > >