Furthermore I was checking the internals of the old facet implementation (
which comes when using the classic request parameter based,  instead of the
json facet). It seems that if you enable docValues even with the enun
method passed as parameter , actually fc with docValues will be used.
i will give some report on the performance we get with docValues.

Cheers
On 23 May 2016 16:29, "Joel Bernstein" <joels...@gmail.com> wrote:

> If you can make min/max work for you instead of sort then it should be
> faster, but I haven't spent time comparing the performance.
>
> But if you're using the top_fc with the min/max param the performance
> between Solr 4 & Solr 6 should be very close as the data structures behind
> them are the same.
>
>
>
>
>
>
> Joel Bernstein
> http://joelsolr.blogspot.com/
>
> On Mon, May 23, 2016 at 3:34 PM, Alessandro Benedetti <
> abenede...@apache.org
> > wrote:
>
> > Hi Joel,
> > thanks for the reply, actually we were not using field collapsing before,
> > we basically want to replace grouping with that.
> > The grouping performance between Solr 4 and 6 are basically comparable.
> > It's surprising I got so big degradation with the field collapsing.
> >
> > So basically the comparison we did were based on the Solr4 queries ,
> > extracted from logs, and modified slightly to include field collapsing
> > parameter.
> >
> > To build the tests to compare Solr 4.10.2 to Solr 6 we basically
> proceeded
> > in this way :
> >
> > 1) install Solr 4.10.2 and Solr 6.0.0
> > 2) migrate the index with the related lucene tool ( 4.10.2 -> 5.5.0 ->
> Solr
> > 6.0 )
> > 3) switch on/off the 2 instances and repeating the tests both with cold
> > instances and warm instances.
> >
> > This means that the query looks the same.
> > I have not double checked the results but only the timings.
> > I will provide additional feedback to see if the query are producing
> > comparable results as well.
> >
> > Related your suggestion about the top_fc, thanks, I will try that .
> > I actually discovered that a little bit after I posted the mailing list
> ( I
> > think exactly from another post of yours :) )
> >
> > Not sure if setting up docValues for the field we use to collapse could
> > give some benefit as well.
> >
> > I keep you updated,
> >
> > Cheers
> >
> > On Mon, May 23, 2016 at 2:48 PM, Joel Bernstein <joels...@gmail.com>
> > wrote:
> >
> > > Were you using the sort param or min/max param in Solr 4 to select the
> > > group head? The sort work came later and I'm not sure how it compares
> in
> > > performance to the min/max param.
> > >
> > > Since you are collapsing on a string field you can use the top_fc hint
> > > which will use a top level field cache for the collapse. This is faster
> > at
> > > query time then the default which uses MultiDocValue ordinal map.
> > >
> > > The docs cover the top_fc hint.
> > >
> > >
> >
> https://cwiki.apache.org/confluence/display/solr/Collapse+and+Expand+Results
> > >
> > >
> > >
> > > Joel Bernstein
> > > http://joelsolr.blogspot.com/
> > >
> > > On Mon, May 23, 2016 at 12:14 PM, Alessandro Benedetti <
> > > abenede...@apache.org> wrote:
> > >
> > > > Let's add some additional details guys :
> > > >
> > > > 1) *Faceting*
> > > > Currently the facet method used is "enum" and it runs over 20 fields
> > more
> > > > or less.
> > > > Mainly using it on low cardinality fields except one which has a
> > > > cardinality of 1000 terms.
> > > > I am aware of the famous Jira related faceting regression :
> > > > https://issues.apache.org/jira/browse/SOLR-8096 .
> > > >
> > > > Our index is indeed quite static ( we index once per day) and the
> > fields
> > > we
> > > > facet on are multi-valued ( by schema definition but not in
> practise) .
> > > > But we use Term Enum as method so i was not expecting to hit the
> > > > regression.
> > > > We currently see  query times which are 30% worse than Solr 4.10.2 .
> > > > Our next experiment will be to enable docValues for all the fields
> and
> > > > verify if we get any benefit ( switching the facet method to fc) .
> > > > At the moment, switching to json faceting is not an option as we
> would
> > > like
> > > > first to proceed with a transparent migration and then possibly add
> > > > improvements and refactor in the future.
> > > > Following will be to fix the schema to set as multi valued only what
> is
> > > > really multi-valued ( do you know if this can affect ? the wrong
> schema
> > > > definition is enough to mess up the facet performance ? even if then
> > the
> > > > fields are single valued ?)
> > > >
> > > >
> > > > 2) *Field Collapsing*
> > > > Field collapsing performance seems much, much worse, something like
> 200
> > > ms
> > > > ( Solr 4) vs 1800 ms ( Solr 6) .
> > > > This is suprising as I never heard about any regression in field
> > > > collapsing.
> > > > I will investigate a little bit more in details about the internals
> of
> > > the
> > > > field collapsing and why the performance could be so degraded.
> > > > I will also verify if I find any info in the mailing list or Jira.
> > > >
> > > > &fq={!collapse field=string_field sort='TrieDoubleField asc'}
> > > >
> > > > let me know if you faced something similar
> > > >
> > > > Cheers
> > > >
> > > > On Fri, May 13, 2016 at 10:41 PM, Alessandro Benedetti <
> > > > abenede...@apache.org> wrote:
> > > >
> > > > > I'm planning a migration from 4.10.2 to 6.0 .
> > > > > Because we generate the index on daily basis from scratch, we don't
> > > need
> > > > > to migrate the index but actually only migrate the server
> instances.
> > > > > With my team we were doing some experiments on some dev machines,
> > > > > basically comparing Solr 4.10.2 and Solr 6.0 to check any
> functional
> > > and
> > > > > performance regression in our use cases.
> > > > >
> > > > > After setting up two installation on the same machine ( switching
> on
> > > and
> > > > > off each version for doing comparison and experiments) we are
> > > verifying a
> > > > > degradation of the performances with Solr 6.
> > > > >
> > > > > Basically from a queryTime and throughput perspective Solr 6 is not
> > > > > performing as well as Solr 4.10.2 .
> > > > > Still need to start the proper investigations but this appears
> weird
> > to
> > > > me.
> > > > > Will proceed with all the analysis of the case and a deep study of
> > our
> > > > > queries ( which anyway are mainly fq , faceting and grouping).
> > > > >
> > > > > Any suggestion in particular to start with ? Has anyone
> experienced a
> > > > > similar migration with similar experience ?
> > > > > I will anyway explore also the mailing list in search for similar
> > > cases.
> > > > >
> > > > > Cheers
> > > > >
> > > > > --
> > > > > --------------------------
> > > > >
> > > > > Benedetti Alessandro
> > > > > Visiting card : http://about.me/alessandro_benedetti
> > > > >
> > > > > "Tyger, tyger burning bright
> > > > > In the forests of the night,
> > > > > What immortal hand or eye
> > > > > Could frame thy fearful symmetry?"
> > > > >
> > > > > William Blake - Songs of Experience -1794 England
> > > > >
> > > >
> > > >
> > > >
> > > > --
> > > > --------------------------
> > > >
> > > > Benedetti Alessandro
> > > > Visiting card : http://about.me/alessandro_benedetti
> > > >
> > > > "Tyger, tyger burning bright
> > > > In the forests of the night,
> > > > What immortal hand or eye
> > > > Could frame thy fearful symmetry?"
> > > >
> > > > William Blake - Songs of Experience -1794 England
> > > >
> > >
> >
> >
> >
> > --
> > --------------------------
> >
> > Benedetti Alessandro
> > Visiting card : http://about.me/alessandro_benedetti
> >
> > "Tyger, tyger burning bright
> > In the forests of the night,
> > What immortal hand or eye
> > Could frame thy fearful symmetry?"
> >
> > William Blake - Songs of Experience -1794 England
> >
>

Reply via email to