Hi,

I'm using SOLR 4.x from trunk. This was the version from 2012-07-10. So
this is one of the latest versions.

I searched mailing list and jira but found only this
https://issues.apache.org/jira/browse/SOLR-3436

It was committed in May to trunk so my version of SOLR has this fix. But
the problem still exists.

Cheers
Agnieszka

2012/7/15 Erick Erickson <erickerick...@gmail.com>

> what version of Solr are you using? There's been quite a bit of work
> on this lately,
> I'm not even sure how much has made it into 3.6. You might try searching
> the
> JIRA list, Martijn van Groningen has done a bunch of work lately, look for
> his name. Fortunately, it's not likely to get a bunch of false hits <G>..
>
> Best
> Erick
>
> On Fri, Jul 13, 2012 at 7:50 AM, Agnieszka Kukałowicz
> <agnieszka.kukalow...@usable.pl> wrote:
> > Hi,
> >
> > I have problem with faceting count in distributed grouping. It appears
> only
> > when I make query that returns almost all of the documents.
> >
> > My SOLR implementation has 4 shards and my queries looks like:
> >
> > http://host:port
> >
> /select/q?=*:*&shards=shard1,shard2,shard3,shard4&group=true&group.field=id&group.facet=true&group.ngroups=true&facet.field=category1&facet.missing=false&facet.mincount=1
> >
> > With query like above I get strange counts for field category1.
> > The counts for values are very big:
> > <int name="val1">9659</int>
> > <int name="val2">7015</int>
> > <int name="val3">5676</int>
> > <int name="val4">1180</int>
> > <int name="val5">1105</int>
> > <int name="val6">979</int>
> > <int name="val7">770</int>
> > <int name="val8">701</int>
> > <int name="">612</int>
> > <int name="val9">422</int>
> > <int name="val10">358</int>
> >
> > When I make query to narrow the results adding to query
> > fq=category1:"val1", etc. I get different counts than facet category1
> shows
> > for a few first values:
> >
> > fq=category1:"val1" - counts: 22
> > fq=category1:"val2" - counts: 22
> > fq=category1:"val3" - counts: 21
> > fq=category1:"val4" - counts: 19
> > fq=category1:"val5" - counts: 19
> > fq=category1:"val6" - counts: 20
> > fq=category1:"val7" - counts: 20
> > fq=category1:"val8" - counts: 25
> > fq=category1:"val9" - counts: 422
> > fq=category1:"val10" - counts: 358
> >
> > From val9 the count is ok.
> >
> > First I thought that for some values in facet "category1" groups count
> does
> > not work and it returns counts of all documents not group by field id.
> > But the number of all documents matches query  fq=category1:"val1" is
> > 45468. So the numbers are not the same.
> >
> > I check the queries on each shard for val1 and the results are:
> >
> > shard1:
> > query:
> >
> http://shard1/select/?q=*:*group=true&group.field=id&group.facet=true&group.ngroups=true&facet.field=category1&facet.missing=false&facet.mincount=1
> >
> > <lst name="fcategory">
> > <int name="val1">11</int>
> >
> > query:
> >
> http://shard1/select/?q=*:*group=true&group.field=id&group.facet=true&group.ngroups=true&facet.field=category1&facet.missing=false&facet.mincount=1&fq=category1
> > :"val1"
> >
> > shard 2:
> > query:
> >
> http://shard2/select/?q=*:*group=true&group.field=id&group.facet=true&group.ngroups=true&facet.field=category1&facet.missing=false&facet.mincount=1
> >
> > there is no value "val1" in category1 facet.
> >
> > query:
> >
> http://shard2/select/?q=*:*group=true&group.field=id&group.facet=true&group.ngroups=true&facet.field=category1&facet.missing=false&facet.mincount=1&fq=category1
> > :"val1"
> >
> > <int name="ngroups">7</int>
> >
> > shard3:
> > query:
> >
> http://shard3/select/?q=*:*group=true&group.field=id&group.facet=true&group.ngroups=true&facet.field=category1&facet.missing=false&facet.mincount=1
> >
> > there is no value val1 in category1 facet
> >
> > query:
> >
> http://shard3/select/?q=*:*group=true&group.field=id&group.facet=true&group.ngroups=true&facet.field=category1&facet.missing=false&facet.mincount=1&fq=category1
> > :"val1"
> >
> > <int name="ngroups">4</int>
> >
> > So it looks that detail query with fq=category1:"val1" returns the
> relevant
> > results. But Solr has problem with faceting counts when one of the shard
> > does not return the faceting value (in this scenario "val1") that exists
> on
> > other shards.
> >
> > I checked shards for "val10" and I got:
> >
> > shard1: count for val10 - 142
> > shard2: count for val10 - 131
> > shard3: count for val10 -  149
> > sum of counts 422 - ok.
> >
> > I'm not sure how to resolve that situation. For sure the counts of val1
> to
> > val9 should be different and they should not be on the top of the
> category1
> > facet because this is very confusing. Do you have any idea how to fix
> this
> > problem?
> >
> > Best regards
> > Agnieszka
>

Reply via email to