Hi,

when you write "I get xxx results", does it come from 'numFound' ? Or
you really display xxx results ?
When using both field collapsing and sharding, the 'numFound' may be
wrong. In that case, think about using 'shards.rows' parameter with a
high value (be careful, it's bad for performance).

If the problem is really about the returned results, it may be because
of several documents having the same unique key "document_id" in
different shards.

Hope it helps,
Franck



Le vendredi 30 mars 2012 à 23:52 +0000, Young, Cody a écrit :
> I forgot to mention, I can see the distributed requests happening in the logs:
> 
> Mar 30, 2012 4:49:13 PM org.apache.solr.core.SolrCore execute
> INFO: [core2] webapp=/solr path=/select 
> params={group.distributed.first=true&distrib=false&wt=javabin&rows=10&version=2&fl=document_id,score&shard.url=localhost:8086/solr/core2&NOW=1333151353217&start=0&q=*:*&group.field=group_field&group=true&isShard=true}
>  status=0 QTime=2
> Mar 30, 2012 4:49:13 PM org.apache.solr.core.SolrCore execute
> INFO: [core4] webapp=/solr path=/select 
> params={group.distributed.first=true&distrib=false&wt=javabin&rows=10&version=2&fl=document_id,score&shard.url=localhost:8086/solr/core4&NOW=1333151353217&start=0&q=*:*&group.field=group_field&group=true&isShard=true}
>  status=0 QTime=1
> Mar 30, 2012 4:49:13 PM org.apache.solr.core.SolrCore execute
> INFO: [core1] webapp=/solr path=/select 
> params={group.distributed.first=true&distrib=false&wt=javabin&rows=10&version=2&fl=document_id,score&shard.url=localhost:8086/solr/core1&NOW=1333151353217&start=0&q=*:*&group.field=group_field&group=true&isShard=true}
>  status=0 QTime=1
> Mar 30, 2012 4:49:13 PM org.apache.solr.core.SolrCore execute
> INFO: [core3] webapp=/solr path=/select 
> params={group.distributed.first=true&distrib=false&wt=javabin&rows=10&version=2&fl=document_id,score&shard.url=localhost:8086/solr/core3&NOW=1333151353217&start=0&q=*:*&group.field=group_field&group=true&isShard=true}
>  status=0 QTime=1
> Mar 30, 2012 4:49:13 PM org.apache.solr.core.SolrCore execute
> INFO: [core0] webapp=/solr path=/select 
> params={group.distributed.first=true&distrib=false&wt=javabin&rows=10&version=2&fl=document_id,score&shard.url=localhost:8086/solr/core0&NOW=1333151353217&start=0&q=*:*&group.field=group_field&group=true&isShard=true}
>  status=0 QTime=1
> Mar 30, 2012 4:49:13 PM org.apache.solr.core.SolrCore execute
> INFO: [core6] webapp=/solr path=/select 
> params={group.distributed.first=true&distrib=false&wt=javabin&rows=10&version=2&fl=document_id,score&shard.url=localhost:8086/solr/core6&NOW=1333151353217&start=0&q=*:*&group.field=group_field&group=true&isShard=true}
>  status=0 QTime=0
> Mar 30, 2012 4:49:13 PM org.apache.solr.core.SolrCore execute
> INFO: [core7] webapp=/solr path=/select 
> params={group.distributed.first=true&distrib=false&wt=javabin&rows=10&version=2&fl=document_id,score&shard.url=localhost:8086/solr/core7&NOW=1333151353217&start=0&q=*:*&group.field=group_field&group=true&isShard=true}
>  status=0 QTime=3
> Mar 30, 2012 4:49:13 PM org.apache.solr.core.SolrCore execute
> INFO: [core5] webapp=/solr path=/select 
> params={group.distributed.first=true&distrib=false&wt=javabin&rows=10&version=2&fl=document_id,score&shard.url=localhost:8086/solr/core5&NOW=1333151353217&start=0&q=*:*&group.field=group_field&group=true&isShard=true}
>  status=0 QTime=1
> Mar 30, 2012 4:49:13 PM org.apache.solr.core.SolrCore execute
> INFO: [core4] webapp=/solr path=/select 
> params={distrib=false&group.distributed.second=true&wt=javabin&version=2&rows=10&group.topgroups.group_field=4183765296&group.topgroups.group_field=4608765424&group.topgroups.group_field=3524954944&group.topgroups.group_field=4182445488&group.topgroups.group_field=4213143392&group.topgroups.group_field=4328299312&group.topgroups.group_field=4206259648&group.topgroups.group_field=3465497912&group.topgroups.group_field=3554417600&group.topgroups.group_field=3140802904&fl=document_id,score&shard.url=localhost:8086/solr/core4&NOW=1333151353217&start=0&q=*:*&group.field=group_field&group=true&isShard=true}
>  status=0 QTime=2
> Mar 30, 2012 4:49:13 PM org.apache.solr.core.SolrCore execute
> INFO: [core6] webapp=/solr path=/select 
> params={distrib=false&group.distributed.second=true&wt=javabin&version=2&rows=10&group.topgroups.group_field=4183765296&group.topgroups.group_field=4608765424&group.topgroups.group_field=3524954944&group.topgroups.group_field=4182445488&group.topgroups.group_field=4213143392&group.topgroups.group_field=4328299312&group.topgroups.group_field=4206259648&group.topgroups.group_field=3465497912&group.topgroups.group_field=3554417600&group.topgroups.group_field=3140802904&fl=document_id,score&shard.url=localhost:8086/solr/core6&NOW=1333151353217&start=0&q=*:*&group.field=group_field&group=true&isShard=true}
>  status=0 QTime=2
> Mar 30, 2012 4:49:13 PM org.apache.solr.core.SolrCore execute
> INFO: [core4] webapp=/solr path=/select 
> params={NOW=1333151353217&shard.url=localhost:8086/solr/core4&ids=4182445488-535180165,3554417600-527549713,4608765424-526014561,3524954944-531590393,4183765296-514134497,4206259648-530219973,3465497912-534955957,4213143392-534186349,3140802904-538688961,4328299312-533482537&q=*:*&distrib=false&group.field=group_field&wt=javabin&isShard=true&version=2&rows=10}
>  status=0 QTime=5
> Mar 30, 2012 4:49:13 PM org.apache.solr.core.SolrCore execute
> INFO: [core0] webapp=/solr path=/select/ 
> params={shards=localhost:8086/solr/core0,localhost:8086/solr/core1,localhost:8086/solr/core2,localhost:8086/solr/core3,localhost:8086/solr/core4,localhost:8086/solr/core5,localhost:8086/solr/core6,localhost:8086/solr/core7&q=*:*&group.field=group_field&group=true}
>  status=0 QTime=106 
> 
> -----Original Message-----
> From: Young, Cody [mailto:cody.yo...@move.com] 
> Sent: Friday, March 30, 2012 4:35 PM
> To: solr-user@lucene.apache.org
> Subject: Distributed grouping issue
> 
> Hi All,
> 
> I'm having an issue getting distributed grouping  working on trunk (Mar 29, 
> 2012).
> 
> If I send this query:
> 
> http://localhost:8086/solr/core0/select/?q=*:*&group=false 
> &shards=localhost:8086/solr/core0,localhost:8086/solr/core1,localhost:8086/solr/core2,localhost:8086/solr/core3,localhost:8086/solr/core4,localhost:8086/solr/core5,localhost:8086/solr/core6,localhost:8086/solr/core7
> 
> I get 260,000 results. As soon as I change to using grouping:
> 
> http://localhost:8086/solr/core0/select/?q=*:*&group=true&group.field=group_field&shards=localhost:8086/solr/core0,localhost:8086/solr/core1,localhost:8086/solr/core2,localhost:8086/solr/core3,localhost:8086/solr/core4,localhost:8086/solr/core5,localhost:8086/solr/core6,localhost:8086/solr/core7
> 
> I only get 32,000 results. (the number of documents in a single core.)
> 
> The field that I am grouping on is defined as:
> 
> <field name="group_field" type="string" indexed="true" stored="true" 
> multiValued="false" />
> 
> <fieldType name="string" class="solr.StrField" sortMissingLast="true" 
> omitNorms="true"/>
> 
> The document id:
> 
> 
> <field name="document_id" type="string" indexed="true" stored="true" 
> required="true" />
> 
> <fieldType name="string" class="solr.StrField" sortMissingLast="true" 
> omitNorms="true"/>
> 
> <uniqueKey>document_id</uniqueKey>
> 
> Anyone else experiencing this? Any ideas?
> 
> Thanks,
> Cody


Reply via email to