Hi Martijn,

I created a JIRA issue and attached a test that fails. It seems to exhibit the 
same issue that I see on my local box. (If you run it multiple times you can 
see that the group value of the top doc changes between runs.)

Also, I had to add fixShardCount = true; in the constructor of the 
TestDistributedGrouping class, which caused another test case to fail. (It's 
commented out in the patch with a TODO above it.)

Please let me know if you need any other information.
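
For anyone who wants to check the "matches" discrepancy outside the test case, here is a rough Python sketch of the two requests quoted further down in this thread. The host, core names and group field are taken from those quoted URLs. The helper names (grouping_url, parse_matches, fetch_matches) are hypothetical and only for illustration; this is not part of the patch attached to the issue.

```python
from urllib.parse import urlencode
from urllib.request import urlopen
import xml.etree.ElementTree as ET

# Assumed from the URLs quoted in this thread; adjust for your own setup.
BASE = "http://localhost:8983/solr/core0/select/"
SHARDS = "localhost:8983/solr/core0,localhost:8983/solr/core1"
GROUP_FIELD = "group_field"


def grouping_url(rows):
    """Build the distributed grouping request for a given rows value."""
    params = [
        ("q", "*:*"),
        ("shards", SHARDS),
        ("group", "true"),
        ("group.field", GROUP_FIELD),
        ("group.limit", 10),
        ("rows", rows),
    ]
    # Keep *, :, / and , unescaped so the URL matches the ones in the thread.
    return BASE + "?" + urlencode(params, safe="*:/,")


def parse_matches(xml_text):
    """Return the grouped "matches" count from a Solr XML response body."""
    root = ET.fromstring(xml_text)
    node = root.find(
        ".//lst[@name='grouped']/lst[@name='%s']/int[@name='matches']" % GROUP_FIELD
    )
    if node is None:
        raise ValueError("no grouped matches element in response")
    return int(node.text)


def fetch_matches(rows):
    """Send the request to a running Solr instance and return matches."""
    with urlopen(grouping_url(rows)) as resp:
        return parse_matches(resp.read())
```

With Solr running, comparing fetch_matches(1) against fetch_matches(2) should show whether the count depends on rows. Since matches is summed over the shard responses, two cores with 2 documents each would be expected to give 4 in both cases.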

https://issues.apache.org/jira/browse/SOLR-3316

Thanks!!
Cody

-----Original Message-----
From: martijn.is.h...@gmail.com [mailto:martijn.is.h...@gmail.com] On Behalf Of 
Martijn v Groningen
Sent: Monday, April 02, 2012 10:49 PM
To: solr-user@lucene.apache.org
Subject: Re: Distributed grouping issue

I tried to reproduce this. However, the matches always returns 4 in my case 
(when using rows=1 and rows=2).
In your case the 2 documents on each core do belong to the same group, right?

I did find something else. If I use rows=0 then an error occurs. I think we 
need to further investigate this.
Can you open an issue in Jira? I'm a bit busy today. We can then further look 
into this in the coming days.

Martijn

On 2 April 2012 23:00, Young, Cody <cody.yo...@move.com> wrote:

> Okay, I've played with this a bit more. Found something interesting:
>
> When the groups returned do not include results from a core, then the 
> core is excluded from the count. (I have 1 group, 2 documents per 
> core)
>
> Example:
>
>
> http://localhost:8983/solr/core0/select/?q=*:*&shards=localhost:8983/solr/core0,localhost:8983/solr/core1&group=true&group.field=group_field&group.limit=10&rows=1
>
> <lst name="grouped">
> <lst name="group_field">
> <int name="matches">2</int>
>
> Then, just by changing to rows=2:
>
>
> http://localhost:8983/solr/core0/select/?q=*:*&shards=localhost:8983/solr/core0,localhost:8983/solr/core1&group=true&group.field=group_field&group.limit=10&rows=2
>
> <lst name="grouped">
> <lst name="group_field">
> <int name="matches">4</int>
>
> Let me know if you have any luck reproducing.
>
> Thanks,
> Cody
>
> -----Original Message-----
> From: martijn.is.h...@gmail.com [mailto:martijn.is.h...@gmail.com] On 
> Behalf Of Martijn v Groningen
> Sent: Monday, April 02, 2012 1:48 PM
> To: solr-user@lucene.apache.org
> Subject: Re: Distributed grouping issue
>
> >
> > All documents of a group exist on a single shard, there are no 
> > cross-shard groups.
> >
> You only have to partition documents by group when the groupCount and 
> some other features need to be accurate. For the "matches" this is not 
> necessary. The matches are summed up when merging the shard responses.
>
> I can't reproduce the error you are describing on a small local setup 
> I have here. I have two Solr cores with a simple schema. Each core has 
> 3 documents. When grouping, the matches element returns 6. I'm running 
> on a trunk that I updated 30 minutes ago. Can you try to isolate 
> the problem by testing with a small subset of your data?
>
> Martijn
>



--
Met vriendelijke groet,

Martijn van Groningen
