Re: Distributed query: strange behavior.

2013-05-28 Thread Valery Giner
Eric, Thank you for the explanation. My problem was that allowing the docs with the same unique ids to be present in the multiple shards in a "normal" situation, makes it impossible to estimate the number of shards needed for an index with a "really large" number of docs. Thanks, Val On 05

Re: Distributed query: strange behavior.

2013-05-27 Thread Luis Cappa Banda
Hello, guys! Well, I've done some tests and I think that there exists some kind of bug related with distributed search. Currently I'm setting a key field that it's impossible to be duplicated, and I have experienced the same wrong behavior with numFound field while changing rows parameter. Has any

Re: Distributed query: strange behavior.

2013-05-26 Thread Luis Cappa Banda
Hi, Erick! That's it! I'm using a custom implementation of a SolrServer with distributed behavior that routes queries and updates using an in-house Round Robin method. But the thing is that I'm doing this myself because I've noticed that duplicated documents appears using LBHttpSolrServer implemen

Re: Distributed query: strange behavior.

2013-05-26 Thread Erick Erickson
Valery: I share your puzzlement. _If_ you are letting Solr do the document routing, and not doing any of the custom routing, then the same unique key should be going to the same shard and replacing the previous doc with that key. But, if you're using custom routing, if you've been experimenting w

Re: Distributed query: strange behavior.

2013-05-24 Thread Shalin Shekhar Mangar
The uniqueKey is enforced within the same shard/index only. On Fri, May 24, 2013 at 6:39 PM, Valery Giner wrote: > Shawn, > > How is it possible for more than one document with the same unique key to > appear in the index, even in different shards? > Isn't it a bug by definition? > What am I mis

Re: Distributed query: strange behavior.

2013-05-24 Thread Valery Giner
Shawn, How is it possible for more than one document with the same unique key to appear in the index, even in different shards? Isn't it a bug by definition? What am I missing here? Thanks, Val On 05/23/2013 09:55 AM, Shawn Heisey wrote: On 5/23/2013 1:51 AM, Luis Cappa Banda wrote: I've qu

Re: Distributed query: strange behavior.

2013-05-24 Thread Luis Cappa Banda
Uhm... that sounds reasonable. My data model may allow duplicate keys, but it's quite difficult. My key is a hash formed by an URL during a crawling process, and it's posible to re-crawl an existing URL. I think that I need to find a new way to compose an unique key to avoid this kind of bad behavi

Re: Distributed query: strange behavior.

2013-05-23 Thread Shawn Heisey
On 5/23/2013 1:51 AM, Luis Cappa Banda wrote: > I've query each Solr shard server one by one and the total number of > documents is correct. However, when I change rows parameter from 10 to 100 > the total numFound of documents change: I've seen this problem on the list before and the cause has be