Our documents are keyed with UUIDs, and we shard chronologically. The
write events are issued as part of a SQS queue that only allows one
reader to see the message. I think it's pretty unlikely that we have
more than one document with the same uniquekey.

I can actually prove this if it will help the discussion, since I just
dumped 4 of our shards to JSON, but it's over 117 million docs, so
I'll wait until someone asks. :)

Michael Della Bitta

------------------------------------------------
Appinions | 18 East 41st St., Suite 1806 | New York, NY 10017
www.appinions.com
Where Influence Isn’t a Game


On Wed, Aug 8, 2012 at 11:33 AM, Chris Hostetter
<hossman_luc...@fucit.org> wrote:
>
> : We are using Solr3.6 and 2 shards, we are noticing that when we fire a query
> : with start as 0 and rows X the total numFound and the total numFound changes
> : when we fire the same exact query with start as y and rows X.
>
> The only situation where i've ever heard of this happening is when
> multiple
> shards have documents with identical uniqueKeys...
>
> http://mail-archives.apache.org/mod_mbox/lucene-solr-user/201206.mbox/%3CCAPoDz8S4Z-jnyptFXdv7VJdWntY0Lx_=nzhvq0qtcfqyx7m...@mail.gmail.com%3E
> http://mail-archives.apache.org/mod_mbox/lucene-solr-user/201206.mbox/%3Calpine.DEB.2.00.1206191429520.19329@bester%3E
> http://mail-archives.apache.org/mod_mbox/lucene-solr-user/201206.mbox/%3CCAPoDz8S59kzUdCAZwHRquzUhM=C90ReyCNe3Au00xsc=wh0...@mail.gmail.com%3E
>
> As noted in the docs..
>
> http://wiki.apache.org/solr/DistributedSearch?#Distributed_Searching_Limitations
>
> "The unique key field must be unique across all shards. If docs with
> duplicate unique keys are encountered, Solr will make an attempt to return
> valid results, but the behavior may be non-deterministic. "
>
>
>
>
> -Hoss

Reply via email to