On 4/13/2016 4:57 AM, Anil wrote:
> Yes Erick.
>
> I have the attached the queries generated from logs.
>
> i see many duplicate records :( . i could not see any duplicates on
> solr admin console.
>
> Each run giving different number of duplicates.
>
> Do you think Not (-) on query is an issue? please advice.

There are two ways this can happen.  One is that the index has changed
between different queries, pushing or pulling results between the end of
one page and the beginning of the next page.  The other is having the
same uniqueKey value in more than one shard.

Lior Sapir indicated that SolrCloud would behave differently and
eliminate all duplicates from multiple shards, but this is *not* the
case.  Both cloud and non-cloud behave the same.  When the duplicates
are on different pages, they will not be filtered out.  Solr *will*
eliminate duplicates from all results *in the same query* ... but
different pages are different queries.

Thanks,
Shawn

Reply via email to