On 3/8/2016 6:56 AM, Robert Brown wrote:
> I have 2 shards, each with 1 replica.
>
> When sending the same request to the cluster, I'm seeing the same
> results, but ordered differently, and with different scores.
>
> Does this highlight an issue with my index, or is this an accepted
> anomaly?

SolrCloud's method of operation can leave a different number of
deleted documents on each replica of a shard.  Deleted documents that
have not yet been merged out of the index still count toward the term
statistics used in scoring.  Because SolrCloud picks an available
replica at random to satisfy each query, different requests will use
different replicas, so the same query can come back with different
scores and a different order.
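
To make the effect concrete, here is a rough sketch in Python.  It
assumes the classic Lucene TF-IDF formula, idf = 1 + ln(maxDoc /
(docFreq + 1)), and the document counts are hypothetical numbers chosen
only for illustration.  Other similarity implementations use different
formulas, but they depend on the same index statistics.

```python
import math

def classic_idf(doc_freq, max_doc):
    """IDF as computed by Lucene's classic TF-IDF similarity (assumed here)."""
    return 1.0 + math.log(max_doc / (doc_freq + 1.0))

# Hypothetical statistics for the same term on two replicas of one shard.
# Replica B still holds 500 deleted documents, 30 of which contain the term.
replica_a = classic_idf(doc_freq=100, max_doc=10000)
replica_b = classic_idf(doc_freq=130, max_doc=10500)

print(f"replica A idf: {replica_a:.4f}")
print(f"replica B idf: {replica_b:.4f}")
```

The two replicas hold the same live documents, yet the term gets a
different IDF on each, which is enough to change scores and ordering.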

Distributed IDF, available starting in version 5.0 and described on the
following documentation page, can help even these differences out, but
will not completely eliminate them:

https://cwiki.apache.org/confluence/display/solr/Distributed+Requests

The only way that I know of to completely wipe out these anomalies is to
optimize your collection.  An optimize rewrites the entire index and
purges deleted documents as it runs.  It tends to be very slow and can
be very disruptive to Solr's performance.  It will also block
deleteByQuery requests until it is finished.
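
An optimize is normally requested by sending optimize=true to the
collection's update handler.  A minimal sketch in Python follows; the
base URL and the collection name "mycollection" are assumptions, so
substitute your own, and the long timeout is only a guess at how long a
large index might take.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

def optimize_url(base_url, collection):
    """Build the update-handler URL that asks Solr to optimize a collection."""
    query = urlencode({"optimize": "true"})
    return f"{base_url.rstrip('/')}/{collection}/update?{query}"

def run_optimize(base_url, collection, timeout=3600):
    """Send the optimize request and return the HTTP status code.

    Not called automatically: an optimize is slow and disruptive.
    """
    with urlopen(optimize_url(base_url, collection), timeout=timeout) as resp:
        return resp.status

if __name__ == "__main__":
    # Hypothetical host and collection name; only prints the URL.
    print(optimize_url("http://localhost:8983/solr", "mycollection"))
```

Because of the cost described above, this is best run during a quiet
period.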

Thanks,
Shawn
