Hi,

Sorry if this is a very basic question, but I am pretty new to SolrCloud and
I am trying to understand the underlying mechanism for calculating relevancy.

Currently we are using Solr 3.6.x, and we use shards to perform distributed
searching. Our shards are not of equal size, so the results are sometimes
not what we expect.

For example: shard 1 has 30 million documents, shard 2 has 30 million
documents, and shard 3 has just 3 million documents (push indexing via
message queue).

When we do a search using shards, documents from shard 1 and shard 2 get
higher priority than documents from shard 3 (since it is smaller).
Currently we add an index-time boost when adding documents to shard 3 so
that documents from shard 3 also come up higher in the search results.
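
To illustrate what I think is happening, here is a rough sketch only. I am
assuming each shard computes idf from its own local statistics with the
classic Lucene TF-IDF formula, idf = 1 + ln(numDocs / (docFreq + 1)). The
docFreq of 100 per shard is a made-up number, and the function names are
mine, not Solr's.

```python
import math


def classic_idf(num_docs: int, doc_freq: int) -> float:
    """Classic Lucene TF-IDF style idf: 1 + ln(numDocs / (docFreq + 1))."""
    return 1.0 + math.log(num_docs / (doc_freq + 1))


# Shard sizes taken from the example above.
SHARD_SIZES = {
    "shard1": 30_000_000,
    "shard2": 30_000_000,
    "shard3": 3_000_000,
}

# Assumption: the search term matches the same number of documents in
# every shard. This value is invented purely for illustration.
ASSUMED_DOC_FREQ = 100


def per_shard_idf(shard_sizes: dict, doc_freq: int) -> dict:
    """Compute idf separately per shard, using only that shard's doc count."""
    return {name: classic_idf(size, doc_freq) for name, size in shard_sizes.items()}


if __name__ == "__main__":
    for name, value in per_shard_idf(SHARD_SIZES, ASSUMED_DOC_FREQ).items():
        print(f"{name}: idf = {value:.4f}")
```

Under these assumptions the same term gets a lower idf in shard 3 than in
shards 1 and 2, purely because shard 3 holds fewer documents, which would
match what we see.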

Now, when using SolrCloud, say shard 1 has the same person name repeated 5
times (each with a different unique id) and shard 2 has one more document
with that same person name (again with a different id). When we do a search,
how does Solr calculate the score? Does it do something like constant
scoring across the shards so that results from every shard are ranked
consistently? Do all 6 documents get the same score (5 from shard 1 and 1
from shard 2, assuming all the fields have the same value except for the
unique id)?

Thanks,
BB 



--
View this message in context: 
http://lucene.472066.n3.nabble.com/SolrCloud-Score-calculation-tp4071805.html
Sent from the Solr - User mailing list archive at Nabble.com.
