Indexes cannot be directly compared unless they have similar collection statistics.  That is the same terms occur with the same frequency across all indexes and the average document lengths are about the same (though the default similarity in Lucene may not care about average document length--I'm not sure).

SOLR-303 is an attempt to solve the partitioning issue from the search side of things.

-Sean

Lance Norskog wrote:
Are the score values generated in Solr relative to the index or are they
against an absolute standard?
Is it possible to create a scoring algorithm with this property? Are there
parts of the score inputs that are absolute?
 
My use case is this: I would like to do a parallel search against two Solr
indexes, and combine the results. The two indexes are built with the same
data sources, we just can't handle one giant index. If the score values are
against a common 'scale', then scores from the two search indexes can be
compared. I could combine the result sets with a simple merge by score.
 
This is a difficult concept to explain. I hope I have succeeded.
 
Thanks,
 
Lance

  

Reply via email to