On Tue, 2010-10-26 at 15:48 +0200, Ron Mayer wrote: > And a third potential reason - it's arguably a feature instead of a bug > for some applications. Depending on how I organize my shards, "give me > the most relevant document from each shard for this search" seems like > it could be useful.
You can get that even if the shards scored equally, so it is a limitation, not a feature. I hope to find the time later this week to read some of the papers Andrzej was kind enough to point out, but it seems like I really need to do the heavy lifting of setting up comparisons for our own material. The problem is of course to judge the quality of the outputs, but setting the single index as the norm and plotting the differences in document positions in the result sets might provide some insight. Regards, Toke Eskildsen