Max - field collapsing may be your friend - https://issues.apache.org/jira/browse/SOLR-236
This field collapsing keeps coming up... Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch ----- Original Message ---- > From: Max Scheffler <max.scheff...@freiheit.com> > To: solr-user@lucene.apache.org > Sent: Friday, December 12, 2008 10:23:23 AM > Subject: Unwanted clustering of search results after sorting by score > > Hallo, > > We have a website on which you can search through a large amount of > products from different shops. > > The information describing the products are provided to us by the shops > which sell these products. > > If we sort a search result by score many products of the same shop are > clustered together. The reason for this behavior is that a shops tend to > use the same 'style' to describe their products. For example: > > Shop 'foo' describes its products with 250 words and uses the searched > word once. Shop 'bar' describes its products with only 25 words and also > uses the searched word once. The score for shop 'foo' will be much worst > than for shop 'bar'. In a search in which are many products of shop > 'foo' and 'bar' the products of shop 'bar' are shown before the products > of shop 'foo'. > > We tried to avoid this behavior by not using the term frequency. But > after this we got very strange products under the first results. > > Has anybody an idea to avoid the clustering of products (documents) > which are from the same shop? > > Greetings > Max