Max - field collapsing may be your friend - 
https://issues.apache.org/jira/browse/SOLR-236


This field collapsing keeps coming up...

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



----- Original Message ----
> From: Max Scheffler <max.scheff...@freiheit.com>
> To: solr-user@lucene.apache.org
> Sent: Friday, December 12, 2008 10:23:23 AM
> Subject: Unwanted clustering of search results after sorting by score
> 
> Hallo,
> 
> We have a website on which you can search through a large amount of
> products from different shops.
> 
> The information describing the products are provided to us by the shops
> which sell these products.
> 
> If we sort a search result by score many products of the same shop are
> clustered together. The reason for this behavior is that a shops tend to
> use the same 'style' to describe their products. For example:
> 
> Shop 'foo' describes its products with 250 words and uses the searched
> word once. Shop 'bar' describes its products with only 25 words and also
> uses the searched word once. The score for shop 'foo' will be much worst
> than for shop 'bar'. In a search in which are many products of shop
> 'foo' and 'bar' the products of shop 'bar' are shown before the products
> of shop 'foo'.
> 
> We tried to avoid this behavior by not using the term frequency. But
> after this we got very strange products under the first results.
> 
> Has anybody an idea to avoid the clustering of products (documents)
> which are from the same shop?
> 
> Greetings
>     Max

Reply via email to