Re: diversity in results

2008-08-04 Thread Grant Ingersoll
On Aug 4, 2008, at 6:17 PM, Jason Rennie wrote: Does the MLT handler simply select a few high tfidf terms from the doc and use them as a query? Sounds like a useful tool. Do you know anything about relevant performance issues? I noticed that the Solr MoreLikeThis wiki page recommends tu

Re: diversity in results

2008-08-04 Thread Otis Gospodnetic
x27;t think things changed that much in the last few years. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Jason Rennie <[EMAIL PROTECTED]> > To: solr-user@lucene.apache.org > Sent: Monday, August 4, 2008 6:17:28 PM > Su

Re: diversity in results

2008-08-04 Thread Jason Rennie
Does the MLT handler simply select a few high tfidf terms from the doc and use them as a query? Sounds like a useful tool. Do you know anything about relevant performance issues? I noticed that the Solr MoreLikeThis wiki page recommends turning on TermVectors for corresponding fields. Can lucen

Re: diversity in results

2008-08-04 Thread Jason Rennie
Thanks for the pointers. Looks interesting, at least as a starting point for something more sophisticated. Cheers, Jason On Mon, Aug 4, 2008 at 4:38 PM, Grant Ingersoll <[EMAIL PROTECTED]> wrote: > See https://issues.apache.org/jira/browse/SOLR-236 and > http://wiki.apache.org/solr/FieldCollap

Re: diversity in results

2008-08-04 Thread Brian Whitman
On Aug 4, 2008, at 12:50 PM, Jason Rennie wrote: Is there any option in solr to encourage diversity in the results? Our solr index has millions of products, many of which are quite similar to each other. Even something simple like max 50% text overlap in successive results would be valuabl

Re: diversity in results

2008-08-04 Thread Grant Ingersoll
See https://issues.apache.org/jira/browse/SOLR-236 and http://wiki.apache.org/solr/FieldCollapsing , but I gather it has been languishing. I also don't think it will do anything as extensive as the text similarity question you are asking (50% overlap) but I have not tried it. -Grant On Au