On Aug 4, 2008, at 6:17 PM, Jason Rennie wrote:
Does the MLT handler simply select a few high tfidf terms from the
doc and
use them as a query? Sounds like a useful tool. Do you know
anything about
relevant performance issues? I noticed that the Solr MoreLikeThis
wiki page
recommends tu
x27;t think things changed
that much in the last few years.
Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
- Original Message
> From: Jason Rennie <[EMAIL PROTECTED]>
> To: solr-user@lucene.apache.org
> Sent: Monday, August 4, 2008 6:17:28 PM
> Su
Does the MLT handler simply select a few high tfidf terms from the doc and
use them as a query? Sounds like a useful tool. Do you know anything about
relevant performance issues? I noticed that the Solr MoreLikeThis wiki page
recommends turning on TermVectors for corresponding fields. Can lucen
Thanks for the pointers. Looks interesting, at least as a starting point
for something more sophisticated.
Cheers,
Jason
On Mon, Aug 4, 2008 at 4:38 PM, Grant Ingersoll <[EMAIL PROTECTED]> wrote:
> See https://issues.apache.org/jira/browse/SOLR-236 and
> http://wiki.apache.org/solr/FieldCollap
On Aug 4, 2008, at 12:50 PM, Jason Rennie wrote:
Is there any option in solr to encourage diversity in the results?
Our solr
index has millions of products, many of which are quite similar to
each
other. Even something simple like max 50% text overlap in successive
results would be valuabl
See https://issues.apache.org/jira/browse/SOLR-236 and http://wiki.apache.org/solr/FieldCollapsing
, but I gather it has been languishing. I also don't think it will do
anything as extensive as the text similarity question you are asking
(50% overlap) but I have not tried it.
-Grant
On Au