Solr mlt doesn't return documents with "exactly the same" contents

2014-11-27 Thread hhc
I have two documents with ids "aaa" and "bbb", and the titles of both documents are "a black fox jumps over a red flower". I imported both documents, along with several other testing documents, two a core "test". I want solr to return documents similar to document "aaa", so I submited the followi

Re: Solr mlt doesn't return documents with "exactly the same" contents

2014-11-27 Thread hhc
Hi Nishant, Thank you for the reply. I believe that solr removes the first document from the mlt list because a document is most similar to "itself" and thus should be removed. In my case, "aaa" and "bbb" are two different documents. When search for documents similar to "aaa", the document "a

Re: Solr mlt doesn't return documents with "exactly the same" contents

2014-11-27 Thread hhc
After carefully reading the mlt parameters here https://wiki.apache.org/solr/MoreLikeThis I found that I can specify the following parameters to return "bbb" when search for similar documents of "aaa": mlt.mintf=1 mlt.mindf=2 Details: mlt.mintf: Minimum Term Frequency - the frequency below which