Hi there, I have a Solr server running that contains tweets. My schema.xml contains the following fields:
<fields>
  <field name="id" type="string" indexed="true" stored="true" required="true" />
  <field name="tweet" type="text_general" indexed="true" stored="true" termVectors="true"/>
  <field name="hashtags" type="text_general" indexed="true" stored="true" termVectors="true"/>
</fields>

My problem is actually quite simple: somewhere in my GUI the user types text, and I want to retrieve the tweets that are most similar to it. Therefore, I tried the "MoreLikeThis" functionality. The issue is that, currently, MLT finds additional tweets for every tweet found by the "select" handler. I'm not sure, however, whether the select handler finds the best-fitting tweet or just returns the first match.

Currently, I am using the following query:

http://localhost:8983/solr/select/?q=tweet:heaven&mlt=true&mlt.fl=tweet,hashtags&wt=json&indent=true

Am I missing something critical? Ultimately, I just want to retrieve x tweets with the most similar text, sorted by their similarity (cosine of termVectors). Is MoreLikeThis the way to go?

Thanks in advance!
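P.S. In case it helps to reproduce the request: below is a minimal sketch of how the query above could be assembled from the GUI text. It assumes the same host, handler path, and field names as in my query. The helper name `build_mlt_url` is hypothetical, and the `rows` and `mlt.count` values are only illustrative guesses at how one might limit the select hits and the number of similar tweets per hit. I have not verified that this gives the ranking I am after.

```python
from urllib.parse import urlencode, parse_qs, urlsplit

# Base URL of the select handler, taken from the query in this post.
SOLR_SELECT = "http://localhost:8983/solr/select/"


def build_mlt_url(user_text, rows=1, mlt_count=10):
    """Build the select + MoreLikeThis query URL (hypothetical helper).

    rows      -- number of documents returned by the select handler (assumed value)
    mlt_count -- number of similar documents requested per result (assumed value)
    """
    params = {
        "q": f"tweet:{user_text}",   # search the tweet field for the user's text
        "rows": rows,
        "mlt": "true",               # enable the MoreLikeThis component
        "mlt.fl": "tweet,hashtags",  # fields used to compute similarity
        "mlt.count": mlt_count,
        "wt": "json",
        "indent": "true",
    }
    # urlencode percent-escapes reserved characters such as ':' and ','.
    return SOLR_SELECT + "?" + urlencode(params)


if __name__ == "__main__":
    print(build_mlt_url("heaven"))
```

With `rows=1`, only the top-scoring select hit would get a MoreLikeThis list. That might be closer to what I want than getting similar tweets for every hit, but that is an assumption on my side.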