Hi there,
i have a solr server running containing tweets. my schema.xml contains
following fields:

<fields>
 <field name="id" type="string" indexed="true" stored="true"
 required="true" />
 <field name="tweet" type="text_general"
indexed="true" stored="true" termVectors="true"/>
 <field
name="hashtags" type="text_general" indexed="true" stored="true"
termVectors="true"/>
</fields>

my problem is actually quite simple; somewhere in my GUI the user types
text and i want to retrieve tweets that are most similar to it.
Therefore, i tried the "morelikethis" functionality. My problem is that
currently, mlt finds additional tweets to every tweet found by the
"select" handler. I'm not sure however if the select handler finds the
most fitting tweet or just returns the first match. currently, i am
using following query:

http://localhost:8983/solr/select/?q=tweet:heaven&mlt=true&mlt.fl=tweet,hashtags&wt=json&indent=true

Am i missing something critical? So eventually, i just want to retrieve
x tweets with the most similar text, sorted by their similarity (cosine
of termVectors). Is MoreLikeThis the way to go?

Thanks in advance!

Reply via email to