Just use the edismax query parser with bigrams and trigrams enabled (the pf2 and pf3 parameters) and the default operator set to OR. That will select every sentence that is even vaguely similar, and sentences with more matching words and phrases will score higher.
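
As a rough sketch of what such a request might look like, the Python below only builds the query parameters and the select URL; it does not contact a server. The field name `sentence_text`, the core name `sentences`, and the host and port are assumptions made up for illustration, so substitute whatever your schema and installation actually use. The field is assumed to be an ordinary tokenized text field.

```python
from urllib.parse import urlencode


def build_similarity_params(sentence, field="sentence_text", rows=10):
    """Build edismax parameters for a loose "similar sentence" query.

    `field` is a hypothetical field name, not something from this thread.
    """
    return {
        "defType": "edismax",  # use the extended dismax query parser
        "q": sentence,         # the input sentence is the query text
        "q.op": "OR",          # any single matching word selects a document
        "qf": field,           # field searched for individual words
        "pf2": field,          # boost documents matching word bigrams
        "pf3": field,          # boost documents matching word trigrams
        "fl": "*,score",       # return stored fields plus the relevance score
        "rows": rows,
    }


def build_select_url(base_url, sentence, **kwargs):
    """Return the full /select URL for the given core base URL."""
    params = build_similarity_params(sentence, **kwargs)
    return base_url.rstrip("/") + "/select?" + urlencode(params)


if __name__ == "__main__":
    # Hypothetical local core; adjust host, port, and core name.
    print(build_select_url("http://localhost:8983/solr/sentences",
                           "the quick brown fox jumps"))
```

Characters that the query parser treats as syntax (quotes, colons, and so on) are passed through unchanged here, so you may want to strip or escape them in the input sentence first.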

-- Jack Krupansky

-----Original Message-----
From: Jayni
Sent: Friday, December 13, 2013 12:02 PM
To: solr-user@lucene.apache.org
Subject: Similarity search with Solr

Hi,

I want to do a similarity search on millions of sentences. They are written
in natural language, and I want to find sentences that have a "similar" set
of words.
My aim is a search based on trigrams, or a kind of full-text search, that
finds similar sentences.
Previously I used PostgreSQL, but it was far too slow.

Do you think it's possible to implement a performant similarity search like
the one described with Solr, and do you think it's the right search engine
for that?
Thanks for your answers!

Janek



--
View this message in context: http://lucene.472066.n3.nabble.com/Similarity-search-with-Solr-tp4106623.html
Sent from the Solr - User mailing list archive at Nabble.com.