Just use the edismax query parser with bigrams and trigrams enabled and the
default operator set to OR. That will select all sentences even vaguely
similar and will more highly score sentences that have a greater number of
words and phrases that match.
-- Jack Krupansky
-----Original Message-----
From: Jayni
Sent: Friday, December 13, 2013 12:02 PM
To: solr-user@lucene.apache.org
Subject: Similarity search with Solr
Hi,
I want to do a similarity search on millions of sentences. They are written
in natural language and I want to find sentences, which have a "similar" set
of words.
A search based on trigrams or a kind of Full Text search, which finds
similar sentences is my aim.
Before I used PostgreSQL, but it was far to slow.
Do you think it's possible to realize a performed similarity search like
described with Solr and do you think it's the right search engine to do
that?
Thanks for your Answers!
Janek
--
View this message in context:
http://lucene.472066.n3.nabble.com/Similarity-search-with-Solr-tp4106623.html
Sent from the Solr - User mailing list archive at Nabble.com.