RE: How to properly use Levenstein distance with ~ in Java

2014-10-23 Thread Will Martin
using the Levenshtein distance. " http://dx.doi.org/10.1145/2623330.2623677 re will -Original Message- From: Alexandre Rafalovitch [mailto:arafa...@gmail.com] Sent: Thursday, October 23, 2014 12:05 PM To: solr-user Subject: Re: How to properly use Levenstein distance with ~ in Java

Re: How to properly use Levenstein distance with ~ in Java

2014-10-23 Thread Alexandre Rafalovitch
The last real update on that is 2.5 years old. Is there more recent update? I am interested in this topic as well. Regards, Alex. Personal: http://www.outerthoughts.com/ and @arafalov Solr resources and newsletter: http://www.solr-start.com/ and @solrstart Solr popularizers community: https://w

Re: How to properly use Levenstein distance with ~ in Java

2014-10-23 Thread Walter Underwood
We’re reimplementing fuzzy support in edismax on Solr 4.x right now. See: https://issues.apache.org/jira/browse/SOLR-629 wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ On Oct 22, 2014, at 11:08 PM, karsten-s...@gmx.de wrote: > Hi Aleksander, > > The Fuzzy Search

Re: Re: How to properly use Levenstein distance with ~ in Java

2014-10-22 Thread karsten-solr
Hi Aleksander,   The Fuzzy Searche '~' is not supported in dismax (defType=dismax) https://cwiki.apache.org/confluence/display/solr/The+DisMax+Query+Parser   You are using SearchComponent "spellchecker". This does not change the query results.   btw: It looks like you are using path "/select" wit

Re: How to properly use Levenstein distance with ~ in Java

2014-10-21 Thread Erick Erickson
When used on bare terms, ~ is indeed "fuzzy matching" rather than proximity, it's an overloaded operator in that sense. If I had to guess, I'd guess that your analysis chain for the field is doing "interesting" things for "taveranx" and the resulting token is far enough "away" (in the Levenshtein

Re: How to properly use Levenstein distance with ~ in Java

2014-10-20 Thread Ramzi Alqrainy
Because ~ is proximity matching. Lucene supports finding words are a within a specific distance away. Search for "foo bar" within 4 words from each other. "foo bar"~4 Note that for proximity searches, exact matches are proximity zero, and word transpositions (bar foo) are proximity 1. A query suc

Re: How to properly use Levenstein distance with ~ in Java

2014-10-20 Thread Aleksander Sadecki
Ok, thank you for your response. But why I cannot use '~'? On 20 October 2014 07:40, Ramzi Alqrainy wrote: > You can use Levenstein Distance algorithm inside solr without writing code > by > specifing the source of terms in solrconfig.xml > > > > solr.IndexBasedSpellChecker > ./spell

Re: How to properly use Levenstein distance with ~ in Java

2014-10-19 Thread Ramzi Alqrainy
You can use Levenstein Distance algorithm inside solr without writing code by specifing the source of terms in solrconfig.xml solr.IndexBasedSpellChecker ./spellchecker content true This example shows the results of a simple query that defines a query using the spellcheck.