Good question... I think in Lucene 4.0, the edit distance is (will be) in Unicode code points, but in past releases, it's UTF16 code units.
Mike McCandless http://blog.mikemccandless.com 2011/6/30 Floyd Wu <floyd...@gmail.com>: > if this is edit distance implementation, what is the result apply to CJK > query? For example, "您好"~3 > > Floyd > > > 2011/6/30 entdeveloper <cameron.develo...@gmail.com> > >> I'm using Solr trunk. >> >> If it's levenstein/edit distance, that's great, that's what I want. It just >> didn't seem to be officially documented anywhere so I wanted to find out >> for >> sure. Thanks for confirming. >> >> -- >> View this message in context: >> http://lucene.472066.n3.nabble.com/Fuzzy-Query-Param-tp3120235p3122418.html >> Sent from the Solr - User mailing list archive at Nabble.com. >> >