Re: NGramTokenFilter behaviour

2009-09-30 Thread Shalin Shekhar Mangar
On Wed, Sep 30, 2009 at 11:24 PM, wrote: > If I index the following text: "I live in Dublin Ireland where > Guinness is brewed" > > Then search for: duvlin > > Should Solr return a match? > > In the admin interface under the analysis section, Solr highlights > some NGram matches? > > When I enter

Re: NGramTokenFilter behaviour

2009-09-30 Thread Shalin Shekhar Mangar
On Wed, Sep 30, 2009 at 11:24 PM, wrote: > > Can someone please clarify what the purpose of the > NGramFilter/tokenizer is, if not to allow for > misspellings/morphological variation and also, what the correct > configuration is in terms of use at index/query time. > > If it is spellcheck you are

RE: NGramTokenFilter behaviour

2009-09-30 Thread Feak, Todd
My understanding of a NGramTokenizing is to help with languages that don't necessarily contain spaces as a word delimiter (Japanese et al). In that case bi-gramming is used to find words contained within a stream of unbroken characters. In that case, you want to find all of the bi-grams that you