Thanks a million! That totally did the trick. It is now working at least 95% like I want it to.

Gotta tweak it a little more but it seems like the hard part is over.

Thanks once again to everybody who helped out.

//Daniel

Chris Hostetter wrote:
: You are doing the right thing. If you are creating n-grams at index
: time, you have to match that at query time. If the query is "monitor",
: you need to pass that through n-gram tokenizer, too. n-grams of length
: 18 look a little weird....

you don't *have* to use ngrams at query time ... his goal is "partial" word matching, so he wants to create various sized ngrams so that input like "onit" matches "monitor" but does not match "on it"
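
A rough sketch of that idea in Python, for illustration only. It is a simplified model of character n-gramming, not Solr's actual code; the function names and the 2..7 size range are made up for the example. The indexed text is expanded into n-grams, while the query is looked up as a single, un-grammed term.

```python
def char_ngrams(text, min_size, max_size):
    """Return every substring of text whose length is in [min_size, max_size]."""
    grams = set()
    for size in range(min_size, max_size + 1):
        for start in range(len(text) - size + 1):
            grams.add(text[start:start + size])
    return grams


def matches(query, indexed_text, min_size=2, max_size=7):
    """Index-time n-grams only: the query is compared as one whole term.

    The sizes are illustrative assumptions, not values from the thread.
    """
    index_terms = char_ngrams(indexed_text.lower(), min_size, max_size)
    return query.lower() in index_terms


print(matches("onit", "monitor"))  # "onit" is a 4-gram of "monitor"
print(matches("onit", "on it"))    # the space keeps "onit" out of the grams
```

Because the query is never split up, a query longer than the largest indexed gram size cannot match anything in this model.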

Daniel: the options for NGramTokenizerFactory are minGramSize and maxGramSize ... not minGram and maxGram ... you are getting the defaults (which are 1 and 2 i think)

it confused me too until i tried your schema changes, and then looked at the analysis.jsp link and saw only 1 and 2 gram tokens being created .. then i checked the class.
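
To see why misspelled options would break the partial matching, here is a small Python sketch, assuming the defaults really are 1 and 2 as guessed above. The helper `ngram_tokens` is hypothetical and only mimics the effect of the two size settings; the 3..7 range is an arbitrary example.

```python
def ngram_tokens(text, min_gram_size=1, max_gram_size=2):
    """List the character n-grams of text, shortest sizes first.

    The default sizes mirror the assumed tokenizer defaults (1 and 2).
    """
    tokens = []
    for size in range(min_gram_size, max_gram_size + 1):
        for start in range(len(text) - size + 1):
            tokens.append(text[start:start + size])
    return tokens


# Options ignored, so the assumed defaults apply: only 1- and 2-grams.
default_tokens = ngram_tokens("monitor")

# Options picked up: sizes 3..7 here are purely illustrative.
wider_tokens = ngram_tokens("monitor", min_gram_size=3, max_gram_size=7)

print(default_tokens)
print("onit" in default_tokens, "onit" in wider_tokens)
```

With only 1- and 2-gram tokens in the index, a four-character query such as "onit" has nothing to match against.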



-Hoss
