Re: WordDelimiterFilter looses position increments of tokens

2006-07-05 Thread Eric Jain
Yonik Seeley wrote: Something like this I suppose: Yes, just what I meant :-) Thanks a lot!

Re: WordDelimiterFilter looses position increments of tokens

2006-07-04 Thread Eric Jain
Yonik Seeley wrote: Your'e right, it does look possible that position info can be lost. The fix probably isn't as simple as copying the position increment in newTok()... the original increment should only be copied for the first token generated. If something like for (int i = 1, max = queue.

WordDelimiterFilter looses position increments of tokens

2006-07-04 Thread Eric Jain
Just noticed that in the newTok method in the WordDelimiterFilter, the "position increment" of the token doesn't seem to be copied?

Re: Splitting and matching words

2006-06-25 Thread Eric Jain
Eric Jain wrote: I'd like to have "PowerShot", "powershot" and "power-shot" match each other. Solr has a WordDelimiterFilter, which works quite well, except that "powershot" still won't match "PowerShot" (tokenized into "pow

Splitting and matching words

2006-06-25 Thread Eric Jain
I'd like to have "PowerShot", "powershot" and "power-shot" match each other. Solr has a WordDelimiterFilter, which works quite well, except that "powershot" still won't match "PowerShot" (tokenized into "power (shot powershot)", so "power powershot" would match..."). Any suggestions?