right, most stemmers expect the diacritics to be in their input to work correctly, too.
On Sun, Feb 21, 2010 at 5:19 PM, Erik Hatcher <erik.hatc...@gmail.com>wrote: > won't some stemmers leave diacritics in the terms that ought to be removed > before indexing? > > > > On Feb 21, 2010, at 4:57 PM, Shalin Shekhar Mangar wrote: > > Hello, >> >> Looking over the CharFilter franchise, it seems to me that the >> ASCIIFoldingFilter is a perfect candidate for being a CharFilter as it >> performs character level substitutions like MappingCharFilter. However it >> is >> not a CharFilter. Is there a reason why? >> >> -- >> Regards, >> Shalin Shekhar Mangar. >> > > -- Robert Muir rcm...@gmail.com