I wasn't suggesting that they should be changed but trying to understand why. This makes sense. Thanks Erik and Robert.
On Mon, Feb 22, 2010 at 6:16 AM, Robert Muir <rcm...@gmail.com> wrote: > right, most stemmers expect the diacritics to be in their input to work > correctly, too. > > On Sun, Feb 21, 2010 at 5:19 PM, Erik Hatcher <erik.hatc...@gmail.com > >wrote: > > > won't some stemmers leave diacritics in the terms that ought to be > removed > > before indexing? > > > > > > > > On Feb 21, 2010, at 4:57 PM, Shalin Shekhar Mangar wrote: > > > > Hello, > >> > >> Looking over the CharFilter franchise, it seems to me that the > >> ASCIIFoldingFilter is a perfect candidate for being a CharFilter as it > >> performs character level substitutions like MappingCharFilter. However > it > >> is > >> not a CharFilter. Is there a reason why? > >> > >> -- > >> Regards, > >> Shalin Shekhar Mangar. > >> > > > > > > > -- > Robert Muir > rcm...@gmail.com > -- Regards, Shalin Shekhar Mangar.