right, most stemmers expect the diacritics to be in their input to work
correctly, too.

On Sun, Feb 21, 2010 at 5:19 PM, Erik Hatcher <erik.hatc...@gmail.com>wrote:

> won't some stemmers leave diacritics in the terms that ought to be removed
> before indexing?
>
>
>
> On Feb 21, 2010, at 4:57 PM, Shalin Shekhar Mangar wrote:
>
>  Hello,
>>
>> Looking over the CharFilter franchise, it seems to me that the
>> ASCIIFoldingFilter is a perfect candidate for being a CharFilter as it
>> performs character level substitutions like MappingCharFilter. However it
>> is
>> not a CharFilter. Is there a reason why?
>>
>> --
>> Regards,
>> Shalin Shekhar Mangar.
>>
>
>


-- 
Robert Muir
rcm...@gmail.com

Reply via email to