Is there any analyzer that can do full Unicode case folding (for example, as
described at
http://www.w3.org/International/wiki/Case_folding#Recommendations_for_Case_Folding
)?

Specifically, in a German index, I would like the sharp s character (ß) to
be normalized into ss, which isn't done by any of the Unicode Normal Forms,
but only by case folding.

If there isn't an analyzer for this - any suggestions on how to roll my own?
Should I simply apply String.toUpperCase() followed by .toLowerCase()?

Thanks,
-- Avi

Reply via email to