Is there any analyzer that can do full Unicode case folding (for example, as described at http://www.w3.org/International/wiki/Case_folding#Recommendations_for_Case_Folding )?
Specifically, in a German index, I would like the sharp s character (ß) to be normalized to "ss". None of the Unicode normalization forms do this; only case folding does.

If there isn't an analyzer for this, any suggestions on how to roll my own? Should I simply apply String.toUpperCase() followed by String.toLowerCase()?

Thanks,
-- Avi
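
For concreteness, the upper-then-lower round trip asked about above could be sketched in plain Java as follows. The class and method names (CaseFoldSketch, foldApprox) are hypothetical, and the use of Locale.ROOT is an assumption made here to keep locale-specific rules (such as the Turkish dotless i) out of the picture. This is only an approximation of case folding and is not guaranteed to match the Unicode case folding data for every character.

```java
import java.util.Locale;

// Hypothetical helper, for illustration only: approximates case folding
// by upper-casing and then lower-casing the input.
final class CaseFoldSketch {

    // Locale.ROOT is an assumption: it avoids locale-sensitive mappings.
    static String foldApprox(String input) {
        // toUpperCase applies Java's special-casing rules, which expand
        // U+00DF (sharp s) to "SS"; toLowerCase then yields "ss".
        return input.toUpperCase(Locale.ROOT).toLowerCase(Locale.ROOT);
    }

    public static void main(String[] args) {
        // "Stra\u00DFe" is "Straße", escaped so the source file stays ASCII.
        System.out.println(foldApprox("Stra\u00DFe")); // prints "strasse"
    }
}
```

Note that a plain toLowerCase() alone would leave ß unchanged, which is why the sketch goes through upper case first.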