2008/12/30 Paolo Bonzini <bonz...@gnu.org>:
> Thomas wrote:
>> Package: sed
>> Version: 4.1.5-6
>> Severity: normal
>>
>> For instance, take U+02E2 MODIFIER LETTER SMALL S:
>>
>> $ echo ˢ | sed -r 's/[a-z]|[^a-z]//'
>> ˢ
>>
>> Expected output: nothing.
>>
>> Sed does not handle "ˢ" (U02E2) as a letter (in [a-z]) nor as a
>> "non-letter" (in [^a-z]).
>>
>> The problem happens with around 3700 characters: see attached file. It
>> does not seem related to locales.
>
> But it is a bug (if anywhere) in glibc's locale descriptions or regex
> matcher, since that's where the relevant code comes from. Should be
> reassigned.
I don't think so, because the bug does not happen with sed from the
Heirloom Toolchest.

Thomas
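For anyone who wants to compare sed implementations or locales, here is a minimal sketch of the reproduction from the report. The helper name `residue` is hypothetical, and `en_US.UTF-8` is only an assumption about which UTF-8 locale is installed (check `locale -a`). An empty output line means the character matched either [a-z] or [^a-z]; anything left over reproduces the problem.

```shell
# Hypothetical helper: print what is left of the input after applying the
# substitution from the report under a given locale.
#   $1: locale name (assumed to be installed; sed falls back to C otherwise)
#   $2: text to test
residue() {
    printf '%s\n' "$2" | LC_ALL="$1" sed -r 's/[a-z]|[^a-z]//'
}

# Sanity check: an ASCII letter matches [a-z] and is removed,
# so this prints an empty line.
residue C a

# The character from the report. en_US.UTF-8 is an assumption; substitute
# any UTF-8 locale that is actually installed on the system.
residue en_US.UTF-8 ˢ
```

To try a different implementation, such as the Heirloom one mentioned above, replace `sed` inside the function with the path to that binary. Its option for extended regular expressions may differ from `-r`.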