Paolo, > > [=e=] to match "e" as well as accented versions like é, è and ê). > > That is the one feature that you get with glibc, and that you would > > sacrifice when building --with-included-regex. > > I agree. It's up to distros to choose, of course.
If you are on the point of sacrificing a glibc feature in many programs, then IMO you should first talk with the glibc people to see what alternative they can offer. > It's strictly about gawk/grep/gnulib; no need to involve > glibc from the beginning. I disagree. It must involve glibc. It is probably futile to ask Ulrich Drepper to change how [a-z] is interpreted by default. But what would gnulib need so as to implement our "desired" behaviour? As far as I understand, you want to keep the interpretation of [=e=] in the POSIX + glibc way, but change the interpretation of [a-z]? Then, what do we need from glibc? - Do we need a RE_RANGES_IGNORE_LOCALES flag, like Arnold proposed? - Do we need an API that allows us to access the collation elements? (Or is strcoll and wcscoll sufficient?) Bruno -- In memoriam Johanna Kirchner <http://en.wikipedia.org/wiki/Johanna_Kirchner>