On Sat, Aug 04, 2012 at 09:00:49PM +0200, Andreas Krey wrote: > And moving outside of ASCII also has surprises with LANG=C, unfortunately.
That is the way it is supposed to be according to POSIX. The "C" environment is ASCII. Newer C standard versions apparently allow for unicode in C symbol names but I'm not sure if any system has already implemented this, and I doubt it affects locales (haven't checked though). > (In my understanding LANG tells what console I/O should be, not how file > names are interpreted.) To my knowledge there is nothing in POSIX that says how filenames should be interpreted. They're always considered a string of bytes.