On 2014-09-11 18:16:29 -0700, Paul Eggert wrote:
> Vincent Lefevre wrote:
> >the C locale corresponds to ANSI_X3.4-1968,
>
> No it doesn't, at least not on any current platform I'm aware of.

It does on Debian:

ypig% LC_ALL=C locale charmap
ANSI_X3.4-1968

> >I would say that this should be the same for invalid
> >byte sequences in a UTF-8 locale.
>
> One *could* design an encoding with that property, but it wouldn't be UTF-8;
> it would be something else. I don't know of any C library that does that to
> UTF-8. There are good arguments against doing it, e.g., one loses the
> property that one can concatenate character strings by concatenating their
> byte representations.

I'm talking only about grep here. BTW, the current behavior breaks
the sometimes used "grep ." solution to match non-empty lines.

-- 
Vincent Lefèvre <vinc...@vinc17.net> - Web: <https://www.vinc17.net/>
100% accessible validated (X)HTML - Blog: <https://www.vinc17.net/blog/>
Work: CR INRIA - computer arithmetic / AriC project (LIP, ENS-Lyon)
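
For illustration only, here is a minimal sketch of the "grep ." point made in the message. It is not taken from the thread: the helper names sample and count_nonempty are hypothetical, and the en_US.UTF-8 locale is an assumption that may not be installed on a given system. The exact count printed by the "grep -c ." line depends on the grep version and locale, so no fixed result is claimed for it.

```shell
# Three input lines: "a", an empty line, and a lone 0xFF byte (invalid UTF-8).
sample() { printf 'a\n\n\377\n'; }

# Hypothetical helper: counts non-empty lines by excluding empty ones,
# so it does not rely on "." matching every byte.
count_nonempty() { grep -vc '^$'; }

# In a UTF-8 locale (en_US.UTF-8 is an assumption), "." may fail to match
# the 0xFF line, in which case that non-empty line is not counted.
sample | LC_ALL=en_US.UTF-8 grep -c .

# Excluding empty lines instead counts both non-empty lines.
sample | count_nonempty
```

Inverting the match on '^$' sidesteps the question of what "." matches, which is why it may serve as a substitute where "grep ." was used to select non-empty lines.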