Re: \c escape within $'...' can produce mangled UTF-8

2010-08-15 Thread Dmitry Groshev
e quite a useless waste of time, however. OK, I will know not to do that again. -- -= With best regards, Dmitry Groshev =-

Re: \c escape within $'...' can produce mangled UTF-8

2010-08-15 Thread Dmitry Groshev
ssibly invalid) multibyte sequence in UTF-8. Read up on the UTF-8 encoding, and don't make wrong guesses again. -- -= With best regards, Dmitry Groshev =-

Re: \c escape within $'...' can produce mangled UTF-8

2010-08-14 Thread Dmitry Groshev
parameter makes no sense, the escape sequence should be left untranslated, just the way "\x" handles things like "\xZZ". Make "\c" check that its parameter is an ASCII char, and the problem will be fixed. Unless for some reason you consider this bug worth preserving. :-) -- -= With best regards, Dmitry Groshev =-
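
The ASCII check proposed above can be sketched in a few lines of shell. This is only an illustration of the idea, not a patch to bash: the helper name ctrl_or_keep is hypothetical, it takes the byte that follows "\c" as two hex digits, and it assumes the control conversion keeps the low five bits of that byte.

```shell
# Hypothetical helper (not part of bash): decide what "\c" should do
# with the byte that follows it, given as two hex digits.
ctrl_or_keep() {
    if [ $(( 0x$1 )) -lt 128 ]; then
        # ASCII byte: convert to a control code (assumed: keep low 5 bits).
        printf '%02X\n' $(( 0x$1 & 0x1F ))
    else
        # Non-ASCII byte (e.g. part of a UTF-8 sequence): leave the escape alone.
        echo "untranslated"
    fi
}

ctrl_or_keep 41   # 'A' (0x41): prints 01, i.e. Ctrl-A
ctrl_or_keep D0   # lead byte of U+0410: prints untranslated
```

With a check of this kind, the multibyte sequence after "\c" would pass through unchanged instead of losing its lead byte.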

\c escape within $'...' can produce mangled UTF-8

2010-08-14 Thread Dmitry Groshev
nverts the character's first byte, leaving the rest of the UTF-8 sequence as it was. Repeat-By: echo $'\cА' > utf8bug.txt The "А" character in the example is Cyrillic: U+0410, UTF-8: 0xD0 0x90. It gets transformed into 0x10 0x90, which is invalid UTF-8. -- -= With best regards, Dmitry Groshev =-
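
The byte arithmetic behind the report can be checked without relying on the bug itself. The sketch below assumes the control conversion keeps only the low five bits of a byte, which is consistent with the reported 0xD0 becoming 0x10; the helper name ctrl_mask is hypothetical and is not taken from bash.

```shell
# Hypothetical helper: apply a control-style mask (keep the low 5 bits)
# to one byte given as two hex digits, and print the result in hex.
ctrl_mask() {
    printf '%02X\n' $(( 0x$1 & 0x1F ))
}

ctrl_mask 41   # 'A' (0x41): prints 01, i.e. Ctrl-A
ctrl_mask D0   # lead byte of U+0410: prints 10
```

The second result, 0x10, is a plain ASCII control character, so the following 0x90 is left as a continuation byte with no lead byte before it, and that is why the output is not valid UTF-8.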