On 5/10/11 9:17 AM, Greg Wooledge wrote: > In yours, however, it is 0x65 0xcc 0x81 which is U+0065 LATIN SMALL > LETTER E followed by U+0301 COMBINING ACUTE ACCENT. > > Perhaps Bash does not know how to interpret COMBINING ACUTE ACCENT which > follows a letter...? > > I'm not intimately familiar with this stuff myself, but it looks like > a real bastard to me... I thought the point of UTF-8 was that you could > read it a byte at a time, and know when you encountered a byte that > signified the start of a multi-byte character. But apparently not! > If I'm interpreting this COMBINING ACUTE ACCENT thing properly, the > only indicator that you are in a multi-byte character comes with the > *second* byte, so you have to backtrack. What idiot thought this up?
http://lists.gnu.org/archive/html/bug-readline/2011-01/msg00011.html -- ``The lyf so short, the craft so long to lerne.'' - Chaucer ``Ars longa, vita brevis'' - Hippocrates Chet Ramey, ITS, CWRU c...@case.edu http://cnswww.cns.cwru.edu/~chet/