----- Original Message -----
From: "Tomas Frydrych" <[EMAIL PROTECTED]>
To: "Andrew Dunbar" <[EMAIL PROTECTED]>
Cc: <[EMAIL PROTECTED]>
Sent: Saturday, April 20, 2002 9:33 PM
Subject: Re: commit: abi: UTF8String class
>
> Andrew Dunbar <[EMAIL PROTECTED]> wrote:
>
> > Well pretty soon we're going to need a real
> > replacement. Dom and I are both in favour of the
> > replacement being UTF-8 but some here seem to want
> > UTF-32.
>
> UTF-8 is an encoding scheme that is intended to allow Unicode
> communication between separate processes over 8-bit channels.
> For that it is great, but that's about the only thing it is really good
> for. UTF-8 processing is cumbersome, and as such it is a completely
> unsuitable format to use for the piecetable. We need a fixed width
> encoding for that, such as the current UCS-2, i.e., UTF-32.

Tomas,

I think that you're confusing the intent of UTF-8 with that of UTF-7.
Can you explain how UTF-8 helps with communication through 8-bit
channels? You can also send UTF-32 (or any other encoding) over 8-bit
channels by chopping each character into 4 bytes.

The only thing that keeps (broken) mail servers, and other software
that strips the 8th bit of each byte, from eating your text is UTF-7,
and UTF-8 is absolutely unrelated to that.

Cheers,

--
Joaquin Cuenca Abela
[EMAIL PROTECTED]
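
As a postscript to the point about 8-bit channels, here is a minimal sketch in Python (chosen only for illustration; it is not AbiWord code). The helper name `uses_eighth_bit` and the sample character are assumptions made up for this example. It shows that both UTF-8 and UTF-32 produce bytes with the 8th bit set for a non-ASCII character, while UTF-7 stays within 7-bit ASCII.

```python
# Illustrative sketch only: the helper name and sample text are assumptions.

def uses_eighth_bit(data: bytes) -> bool:
    """Return True if any byte in data has its high (8th) bit set."""
    return any(b & 0x80 for b in data)

text = "\u00f1"  # LATIN SMALL LETTER N WITH TILDE, a non-ASCII character

utf8 = text.encode("utf-8")       # variable width: 2 bytes for this character
utf32 = text.encode("utf-32-be")  # fixed width: 4 bytes, big-endian, no BOM
utf7 = text.encode("utf-7")       # restricted to 7-bit ASCII bytes

for name, data in (("UTF-8", utf8), ("UTF-32", utf32), ("UTF-7", utf7)):
    print(name, data.hex(), "needs 8th bit:", uses_eighth_bit(data))
```

A channel that strips the 8th bit would therefore corrupt the UTF-8 and UTF-32 byte streams alike, and only the UTF-7 form would survive it unchanged.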
