----- Original Message -----
From: "Tomas Frydrych" <[EMAIL PROTECTED]>
To: "Andrew Dunbar" <[EMAIL PROTECTED]>
Cc: <[EMAIL PROTECTED]>
Sent: Saturday, April 20, 2002 9:33 PM
Subject: Re: commit: abi: UTF8String class


>
> > Andrew Dunbar <[EMAIL PROTECTED]> wrote:
>
> > Well pretty soon we're going to need a real
> > replacement.  Dom and I are both in favour of the
> > replacement being UTF-8 but some here seem to want
> > UTF-32.
>
> UTF-8 is an encoding scheme that is intended to allow Unicode
> communication between separate processes over 8-bit channels.
> For that it is great, but that's about the only thing it is really good
> for. UTF-8 processing is cumbersome, and as such it is a completely
> unsuitable format to use for the piecetable. We need a fixed-width
> encoding for that, such as the current UCS-2, i.e., UTF-32.

Tomas, I think you're confusing the intent of UTF-8 with that of UTF-7.
Can you explain how UTF-8 helps with communication over 8-bit
channels?
You can also send UTF-32 (or any other encoding) over 8-bit channels
by splitting each character into 4 bytes.
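
To illustrate the point, here is a rough sketch in Python (not AbiWord
code; the sample string and variable names are made up for the example)
of UTF-32 text travelling as plain 8-bit bytes, four per character:

```python
# Hypothetical sample text: 7 characters, two of them non-ASCII.
text = "na\u00efve \u20ac"

# Big-endian UTF-32 without a BOM: exactly 4 bytes per code point.
payload = text.encode("utf-32-be")

# The byte stream an 8-bit channel would carry, seen as 4-byte groups.
chunks = [payload[i:i + 4] for i in range(0, len(payload), 4)]

# The receiver reassembles the characters from the same bytes.
decoded = payload.decode("utf-32-be")
```

Nothing about the channel being 8 bits wide requires UTF-8 here; both
ends only have to agree on the byte order.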

The only thing that keeps (broken) mail servers and similar software that
strips the 8th bit of each byte from mangling your text is UTF-7, and UTF-8
is completely unrelated to that.
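
A small Python sketch of the difference, assuming a relay that clears the
high bit of every byte (strip_high_bit is a hypothetical stand-in for such
a server, and the sample string is made up):

```python
def strip_high_bit(data: bytes) -> bytes:
    """Simulate a 7-bit-only relay by clearing the 8th bit of each byte."""
    return bytes(b & 0x7F for b in data)

text = "caf\u00e9"  # hypothetical sample with one non-ASCII character

utf8 = text.encode("utf-8")
utf7 = text.encode("utf-7")

# UTF-8 sets the 8th bit for non-ASCII characters; UTF-7 never does.
utf8_uses_high_bit = any(b & 0x80 for b in utf8)
utf7_uses_high_bit = any(b & 0x80 for b in utf7)

# UTF-7 passes through the relay unchanged and still decodes correctly.
utf7_survives = strip_high_bit(utf7).decode("utf-7") == text

# UTF-8 is altered by the relay, so the original bytes are lost.
utf8_survives = strip_high_bit(utf8) == utf8
```

So on a channel that eats the 8th bit, UTF-8 is damaged just like any
other 8-bit encoding would be.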

Cheers,

--
Joaquin Cuenca Abela
[EMAIL PROTECTED]

