Tomas wrote:
>
> > Tomas, I think that you're confusing the intent of UTF-8 with UTF-7.
> > Can you explain where helps UTF-8 in the communication through 8-bit
> > channels? You can also send UTF-32 (or any other encoding) over 8-bit
> > channels, chopping each char in 4 bytes.
> UTF-8 strings can be passed through dumb processes based on
> the 8-bit C library; UCS-2 strings, for instance, cannot, because
> any leading nulls will be interpreted as string terminators; that's all I
> meant (I suspect this is responsible for the proliferation of utf-8 in
> the *nix world, because it allows to start using Unicode without
> having to rewrite the entire system).

ok, I understand now your point.  Yes, that's an advantage of UTF-8, but I
would not say that it has been designed only to make strlen & al. work for
english users.

Cheers,

--
Joaquin Cuenca Abela
[EMAIL PROTECTED]


Reply via email to