Tomas wrote:
> UTF-8 processing is cumbersome, and as such it is completely
> unsuitable format to use for the piecetable. We need a fixed with
> encoding for that, such as the curent UCS-2, i.e., UTF-32.

Tomas, can you also clarify this point?
I don't see why UTF-8 is unsuitable for the piecetable.

One things that worries me about UTF-8 is that even if random access is not
something very useful, I don't think that it would be trivial to change code
that assumes random access to strings to code that uses only forward &
backwards iterators, so UTF-32 may still be our best shot here.

UTF-8 has another advantage over UTF-32.  In the gtk & the qnx frontend, the
format used to output text is UTF-8, so we'll not need to do any conversion
to get text displayed.  win32 uses UTF-16/UCS-2 (in funcion of the
registry?) so anyway you need to do a conversion from UTF-8 or UCS-4 to
UCS-2.

Cheers,

--
Joaquin Cuenca Abela
[EMAIL PROTECTED]


Reply via email to