Hi Andrew,

> Actually, my understanding is that .abw documents are
> *always* in UTF-8 encoding.  At the top of any XML
> document the encoding must be declared if it is
> something other than UTF-8.

Things must have changed recently, AW used to use the encoding 
of the locale under which it is running in the abw document, i.e., if 
LANG was set to ru_RU.KOI8-R, the document was internally 
coded using KOI8-R. Any characters not found in the encoding set, 
were represented by xml entities &#... UTF-8 was only used if the 
locale itself used utf-8. The main reason for this was so that the 
user could use external utilities, such as grep, on their documents.

If it is true that AW now defaults to utf-8, I would like to suggest 
that this should be changed back.

Tomas



Reply via email to