On Wed, Sep 21, 2005 at 04:32:06PM -0700, Anton Zinoviev wrote:

> Changes: 
>  bgoffice (3.0-5) unstable; urgency=low
...
>    * Files /etc/emacs21/site-start.d/90{aspell-bg,ibulgarian}.el to
>      codepage-setup cp1251.  It is still not clear to me how to support
>      spelling of Bulgarian UTF-8 texts in Emacs.

This should be internally handled by most {x}emacs if
buffer-file-coding-system is set to the encoding instead to 'undecided' or
equivalent. Notably xemacs21-nomule does not support that. ispell.el will
recode that UTF-8 to the encoding declared by the dictionary when sending
strings and the other way back when receiving them. That should be
transparent to the user, unless the original UTF-8 has characters that
cannot be recoded to the single byte encoding, leading to misalignment
errors (like in #205516).

>    * Add entries for different Emacs versions in ibulgarian.info-ispell and
>      aspell-bg.info-aspell.  Thanks to Ivan Raikov, closes: #321040.

Seems that xemacs21 also does not support cp1251. The summary seems to be

emacs20: nothing
emacs21: cp1251
emacs22: cp1251, windows-1251
xemacs21: windows-1251

I would forget emacs20, that was not even shipped with sarge (and whose
iso-8859-1 entry was wrong), and concentrate in leaving only the cp1251
entry, that also matches aspell. The only problem is (emacs20 discarded)
with xemacs21, and seems to be easily fixable defining cp1251 as an alias to
windows-1251 for xemacs. I can add that in an initialization file.

I have seen another problem in the ispell entry name. While all utf-8
entries I tried displayed as raw chars in my latin1 environment when used
in a debconf prompt, showing all chars, the bulgarian entry seems to only
show the first char (as a 3 byte UTF-8 char) and nothing of the remaining
chars.

I do not have a clear position regarding this last, when the use of utf8
was introduced in policy seemed that all utf8 chars were to be displayed as
multibyte chains in single byte encodings, leaving in the worst case the
english translation readable. But this case confuses me, we should probably
suggest trying first some sort of 7bit 'native' transliteration when possible
instead of directly suggesting the use of UTF8, or at least using something
like

7bit_transliteration [UTF-8_native_name] (english translation)

when utf8 is used. I hope that would at least make the 7bit_transliteration
readable in the worst case, when something in the utf8 string confuses
whiptail (but I did not check that). This seems to not affect readline or
gnome frontends. Another possibility would be to leave things as they
currently are, expecting utf8 support be improved in the meantime.

What do you think?

-- 
Agustin


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]

Reply via email to