-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hi Thomas!

Thank you for your quick response!  :-)

Thomas Viehmann (Listmaster) wrote:
> Hi Anthony,
> 
> thanks for contacting us! We would be very happy to weed out as many of
> the remaining encoding issues as possible, at least for correctly
> formatted mails.
> (We still have bug #381436 to track Chinese in particular, but
> unfortunately Carlos seems to be unavailable.)

Thanks for letting me know about Carlos's bug report.  I'll Cc: this
message to that bug too.

>> Several days ago, I sent an e-mail to debian-chinese-gb using the
>> GB18030 encoding, which is now the national standard replacing GB2312.
>> I just noticed that the lists.debian.org web-based interface does not
>> yet decode GB18030 properly:
>>     http://lists.debian.org/debian-chinese-gb/2008/03/msg00086.html
> 
> Thanks for pointing at the message, that is of great use. We will get
> back to you as soon as possible.

Is MHonArc 2.6.1 being used for the actual mail-to-HTML conversion?
I just did an "apt-get source mhonarc" and noticed that MHonArc currently
recognizes gb2312 and gbk, but "grep -r" finds no mention of gb18030.

Quick fix: Since GB18030 is a strict superset of GBK (a.k.a. CP936),
making gb18030 an alias of cp936, perhaps by modifying def-mime.mrc:

        cp936;              gbk
        cp936;              ms936
        cp936;              windows-936
+       cp936;              gb18030
        cp949:              euc-kr
        cp949:              ks_c_5601-1987
        cp949:              ks_c_5601-1989

or mhopt.pl, would do the trick?  :-)  It is not perfect, since GB18030's
four-byte sequences have no CP936 equivalent, but it should be fine for
99.9% of the GB18030 mail out there.
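
To illustrate why the alias is a reasonable stopgap, here is a minimal
sketch in Python.  It is not MHonArc code: it only assumes that Python's
built-in "gb18030" and "cp936" codecs behave like the corresponding
converters, and the helper name decodes_as_cp936 is made up for this
example.  Text in the GBK range produces the same bytes in both encodings,
while a character that needs a four-byte GB18030 sequence does not survive
a CP936 decode.

```python
def decodes_as_cp936(text: str) -> bool:
    """Return True if the GB18030 bytes for `text` decode back to the
    same text when treated as CP936 (hypothetical helper)."""
    raw = text.encode("gb18030")
    try:
        return raw.decode("cp936") == text
    except UnicodeDecodeError:
        # CP936 has no four-byte sequences, so decoding fails here.
        return False


common = "你好，世界"      # everyday characters, all within GBK
rare = "\U00020000"        # CJK Extension B, four bytes in GB18030

print(decodes_as_cp936(common))
print(decodes_as_cp936(rare))
print(len(rare.encode("gb18030")))
```

Mail that sticks to everyday characters falls in the first case, which is
why the alias should cover nearly all real traffic until a proper
converter exists.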

The long-term solution is to extend CP936.pm into a proper GB18030.pm.
James Su (SCIM author) and I worked on GB18030 support several years ago,
e.g. for the text codec in Qt.  Maybe eventually I'll write one for
MHonArc too.  Haha, I'd better not make promises yet as I'm still kind of
MIA.  :-)  So, until GB18030.pm is available, an alias to CP936 would do.

For more background information, you may want to take a look at:

  http://lists.w3.org/Archives/Public/ietf-charsets/2002JanMar/0038.html

Thank you very much for your help!

Warm regards,

- --
Anthony Fok
ThizLinux Software Co., Ltd.
Debian GNU/Linux Developer
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.6 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFH4w9JLa8qZm1n95ARAgS2AJsE93QK4unFHxgBacjEfzEAV8PFBQCeKEZB
Ez8QyyOseSsH3pH970DHIns=
=aaY2
-----END PGP SIGNATURE-----


