On 2008-03-21 02:28:41.00 Anthony Fok <[EMAIL PROTECTED]> wrote:
Is MHonArc 2.6.1 being used for doing the actual HTML mail conversion?
I just did an "apt-get source mhonarc" and noticed that MHonArc currently
recognizes gb2312 and gbk, but "grep -r" finds no mention of gb18030.
Actually, we are using mhonarc 2.6.16.

Quick fix: Since GB18030 is a strict superset of GBK (a.k.a. CP936),
making gb18030 an alias of cp936, perhaps modifying def-mime.mrc:

        cp936;              gbk
        cp936;              ms936
        cp936;              windows-936
+       cp936;              gb18030
        cp949:              euc-kr
        cp949:              ks_c_5601-1987
        cp949:              ks_c_5601-1989

or mhopt.pl would do the trick? :-) While it is not perfect, it should
be fine with 99.9% of all GB18030 mail out there.
Thanks, I put that into our config files and regenerated the 2008-03
archive. If you don't notice any breakage, I will regenerate the old
debian-chinese-gb lists as well and then close this bug, unless you feel
that gb18030 needs proper support.

The long-term solution is to extend CP936.pm to create a GB18030.pm.
James Su (SCIM author) and I worked on that several years ago, e.g. the
text codec in Qt.  Maybe eventually I'll make one for MHonArc too.
Haha, I'd better not make promises yet as I'm still kind of MIA. :-)
So, until GB18030.pm is available, an alias to CP936 would do.
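
To illustrate why the alias covers most mail and where it falls short, here
is a rough sketch. It uses Python's built-in "gbk" and "gb18030" codecs as
stand-ins for MHonArc's Perl converters, which is an assumption made purely
for illustration; the sample strings are made up as well.

```python
# Illustrative only: Python's codecs stand in for MHonArc's Perl converters.
sample = "中文邮件"  # common characters, all within the GBK repertoire

gbk_bytes = sample.encode("gbk")
gb18030_bytes = sample.encode("gb18030")

# For text within the GBK repertoire both encodings produce the same bytes,
# so a GBK (CP936) decoder reads such GB18030 mail correctly.
same_bytes = gbk_bytes == gb18030_bytes
roundtrip_ok = gb18030_bytes.decode("gbk") == sample

# Characters outside GBK use GB18030's four-byte form, which a GBK decoder
# cannot handle. This is the case a real GB18030.pm would have to cover.
rare = "\U00020000"  # a CJK Extension B character
rare_bytes = rare.encode("gb18030")
try:
    rare_bytes.decode("gbk")
    alias_handles_rare = True
except UnicodeDecodeError:
    alias_handles_rare = False

print(same_bytes, roundtrip_ok, len(rare_bytes), alias_handles_rare)
```

So mail that stays within the GBK repertoire should come out fine under the
alias, while four-byte sequences would not be converted properly.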

It seems that gb18030 is not supported in standard Perl; it needs something
like Encode::HanExtra from CPAN.

Kind regards

T.
--
Thomas Viehmann, http://thomas.viehmann.net/


