Hi Carlos, quite a while ago, you submitted a bug to the Debian BTS noting deficiencies in the encoding of the list archive[1]. We have now installed a newer version of the mailing list software that should provide some improvements. While we did not regenerate all of the archive yet, the current month's archive[1] seems to indicate that things will improve when we do this (soon). One thing I noticed is that gb18030[3] does not seem to be decoded, apparently because upstream is lacking support. What it would need is a character mapping from gb18030 to UTF. There already are those for CP936 (which is, I take it, gbk) and gb3212. If someone could provide those maps, it would probably be easy to add support for those as well. (The ones that are already there are in CP936.pm and GB2312.pm in the mhonarc package, the format should be similar and - if I understood your mail correctly - the overlapping sections of these character encodings could be used as a starting point) If you or someone else would be able to help out here, I would appreciate that a lot.
Kind regards Thomas 1. http://bugs.debian.org/381436 2. http://lists.debian.org/debian-chinese-gb/2007/11/ 3. http://lists.debian.org/debian-chinese-gb/2007/11/threads.html#00025 -- Thomas Viehmann, http://thomas.viehmann.net/ -- To UNSUBSCRIBE, email to [EMAIL PROTECTED] with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]