Hi Carlos,

quite a while ago, you submitted a bug to the Debian BTS noting
deficiencies in the encoding of the list archive[1].
We have now installed a newer version of the mailing list software that
should provide some improvements.
While we did not regenerate all of the archive yet, the current month's
archive[1] seems to indicate that things will improve when we do this
(soon).
One thing I noticed is that gb18030[3] does not seem to be decoded,
apparently because upstream is lacking support.
What it would need is a character mapping from gb18030 to UTF. There
already are those for CP936 (which is, I take it, gbk) and gb3212.
If someone could provide those maps, it would probably be easy to add
support for those as well. (The ones that are already there are in
CP936.pm and GB2312.pm in the mhonarc package, the format should be
similar and - if I understood your mail correctly - the overlapping
sections of these character encodings could be used as a starting point)
If you or someone else would be able to help out here, I would
appreciate that a lot.

Kind regards

Thomas

1. http://bugs.debian.org/381436
2. http://lists.debian.org/debian-chinese-gb/2007/11/
3. http://lists.debian.org/debian-chinese-gb/2007/11/threads.html#00025
-- 
Thomas Viehmann, http://thomas.viehmann.net/



-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]

Reply via email to