Catdoc doesn't support input encodings wider than 8 bits. Significant changes to its internal architecture would be needed to support them.
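To illustrate the limitation, here is a minimal sketch in Python. It is not catdoc's actual code: it assumes a charset file boils down to a table with one Unicode character per input byte, and the function name and the choice of koi8-r are only for illustration.

```python
# Hypothetical sketch, not catdoc's real implementation: a single-byte
# charset table maps each input byte to exactly one Unicode character.

def decode_with_byte_table(data, table):
    """Decode by looking up every byte independently in a 256-entry table."""
    return "".join(table[b] for b in data)

# Build a 256-entry table for a single-byte charset (koi8-r as an example).
# errors="replace" guards against any byte the codec leaves undefined.
KOI8R_TABLE = [bytes([i]).decode("koi8_r", errors="replace") for i in range(256)]

russian = "кот".encode("koi8_r")
print(decode_with_byte_table(russian, KOI8R_TABLE))

# Big5 encodes these characters as two bytes each, so a lookup that
# treats every byte as a separate character cannot decode them.
text = "中文"
chinese = text.encode("big5")
print(len(text), len(chinese))  # prints "2 4"
```

A table like this works only while one byte equals one character. For big5 or cp950 the decoder has to consume a lead byte and a trail byte together, which is the kind of architectural change mentioned above.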
I don't feel able to solve this problem myself: I can't read Chinese or Korean, so I cannot validate the results. All patches sent to me on this subject so far have been rejected, because they break catdoc on some of the supported platforms. So there is no reason just to drop in big5, cp950 and other CJK charset files. Please remove them from the package, because they just waste space.