Hi All. I'm Koji Otani, posted patches about CJK (see Bug #11413) before. I found more problem about CJK. and registered patch as Bug #13582.
Adobe Japan1 6 character set includes characters in outside of UNICODE BMP. But poppler cannot display these and some characters with TrueType font. current poppler has following problems about that. (1) CMap data is old. Current data (poppler-data-0.1.1.tar.gz ) has only Adobe Japan 4 data. This should be update with newer one. (GhostScript 8.60 has already new CMap data) (2) poppler doesn't lookup format12 cmap table of TrueType font. Only format12 cmap table supports codes outside of UNICODE BMP. (3) poppler lookups only UCS2 CMaps when making unicodeToGID map UCS2 CMap supports only codes in inside of UNICODE BMP. (4) missing handling CID conflict in CMap . CMap maps multiple unicode to a same CID. So, a CID can map multiple unicode. Currently poppler use only the first one. If that code is not exist in the cmap of TT Font. It is not displayed. I proposed patch solved (2), (3), (4). Please check it if you are interestead in this. I would appreciate it if you accept this patch. Reagrads, Koji Otani _______________________________________________ poppler mailing list [email protected] http://lists.freedesktop.org/mailman/listinfo/poppler
