Package: w3m
Version: 0.5.1-4
Severity: wishlist

w3m has big problems reading a file full of HTML entities.
$ w3m http://ehp.niehs.nih.gov/cehp/docs/2005/113-1c/toc.html
The page is displayed with lots of "?" characters. Firefox renders it without any problems.

Even after decoding the entities first:
$ wwwoffle -o http://ehp.niehs.nih.gov/cehp/docs/2005/113-1c/toc.html|\
perl -pwe 'use HTML::Entities;$_=decode_entities($_);\
s/gb2312/utf-8/'>file.html
w3m still has problems.

OK, I was finally able to prepare it for a big5 PDA:
$ wwwoffle -o http://ehp.niehs.nih.gov/cehp/docs/2005/113-1c/toc.html|\
perl -pwe 'use HTML::Entities;$_=decode_entities($_);\
s/gb2312/big5/'|iconv -f utf-8 -t gb2312 -c|\
iconv -f gb2312 -t big5 -c > file.html

Note the two iconv steps, which are probably needed because of their
incomplete mapping, something I recall reporting to them. Also, there is
in fact no gb2312 in the original file.
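
Since -c makes iconv drop unconvertible characters silently, it can help to
first check whether a file converts cleanly at all. A minimal sketch, assuming
GNU iconv (which exits non-zero on an unconvertible character when -c is not
given); the function name check_convertible is made up for illustration:

```shell
# Hypothetical helper: report whether a UTF-8 file can be converted
# to the given target charset without losing characters.
# Assumes GNU iconv, which fails (non-zero exit) on an unmappable
# character unless -c is given.
check_convertible() {
    # $1 = input file (assumed to be UTF-8), $2 = target charset
    if iconv -f utf-8 -t "$2" "$1" >/dev/null 2>&1; then
        echo "ok"
    else
        echo "unmappable"
    fi
}
```

For example, "check_convertible file.html big5" would tell whether a UTF-8
file.html survives a direct conversion to big5 without -c.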

-- System Information:
Locale: LANG=zh_TW.Big5, LC_CTYPE=zh_TW.Big5 (charmap=BIG5)

