# su - nobody $ wget -O - http://jidanni.org/geo/taipower/sunriver/index.html| iconv -f big5 -t utf8 > i.html $ tidy -q -utf8 -raw i.html|perl -nwe 'print if length>80'|wc 4 5 419 #and they are just a tiny bit over 80. OK. $ tidy -q -raw -utf8 i.html|perl -nwe 'print if length>80'|wc 25 130 2845 #And to think that I was about to batch job convert my whole site into a highly wrapped mess! $ dlocate -s tidy Version: 20050415-1 ...
P.S., a side effect of -utf8 -raw is that <meta ... charset=big5"> no longer got changed to utf-8. OK, I will do two runs, first with just -utf8, then with -utf8 -raw. -- To UNSUBSCRIBE, email to [EMAIL PROTECTED] with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]