Package: lynx-cur
Version: 2.8.6-17
Severity: normal

There is something wrong in what lynx sends e.g., uxterm.
$ lynx -dump http://seba.ulyssis.org/thesis/howto-pinyin.php is fine.
Nothing to do with raw utf-8 vs. &#stuff.

My investigation concludes that lynx is sending e.g., M-G ~ P when it should be
sending M-G M-^P, where M- means high bit set. Instead of setting the
high bit of ^P and sending that, it sends two characters, ~ and P.

Looking at their bits,
^P   10 00010000
P    50 01010000
~    7e 01111110
M-^P 90 10010000

we see lynx is sending
11000111 01111110 010abcde
instead of
11000111          100abcde
where abcde varies with the particular char being sent.
What a screw up?! Looks like some mis-estimate in your to-utf-8
converter for screen presentation.

(pstree shows a "xterm--luit" connection.)

None of
character_set=UNICODE (UTF-8)
character_set=Transparent
character_set=Chinese
helped.

 Display and Character Set
  Use locale-based character set(!): [ON_]
  Display character set            : [UNICODE (UTF-8)________________]
  Assumed document character set(!): [utf-8___________]
  Raw 8-bit (!)                    : [ON_]
also didn't help

And why does "=" show
  Charset: utf-8
when "o" shows
  Assumed document character set(!): [iso-8859-1______]

-assume_local_charset=UTF-8 no help, -assume_charset=UTF-8 same.
file:///usr/share/doc/HOWTO/en-html/Unicode-HOWTO-4.html no help.

w3m works fine.


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]

Reply via email to