Package: lynx-cur Version: 2.8.8dev.6-3 Severity: normal This is more or less a followup of bug 592883, which has been fixed for the XML parser, but no changes have been done for the HTML parser. A testcase can be found here:
http://www.vinc17.net/test/c1.html I've also attached the current version. In the first test (how the € entity is to be interpreted by a HTML parser), lynx follows the behavior of a HTML 5 parser by displaying a Euro symbol, but this behavior is incorrect in HTML 4. The second test (how the ¬it; is to be interpreted by a HTML parser), lynx follows the behavior of a HTML 4 parser by outputting "¬it;" (this behavior is incorrect in HTML 5). Note: Firefox and Webkit had the same bug, but it has been fixed for FF4 and in Webkit's trunk[*], which both follow the HTML 5 rules only. [*] https://bugs.webkit.org/show_bug.cgi?id=14391 -- System Information: Debian Release: squeeze/sid APT prefers unstable APT policy: (500, 'unstable'), (500, 'testing'), (1, 'experimental') Architecture: amd64 (x86_64) Kernel: Linux 2.6.32-5-amd64 (SMP w/8 CPU cores) Locale: LANG=POSIX, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8) Shell: /bin/sh linked to /bin/dash Versions of packages lynx-cur depends on: ii debconf [debconf-2.0] 1.5.36 Debian configuration management sy ii libbsd0 0.2.0-1 utility functions from BSD systems ii libc6 2.11.2-7 Embedded GNU C Library: Shared lib ii libgcrypt11 1.4.5-2 LGPL Crypto library - runtime libr ii libgnutls26 2.8.6-1 the GNU TLS library - runtime libr ii libidn11 1.18-1 GNU Libidn library, implementation ii libncursesw5 5.7+20100313-4 shared libraries for terminal hand ii zlib1g 1:1.2.3.4.dfsg-3 compression library - runtime Versions of packages lynx-cur recommends: ii mime-support 3.51-1 MIME files 'mime.types' & 'mailcap Versions of packages lynx-cur suggests: pn lynx-cur-wrapper <none> (no description available) -- debconf information excludedTitle: Test for C1 Control Code
Displaying a Euro symbol is incorrect in HTML 4 (but the expected result in HTML 5): .
See http://www.w3.org/International/questions/qa-controls.
HTML parser test:
- I'm ¬it; I tell you.
- I'm ¬it; I tell you.
In HTML 5, the two lines above should be rendered in the same way. See HTML 5, Tokenizing character references.