Your message dated Sun, 27 Apr 2025 12:42:01 +0200
with message-id <10e96b3a-43e9-41d2-85e1-669a8c787...@sury.org>
and subject line Closing old tidy-html5 bugs
has caused the Debian Bug report #607065,
regarding "tidy -asxhtml -utf8 --add-xml-decl yes" doesn't specify the encoding
to be marked as done.
This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.
(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact ow...@bugs.debian.org
immediately.)
--
607065: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=607065
Debian Bug Tracking System
Contact ow...@bugs.debian.org with problems
--- Begin Message ---
Package: tidy
Version: 20091223cvs-1
Severity: normal
"tidy -asxhtml -utf8 --add-xml-decl yes" doesn't specify the encoding.
The consequence is that the XML processor cannot reliably determine
the encoding at that time. For instance, libxml2 will assume that the
output encoding should be US-ASCII (though it will be able to read
UTF-8 sequences as required), so that
echo é | tidy -asxhtml -utf8 --add-xml-decl yes | xmllint -
gives:
<?xml version="1.0"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN"
"http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
<head><meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />
<meta name="generator" content="HTML Tidy for Linux (vers 25 March 2009), see
www.w3.org" />
<title></title>
</head>
<body>
é
</body>
</html>
See the "é" that has been written as a character reference due to the
absence of declared encoding.
Note that the behavior of xmllint won't change:
https://bugzilla.gnome.org/show_bug.cgi?id=350208
-- System Information:
Debian Release: squeeze/sid
APT prefers unstable
APT policy: (500, 'unstable'), (500, 'testing'), (1, 'experimental')
Architecture: amd64 (x86_64)
Kernel: Linux 2.6.32-5-amd64 (SMP w/8 CPU cores)
Locale: LANG=POSIX, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Versions of packages tidy depends on:
ii libc6 2.11.2-7 Embedded GNU C Library: Shared lib
ii libtidy-0.99-0 20091223cvs-1 HTML syntax checker and reformatte
tidy recommends no packages.
Versions of packages tidy suggests:
ii tidy-doc 20091223cvs-1 HTML syntax checker and reformatte
-- no debconf information
--- End Message ---
--- Begin Message ---
Version: 2:5.8.0-1
I am closing all the bugs that had been filled before 5.8.0 upstream version
that are **upstream** issues.
If the bug that you've reported can still be found in 5.8.0, please retest this,
make sure that the upstream bugs are actually filled upstream and reopen
the bug in Debian BTS and correctly set the "forwarded" attribute on the
issue.
Ondrej
--
Ondřej Surý (He/Him)
ond...@sury.org
--- End Message ---