Your message dated Sun, 27 Apr 2025 12:42:01 +0200
with message-id <10e96b3a-43e9-41d2-85e1-669a8c787...@sury.org>
and subject line Closing old tidy-html5 bugs
has caused the Debian Bug report #624558,
regarding Malformed html file causing tidy to misbehave
to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact ow...@bugs.debian.org
immediately.)


-- 
624558: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=624558
Debian Bug Tracking System
Contact ow...@bugs.debian.org with problems
--- Begin Message ---
Package: tidy
Version: 20091223cvs-1
Severity: normal

Hi, 

I use tidy as the solo tool to convert (malformed) html files as
(standard) xml files. Here is an example of an html file that causes
tidy to misbehave:

 $ cat test.htm | tidy -asxml > bad.xml

 $ grep -A2  Ads.displayAd test.htm bad.xml
 test.htm:               Ads.displayAd();
 test.htm-               /* ]]> */
 test.htm-       </script>
 --
 bad.xml:                Ads.displayAd();        
 bad.xml-                /* ]]> */
 bad.xml-//]]>

I.e., in the 'tidy -asxml' output there is an extra '//]]>' element,
which causes my Perl XML parser to choke:

 not well-formed (invalid token) at line 109, column 4, byte 4299:
                 Ads.displayAd();        
                 /* ]]> */
 //]]>
 ===^
 </script> <noscript><a href= . . . 
  at /usr/lib/perl5/XML/Parser.pm line 187

Moreover, adding the extra element seems to be an inconsistent
behaviour of tidy:

 $ grep -A2 ' ]]> ' bad.xml
                 /* ]]> */
 //]]>
 </script> <noscript><a href=
 --
                         /* ]]> */
 </script> <noscript><a href=. . . 

I'll attach the test file in next email.
Please pass on this bug report to tidy developers. 

Thanks a lot!

-- System Information:
Debian Release: 6.0
  APT prefers stable
  APT policy: (600, 'stable'), (500, 'stable'), (50, 'unstable')
Architecture: amd64 (x86_64)

Kernel: Linux 2.6.36-grml64 (SMP w/1 CPU core; PREEMPT)
Locale: LANG=C, LC_CTYPE=C (charmap=ANSI_X3.4-1968)
Shell: /bin/sh linked to /bin/dash

Versions of packages tidy depends on:
ii  libc6                      2.11.2-10     Embedded GNU C Library: Shared lib
ii  libtidy-0.99-0             20091223cvs-1 HTML syntax checker and reformatte

tidy recommends no packages.

Versions of packages tidy suggests:
pn  tidy-doc                      <none>     (no description available)

-- no debconf information



--- End Message ---
--- Begin Message ---
Version: 2:5.8.0-1

I am closing all the bugs that had been filled before 5.8.0 upstream version
that are **upstream** issues.

If the bug that you've reported can still be found in 5.8.0, please retest this,
make sure that the upstream bugs are actually filled upstream and reopen
the bug in Debian BTS and correctly set the "forwarded" attribute on the
issue.

Ondrej
--
Ondřej Surý (He/Him)
ond...@sury.org

--- End Message ---

Reply via email to