This bug doesn't seem to be still present in testing, although something
very similar happens with the GET script:

$ perl -e 'use LWP::UserAgent;my $ua=LWP::UserAgent->new; my $response=
$ua->get("http://www.math.ucl.ac.be/membres/vanschaftingen/publications.fr.html";);
 $response->decoded_content();'

$ LANG=C GET -d
http://www.math.ucl.ac.be/membres/vanschaftingen/publications.fr.html
Parsing of undecoded UTF-8 will give garbage when decoding entities
at /usr/share/perl5/LWP/Protocol.pm line 114.

All seems to be connected with HTML::HeadParser, which in its POD says:

"Note that the C<HTML::HeadParser> might get confused if raw undecoded
UTF-8 is passed to the parse() method.  Make sure the strings are
properly decoded before passing them on."

This seems to be the same issue as in #386565. Shall we close this bug
and only keep the latter?

-- 
Martín Ferrari <[EMAIL PROTECTED]>

Attachment: signature.asc
Description: Esta parte del mensaje está firmada digitalmente

Reply via email to