-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

> From:
> Paul Wise <[EMAIL PROTECTED]>
> On Fri, 2008-03-28 at 20:55 +0100, Noèl Köthe wrote:
> 
>> I'm not able to reproduce this (old:() problem report since some hours
>> with wget 1.11.1-1 and this commandline:
>>
>> wget -r -np -c
>> http://meetings-archive.debian.net/pub/debian-meetings/2005/debconf5/ogg_theora/720x576/
>>
>> Can you still reproduce this problem?
> 
> It looks like it is finally fixed! Thanks for the followup, closing :)

Maybe not. This bug sounds a lot like one we're tracking upstream:
https://savannah.gnu.org/bugs/index.php?20496
(That bug is closed, but because it has been split into two separate
issues, not because the underlying problem is gone.)

The situation will occur when Wget tries to see if there's more to a
very large file, and receives a 416 Requested Range Not Satisfiable,
with a content-type header of text/html (perhaps because the server is
returning an HTML error page). Wget will think that the original
_document_ was HTML, and try to parse the thing on disk. Wget's parser
needs the complete file in a buffer (ick!), so Wget will mmap() it: or,
if that fails, it will malloc it (and that's when this problem shows up).

The solution to this isn't obvious. Probably Wget should ignore the
Content-Type of 416, but then it doesn't have a way to tell whether it
should be parsing the file or not. Perhaps an extra HEAD request, or a
heuristic to see if Wget thinks that it's text/html. We probably won't
be fixing it until the Session Info Database features is added (which
will remember the original content types for files).

- --
Micah J. Cowan
Programmer, musician, typesetting enthusiast, gamer,
and GNU Wget Project Maintainer.
http://micah.cowan.name/
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.6 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFH9vnw7M8hyUobTrERAv56AJ9790dOir3xApULmk8AuQz6tL4n4ACfRXCt
jW0hQgKF0eQBdpuZac9ntcI=
=5imI
-----END PGP SIGNATURE-----


Reply via email to