The problem is definitely in libxml2 (not just python), as a C app segfaults under the same circumstances.
-- htmlParseDoc segfaults when an empty body is passed https://launchpad.net/bugs/61679 -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs