Package: python-beautifulsoup
Version: 3.0.4-1

Another test case: http://www.singular-tech.com/
I also found this one via dmoz; I don't know what this site is.

Apparently, the charset encoding of that site is busted. At least, that is what the W3C validator says. It should be UTF-8, and it mostly is, but not all of the data is valid UTF-8.
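To illustrate what "mostly UTF-8, but not all valid" can do to a decoder, here is a minimal sketch. It is written for Python 3 rather than the Python 2.4 listed in this report, and it is not BeautifulSoup's actual code: `SAMPLE`, `try_decode` and `decode_lenient` are hypothetical names made up for this example. It assumes a converter that returns None when no candidate encoding fits, and shows that a lenient decode still yields usable text.

```python
# Hypothetical sample: valid UTF-8 followed by one stray Latin-1 byte (0xE9).
SAMPLE = "<p>caf\u00e9</p>".encode("utf-8") + b"<p>caf\xe9</p>"


def try_decode(data, encodings=("utf-8",)):
    """Return decoded text, or None if no candidate encoding fits.

    Hypothetical helper for illustration, not BeautifulSoup's implementation.
    """
    for encoding in encodings:
        try:
            return data.decode(encoding)
        except UnicodeDecodeError:
            continue
    return None


def decode_lenient(data, encoding="utf-8"):
    """Decode, replacing undecodable bytes with U+FFFD instead of failing."""
    return data.decode(encoding, errors="replace")


strict_result = try_decode(SAMPLE)       # strict decoding gives up
lenient_result = decode_lenient(SAMPLE)  # bad byte becomes U+FFFD

# Concatenating a string with a None result raises TypeError.
try:
    "<html>" + strict_result
    concat_error = None
except TypeError as exc:
    concat_error = exc
```

A caller that checks for None before concatenating, or that falls back to a lenient decode, would avoid the exception at the cost of a few replacement characters in the output.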
The invalid data again causes UnicodeDammit.originalEncoding and UnicodeDammit.unicode to be None, which triggers the concatenation type error exception in the SGML parser.

--- System information. ---
Architecture: i386
Kernel:       Linux 2.6.23.9

Debian Release: lenny/sid
  500 unstable        www.debian-multimedia.org
  500 unstable        ftp.de.debian.org
    1 experimental    ftp.de.debian.org

--- Package information. ---
Depends             (Version) | Installed
=============================-+-===========
python               (>= 2.2) | 2.4.4-6
python-support       (>= 0.2) | 0.7.5

best regards,
Erich Schubert
-- 
 erich@(vitavonni.de|debian.org) -- GPG Key ID: 4B3A135C
 Friends are those who reach out for your hand but touch your heart.
 The greatest obstacle to recognizing the truth is not falsehood,
 but the half-truth. --- L. N. Tolstoi