In the few cases I had where BeautifulSoup couldn't handle poorly formed HTML, I've found that html5lib was able to get the job done. And of course, lxml is great too, but has a bit more overhead installation-wis.
_______________________________________________ Tutor maillist - Tutor@python.org To unsubscribe or change subscription options: http://mail.python.org/mailman/listinfo/tutor