Re: [Tutor] Editing html using python

2010-02-15 Thread Serdar Tumgoren
In the few cases I had where BeautifulSoup couldn't handle poorly formed HTML, I've found that html5lib was able to get the job done. And of course, lxml is great too, but has a bit more overhead installation-wise. ___ Tutor maillist - Tutor@python.org T

Re: [Tutor] Editing html using python

2010-02-15 Thread Stefan Behnel
Kent Johnson, 15.02.2010 14:05:
> On Mon, Feb 15, 2010 at 7:28 AM, Amit Sethi wrote:
>> Well ya I was kind of hoping to know about more tools and recommendations on
>> how to edit broken html .
>
> This page lists several alternatives: html5, lxml, elementtree:
> http://www.crummy.com/software/Bea

Re: [Tutor] Editing html using python

2010-02-15 Thread Kent Johnson
On Mon, Feb 15, 2010 at 7:28 AM, Amit Sethi wrote:
> Well ya I was kind of hoping to know about more tools and recommendations on
> how to edit broken html .

This page lists several alternatives: html5, lxml, elementtree:
http://www.crummy.com/software/BeautifulSoup/3.1-problems.html

> Their wer

Re: [Tutor] Editing html using python

2010-02-15 Thread Stefan Behnel
Amit Sethi, 15.02.2010 13:28:
> Well ya I was kind of hoping to know about more tools and recommendations on
> how to edit broken html.

You already found lxml, don't think you can do any better. Note that you shouldn't "edit broken html" but "fix broken HTML and then edit correct HTML". Writing ou
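To illustrate the "fix first, then edit" advice, here is a minimal sketch, assuming lxml is installed. The sample markup and the choice of renaming <b> to <strong> are made up for the example and do not come from the thread.

```python
import lxml.html

# Hypothetical broken input: neither <p> nor <b> is ever closed.
broken = "<div><p>First paragraph<p>Second <b>bold paragraph</div>"

# Step 1: let the parser repair the markup while building the tree.
root = lxml.html.fromstring(broken)

# Step 2: edit the repaired tree, here by renaming every <b> to <strong>.
for element in list(root.iter("b")):
    element.tag = "strong"

# Step 3: serialize the tree; the output is well-formed HTML.
fixed = lxml.html.tostring(root, encoding="unicode")
print(fixed)
```

All edits happen on the parsed tree, so the serialized result has its tags closed regardless of how the input looked.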

Re: [Tutor] Editing html using python

2010-02-15 Thread Amit Sethi
Well ya, I was kind of hoping to know about more tools and recommendations on how to edit broken HTML. There were tags that I wanted to replace using BeautifulSoup; however, I saw that the replaceWith function was not available for many of the tags I wanted to replace. Also BeautifulSoup does not h

Re: [Tutor] Editing html using python

2010-02-15 Thread Alan Gauld
"Luke Paireepinart" wrote
> with BeautifulSoup were. Also I think beautifulsoup is part of the Python standardlib now, isn't it? Why do you think it is not maintained?

I think you may be getting confused with ElementTree, which is part of the standard lib? BS is still an add-on.

Alan G.

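As a small illustration of the point about ElementTree being in the standard library, here is a sketch that uses only xml.etree.ElementTree. The sample markup is invented for the example. Note that ElementTree expects well-formed input, so on its own it is not a tool for broken HTML.

```python
import xml.etree.ElementTree as ET

# Hypothetical well-formed fragment; every tag is properly closed.
well_formed = "<div><p>Hello <b>world</b></p></div>"

root = ET.fromstring(well_formed)

# Rename every <b> element to <strong>.
for element in list(root.iter("b")):
    element.tag = "strong"

result = ET.tostring(root, encoding="unicode")
print(result)

# Badly nested markup is rejected rather than repaired.
try:
    ET.fromstring("<div><p>unclosed</div>")
    parsed_broken = True
except ET.ParseError:
    parsed_broken = False
print(parsed_broken)
```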
Re: [Tutor] Editing html using python

2010-02-15 Thread Stefan Behnel
Luke Paireepinart, 14.02.2010 13:12:
> Also I think beautifulsoup is part of the Python standardlib now, isn't it?

No, it's not.

Stefan

Re: [Tutor] Editing html using python

2010-02-14 Thread Kent Johnson
On Sun, Feb 14, 2010 at 6:10 AM, Amit Sethi wrote:
> Hi I need to edit html programmatically . Sadly the html might be broken at
> places . I was using BeautifulSoup but there were lots of problems and it is
> also not maintained can some one guide me to any tutorials on editing html
> using lxml

Re: [Tutor] Editing html using python

2010-02-14 Thread Luke Paireepinart
On Sun, Feb 14, 2010 at 5:10 AM, Amit Sethi wrote:
> Hi I need to edit html programmatically . Sadly the html might be broken at
> places . I was using BeautifulSoup but there were lots of problems and it is
> also not maintained can some one guide me to any tutorials on editing html
> using lxml

[Tutor] Editing html using python

2010-02-14 Thread Amit Sethi
Hi, I need to edit HTML programmatically. Sadly, the HTML might be broken in places. I was using BeautifulSoup, but there were lots of problems and it is also not maintained. Can someone guide me to any tutorials on editing HTML using lxml? -- A-M-I-T S|S