In the few cases I had where BeautifulSoup couldn't handle poorly formed
HTML, I've found that html5lib was able to get the job done. And of course,
lxml is great too, but has a bit more overhead installation-wis.
___
Tutor maillist - Tutor@python.org
T
Kent Johnson, 15.02.2010 14:05:
> On Mon, Feb 15, 2010 at 7:28 AM, Amit Sethi wrote:
>> Well ya I was kind of hoping to know about more tools and recommendations on
>> how to edit broken html .
>
> This page lists several alternatives: html5, lxml, elementtree:
> http://www.crummy.com/software/Bea
On Mon, Feb 15, 2010 at 7:28 AM, Amit Sethi wrote:
> Well ya I was kind of hoping to know about more tools and recommendations on
> how to edit broken html .
This page lists several alternatives: html5, lxml, elementtree:
http://www.crummy.com/software/BeautifulSoup/3.1-problems.html
> Their wer
Amit Sethi, 15.02.2010 13:28:
> Well ya I was kind of hoping to know about more tools and recommendations on
> how to edit broken html.
You already found lxml, don't think you can do any better. Note that you
shouldn't "edit broken html" but "fix broken HTML and then edit correct
HTML". Writing ou
Well ya I was kind of hoping to know about more tools and recommendations on
how to edit broken html . Their were tags that I wanted to replace using
beautifulSoup
however i saw that the replacewith function was not available for many of
the tags I
wanted to replace . Also BeautifulSoup does not h
"Luke Paireepinart" wrote
with BeautifulSoup were. Also I think beautifulsoup is part of the
Python
standardlib now, isn't it? Why do you think it is not maintained?
I think you may be getting confused with element tree which is part
of the standard lib? BS is still an add-on.
Alan G.
Luke Paireepinart, 14.02.2010 13:12:
> Also I think beautifulsoup is part of the Python standardlib now, isn't it?
No, it's not.
Stefan
___
Tutor maillist - Tutor@python.org
To unsubscribe or change subscription options:
http://mail.python.org/mailma
On Sun, Feb 14, 2010 at 6:10 AM, Amit Sethi wrote:
> Hi I need to edit html programmatically . Sadly the html might be broken at
> places . I was using BeautifulSoup but there were lots of problems and it is
> also not maintained can some one guide me to any tutorials on editing html
> using lxml
On Sun, Feb 14, 2010 at 5:10 AM, Amit Sethi wrote:
> Hi I need to edit html programmatically . Sadly the html might be broken at
> places . I was using BeautifulSoup but there were lots of problems and it is
> also not maintained can some one guide me to any tutorials on editing html
> using lxml
Hi I need to edit html programmatically . Sadly the html might be broken at
places . I was using BeautifulSoup but there were lots of problems and it is
also not maintained can some one guide me to any tutorials on editing html
using lxml .
--
A-M-I-T S|S
_
10 matches
Mail list logo