On Thu, 2008-05-29 at 17:30 +0200, Francesco Potorti` wrote:
> webcheck:   http://en.wikipedia.org/wiki/News_aggregator
[...]
>   File "/usr/share/webcheck/parsers/html/beautifulsoup.py", line 45, in parse
>     fromEncoding=str(link.encoding))
>   File "/var/lib/python-support/python2.5/BeautifulSoup.py", line 1282, in 
> __init__
>     BeautifulStoneSoup.__init__(self, *args, **kwargs)
[...]
>   File "/usr/lib/python2.5/sgmllib.py", line 285, in parse_starttag
>     self._convert_ref, attrvalue)
> UnicodeDecodeError: 'ascii' codec can't decode byte 0xa0 in position 0: 
> ordinal not in range(128)

Thanks for using webcheck and thanks for reporting this bug.

I believe this and the other crash are a result of a bug
in BeautifulSoup. I have reassigned the bug report to that package.

Anyway the SVN version of webcheck has been "fixed" to catch errors
while parsing the page log them and otherwise ignore the page.

-- 
-- arthur - [EMAIL PROTECTED] - http://people.debian.org/~adejong --

Attachment: signature.asc
Description: This is a digitally signed message part

Reply via email to