Package: webcheck
Version: 1.10.1
Severity: normal

Hi Arthur!

I'm continuting to try and run webcheck on the debian website;  it now
fails with a crash in beautifulsoup:

webcheck:   http://www.slf.ch/
Traceback (most recent call last):
  File "/usr/bin/webcheck", line 249, in ?
    main()
  File "/usr/bin/webcheck", line 224, in main
    site.crawl(fp) # this will take a while
  File "/usr/share/webcheck/crawler.py", line 216, in crawl
    link.fetch()
  File "/usr/share/webcheck/crawler.py", line 538, in fetch
    parsermodule.parse(content, self)
  File "/usr/share/webcheck/parsers/html/beautifulsoup.py", line 60, in parse
    base = myurllib.normalizeurl(htmlunescape(base['href']).strip())
  File "/var/lib/python-support/python2.4/BeautifulSoup.py", line 419, in 
__getitem__
    return self._getAttrMap()[key]
KeyError: 'href'


This occurs after running it on http://www.nl.debian.org for a while;
continuing with webcheck -c does work though, and webcheck doesn't
crash then anymore...

-- System Information:
Debian Release: 4.0
  APT prefers stable
  APT policy: (500, 'stable')
Architecture: i386 (i686)
Shell:  /bin/sh linked to /bin/bash
Kernel: Linux 2.6.18-4-686
Locale: LANG=en_GB.UTF-8, LC_CTYPE=en_GB.UTF-8 (charmap=UTF-8)

Versions of packages webcheck depends on:
ii  python                        2.4.4-2    An interactive high-level object-o
ii  python-support                0.5.6      automated rebuilding support for p

Versions of packages webcheck recommends:
ii  python-beautifulsoup          3.0.1-2    error-tolerant HTML parser for Pyt

-- no debconf information


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]

Reply via email to