Albert-Jan Roskam wrote: > How can I walk (as in os.walk) or glob a website? I want to download all > the pdfs from a website (using urllib.urlretrieve), extract certain > figures (using pypdf- is this flexible enough?) and make some > statistics/graphs from those figures (using rpy and R). I forgot what the > process of 'automatically downloading' is called again, something that > sounds like 'whacking' (??)
If you've downloaded a source distribution of python you should have this little sucker on your harddisk: http://hg.python.org/cpython/file/31cd146d725c/Tools/webchecker/websucker.py _______________________________________________ Tutor maillist - Tutor@python.org To unsubscribe or change subscription options: http://mail.python.org/mailman/listinfo/tutor