Re: [Tutor] retrieve URLs and text from web pages

2010-06-29 Thread Mark Lawrence
On 29/06/2010 17:32, Joel Goldstick wrote: [big snips] It might not be completely relevant, but there is nothing to stop anybody mixing regex and/or string methods. Horses for courses? Kindest regards. Mark Lawrence. ___ Tutor maillist - Tutor@

Re: [Tutor] retrieve URLs and text from web pages

2010-06-29 Thread Joel Goldstick
On Tue, Jun 29, 2010 at 12:26 PM, Tino Dai wrote: > > >> I decided to go with Regular Expressions to modify the text. In the >>> Python.org it is stated that they provide more options and flexibilty >>> compared to strings and their modules. >>> >> >> "their modules" referring to the "string" m

Re: [Tutor] retrieve URLs and text from web pages

2010-06-29 Thread Tino Dai
> > I decided to go with Regular Expressions to modify the text. In the >> Python.org it is stated that they provide more options and flexibilty >> compared to strings and their modules. >> > > "their modules" referring to the "string" module and the string methods > here, I assume. > > Regular e

Re: [Tutor] retrieve URLs and text from web pages

2010-06-29 Thread Stefan Behnel
Khawla Al-Wehaibi, 29.06.2010 17:12: I decided to go with Regular Expressions to modify the text. In the Python.org it is stated that they provide more options and flexibilty compared to strings and their modules. "their modules" referring to the "string" module and the string methods here, I

Re: [Tutor] retrieve URLs and text from web pages

2010-06-29 Thread Khawla Al-Wehaibi
: From: Tino Dai Subject: Re: [Tutor] retrieve URLs and text from web pages To: "Khawla Al-Wehaibi" Cc: tutor@python.org Date: Tuesday, 29 June, 2010, 5:34 On Sun, Jun 27, 2010 at 12:15 PM, Khawla Al-Wehaibi wrote: Hi, I’m new to programming. I’m currently learning python to w

Re: [Tutor] retrieve URLs and text from web pages

2010-06-28 Thread Tino Dai
On Sun, Jun 27, 2010 at 12:15 PM, Khawla Al-Wehaibi wrote: > Hi, > > I’m new to programming. I’m currently learning python to write a web > crawler to extract all text from a web page, in addition to, crawling to > further URLs and collecting the text there. The idea is to place all the > extracte