On 04/10/2007, Christopher Spears <[EMAIL PROTECTED]> wrote:
> Obviously, I can just create a pattern "\d+ \w+ \w+".
> However, the pattern would be useless if I had a
> street name like 3120 De la Cruz Boulevard.  Any
> hints?

Possibly you could create a list of street types ('Street', 'Road',
'Crescent', 'Lane', etc.), then construct a regular expression
matching "a number, followed by one or more words, followed by a
street type".

Another thing you could do: put the street number, street name, and
street type into separate named groups, so you could do things like
"m['number']" to get the street number from the match object.

Finally, street addresses can be very dirty -- you get people writing
things like:

"Flat A, 13 Main St"
"13A Main St"
"13-A Main St"
"13/A Main St"
"13-15 Main St"

and other variations.  Some people earn good money doing address cleaning :-)

-- 
John.
_______________________________________________
Tutor maillist  -  Tutor@python.org
http://mail.python.org/mailman/listinfo/tutor

Reply via email to