Jim Popovitch writes:

 > On Fri, Feb 22, 2008 at 4:03 PM, Mark Sapiro <[EMAIL PROTECTED]> wrote:
 > So, the problem seems to appear with commas too which makes me wonder
 > if this can be resolved with this:
 > 
 >    urlpat = re.compile(r'(\w+://[^>)\s]+?)(\.|,)?(\s|$)') # URLs in text
 > 
 > but then I got to thinking about any other punctuation make that
 > follows a URL... and my mind started spinning :-)
 > 
 > Any ideas, anyone?

Unfortunately sre doesn't support POSIX character classes (like
[:punct:]) AFAIK, but I would say it's a good idea to make that

   urlpat = re.compile(r'(\w+://[^>)\s]+?)[#,.::\'"!?()]?(\s|$)') # URLs in text

for starters.  It would be better to replace it with a real
URL-matching regexp, though.

------------------------------------------------------
Mailman-Users mailing list
Mailman-Users@python.org
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org

Security Policy: 
http://www.python.org/cgi-bin/faqw-mm.py?req=show&amp;file=faq01.027.htp

Reply via email to