Jim Popovitch writes: > On Fri, Feb 22, 2008 at 4:03 PM, Mark Sapiro <[EMAIL PROTECTED]> wrote: > So, the problem seems to appear with commas too which makes me wonder > if this can be resolved with this: > > urlpat = re.compile(r'(\w+://[^>)\s]+?)(\.|,)?(\s|$)') # URLs in text > > but then I got to thinking about any other punctuation make that > follows a URL... and my mind started spinning :-) > > Any ideas, anyone?
Unfortunately sre doesn't support POSIX character classes (like [:punct:]) AFAIK, but I would say it's a good idea to make that urlpat = re.compile(r'(\w+://[^>)\s]+?)[#,.::\'"!?()]?(\s|$)') # URLs in text for starters. It would be better to replace it with a real URL-matching regexp, though. ------------------------------------------------------ Mailman-Users mailing list Mailman-Users@python.org http://mail.python.org/mailman/listinfo/mailman-users Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/ Unsubscribe: http://mail.python.org/mailman/options/mailman-users/archive%40jab.org Security Policy: http://www.python.org/cgi-bin/faqw-mm.py?req=show&file=faq01.027.htp