Re: [R] using regular expressions to retrieve a digit-digit-dot structure from a string

Marc Schwartz Mon, 08 Jun 2009 10:46:06 -0700

On Jun 8, 2009, at 12:34 PM, Marc Schwartz wrote:

On Jun 8, 2009, at 9:15 AM, Mark Heckmann wrote:
Hi,
I need to recognize itemization structures in strings which follow the
format "digit-digit-dot", e.g.:



1.
2.
19.
211.
Given the string " This happened in the 21. century." (the dot behind 21 is used in German instead of 21st), I want to know where the dots are, but I do not
want the 21.-dot to be returned as well.
I am not good at regular expressions. How can I retrieve or recognize dots
excluding the digit-digit-dot structure?



TIA, Mark
> vec <- c("1.", "2.", "19.", "211.", "This happened in the 21.century")
> grep("^[0-9]+\\.", vec, value = TRUE)
[1] "1."   "2."   "19."  "211."
The regex "^[0-9]+\\." is interpreted as "match one or more digits followed by a period, only at the beginning of the line". The caret '^' defines the beginning of the line, so that a sequence of numbers followed by a period in the middle of the line will not match.
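
To make the effect of the caret concrete, here is a small sketch comparing the anchored pattern with the same pattern without '^'. The vector `x` and the variable names are made up for illustration and are not part of the original exchange.

```r
# Illustrative input (an assumption, not taken from the original post)
x <- c("19.", "in the 21. century")

# Anchored: the digits must start at the beginning of the string
anchored <- grepl("^[0-9]+\\.", x)

# Unanchored: digits followed by a period may occur anywhere
unanchored <- grepl("[0-9]+\\.", x)

print(anchored)    # TRUE FALSE
print(unanchored)  # TRUE TRUE
```

Only the first element matches the anchored pattern, because in the second element the digits sit in the middle of the string.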

I mis-read that last part of your query. I see that Henrique and Gabor have provided what appear to be correct solutions.
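
For reference, one way to locate dots that are not directly preceded by a digit is a negative lookbehind with perl = TRUE. This is only a sketch of one possible approach, not necessarily what Henrique or Gabor posted, and the variable names are illustrative.

```r
s <- " This happened in the 21. century."

# "(?<![0-9])\\." matches a period that is NOT immediately preceded by a digit.
# Lookbehind assertions require perl = TRUE.
m <- gregexpr("(?<![0-9])\\.", s, perl = TRUE)[[1]]

# Strip the match attributes to get plain character positions
pos <- as.integer(m)

# Characters found at those positions (each should be a period)
found <- substring(s, pos, pos)

print(pos)
print(found)
```

Here only the sentence-final period is reported; the one after "21" is skipped. Note that this pattern would also skip the period in a decimal number such as "3.14", which may or may not be what is wanted.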


Sorry for the confusion.

Marc

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
