Re: [R] regexpr with accents

arun Mon, 06 Aug 2012 00:12:20 -0700

Hi,

It works for me.  I am using R 2.15 on Ubuntu 12.04.


d1 <- data.frame(V1 = 1:5, V2 = c("some text = 9", "some téxt=9", "sóme tèxt=9",
                                  "söme text=9", "some têxt=9"))
d1
#  V1            V2
#1  1 some text = 9
#2  2   some téxt=9
#3  3   sóme tèxt=9
#4  4   söme text=9
#5  5   some têxt=9
  
d1$V1[regexpr("some téxt=9", d1$V2) > 0] <- 9
d1$V1[regexpr("söme text=9", d1$V2) > 0] <- 9
d1$V1[regexpr("some têxt=9", d1$V2) > 0] <- 9
d1$V1[regexpr("sóme tèxt=9", d1$V2) > 0] <- 9
d1$V1[regexpr("some text = 9", d1$V2) > 0] <- 9

d1
#  V1            V2
#1  9 some text = 9
#2  9   some téxt=9
#3  9   sóme tèxt=9
#4  9   söme text=9
#5  9   some têxt=9
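
If the aim is to match regardless of the accent, one possible approach (only a sketch, not tested beyond this toy data) is to fold the accented letters to plain ASCII before matching. The helper name strip_accents and its hand-picked character list are my own assumptions for illustration; it also assumes the strings are UTF-8 encoded. grepl() with fixed = TRUE treats the pattern as a literal substring, so "=" and other characters are not interpreted as regex syntax.

```r
# Same example data as above (strings assumed to be UTF-8 encoded)
d1 <- data.frame(V1 = 1:5,
                 V2 = c("some text = 9", "some téxt=9", "sóme tèxt=9",
                        "söme text=9", "some têxt=9"),
                 stringsAsFactors = FALSE)

# Hypothetical helper: map a hand-picked set of accented letters to ASCII.
# Extend both strings (same length, same order) for other characters.
strip_accents <- function(x) chartr("àèéêóö", "aeeeoo", x)

# Fold accents once, then do a literal (non-regex) substring match
folded <- strip_accents(d1$V2)
hit <- grepl("some text=9", folded, fixed = TRUE)
d1$V1[hit] <- 9
```

With this data, rows 2 to 5 are matched by the single unaccented pattern. Row 1 is not, because it has spaces around "=", so the spacing in the pattern has to agree with the data as well.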

A.K.




----- Original Message -----
From: Luca Meyer <lucam1...@gmail.com>
To: r-help@r-project.org
Cc: 
Sent: Monday, August 6, 2012 1:55 AM
Subject: [R] regexpr with accents

Hello,

I have built an expression to find out whether a given substring is included in
a larger string. It works like this:

d1$V1[regexpr("some text = 9",d1$V2)>0] <- 9

This works fine as long as "some text" contains only standard ASCII characters.
However, it does not work when accented characters are included, as in the following:

d1$V1[regexpr("some tèxt = 9",d1$V2)>0] <- 9

I have tried to substitute "è" with several wildcards, but that did not work. Can
anyone suggest how to make the expression match the string while ignoring the accent?

Thank you in advance,

Luca

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

