Hi all,
I'm writing a script to do some basic text analysis in R. Let's assume
I have a data frame named data which contains a column named 'utt'
which contains strings. Is there a straightforward way to achieve
something like this:
data$ContainsThe <- ifelse(startsWith(data$Utt,"the"),"y","n")
or
data$ContainsThe <- ifelse(contains(data$Utt,"the"),"y","n")
?
I tried using grep
data$ContainsThe <- ifelse(grep("the",data$Utt),"y","n")
but this doesn't work becausee grep only returns the rows for which
grep succeeded.
Thanks for any pointers
Claus
______________________________________________
[email protected] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.