On Nov 18, 2011, at 9:28 AM, jim holtman wrote:
It is pretty straightforward in R:
x <- readLines(textConnection("sadf|asdf|asdf\tqwer|qwer|qwer\tzxcv|
zxcv|zxfcgv"))
closeAllConnections()
# convert tabs to newlines
x <- gsub("\t", "\n", x)
Did the rules get liberalized for escaping patterns? Or have I been
unnecessarily expending backslashes all these years. I thought that
one needed 3 blackslashes. This code does work and I am wondering if/
when I "didn't get the memo". (I do see that there is a line early in
the ?regex page that suggests I have been deluded all along.)
"The current implementation interprets \a as BEL, \e asESC, \f as FF,
\n as LF, \r as CR and \t as TAB."
> x <- readLines(textConnection("sadf|asdf|asdf\tqwer|qwer|qwer\tzxcv|
zxcv|zxfcgv"))
> closeAllConnections()
> # convert tabs to newlines
> x2 <- gsub("\\\t", "\n", x)
> x2
[1] "sadf|asdf|asdf\nqwer|qwer|qwer\nzxcv|zxcv|zxfcgv"
So I guess my question is (now) why the triple-slash technique even
works?
--
David.
# write out to a temp file and then read in as a data frame
myFile <- tempfile()
writeLines(x, con = myFile)
x.df <- read.table(myFile, sep = "|")
x.df
V1 V2 V3
1 sadf asdf asdf
2 qwer qwer qwer
3 zxcv zxcv zxfcgv
On Fri, Nov 18, 2011 at 9:13 AM, Langston, Jim
<jim.langs...@compuware.com> wrote:
Thanks Paul,
That's the path I was marching down, I was hoping for something
a little cleaner, I do the same with Perl or Java.
Jim
On 11/18/11 8:35 AM, "Paul Hiemstra" <paul.hiems...@knmi.nl> wrote:
Hi Jim,
You can read the text file using readLines. This puts each line in
the
file into an element of a list. Then you can go through the lines
manually (e.g. using grep, sub, strsplit) and create your
data.frame.
cheers,
Paul
On 11/18/2011 12:37 PM, Langston, Jim wrote:
Hi all,
I've been scratching and poking, but basically, the file I need
to read
has
two delimiters that I need to contend with. The first is that the
file
contains
tabs (\t) , instead of newlines (\n), and the second is that the
fields
have
| for the seperators. I can easily do a read if I first convert
the \t
to
\n
and then use read.table to get the file read with the |
separator. But,
what I would really like to do, is do this all within R. I have a
lot of
files
to read and do analysis on.
I can read the data into a table using the \t has delimiter, but
can't
figure
out how to take that table data and use the | for separation,
I've look
at
string splits, etc. but haven't figured out how to split the whole
table.
Any thoughts ? hints ?
Thanks,
Jim
The contents of this e-mail are intended for the named a...
{{dropped:6}}
The contents of this e-mail are intended for the named addressee
only. It contains information that may be confidential. Unless you
are the named addressee or an authorized designee, you may not copy
or use it, or disclose it to anyone else. If you received it in
error please notify us immediately and then destroy it.
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
--
Paul Hiemstra, Ph.D.
Global Climate Division
Royal Netherlands Meteorological Institute (KNMI)
Wilhelminalaan 10 | 3732 GK | De Bilt | Kamer B 3.39
P.O. Box 201 | 3730 AE | De Bilt
tel: +31 30 2206 494
http://intamap.geo.uu.nl/~paul
http://nl.linkedin.com/pub/paul-hiemstra/20/30b/770
______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
--
Jim Holtman
Data Munger Guru
What is the problem that you are trying to solve?
Tell me what you want to do, not how you want to do it.
______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
David Winsemius, MD
West Hartford, CT
______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.