On 29/08/2011 3:29 PM, Imbeaud (Inserm U674) wrote:
Hi,
I have to deal with a huge .txt table (~485.577 rows and 469 columns,> 1.5
Go file)
I used the read.table function
> tmp=read.table("data.txt", header=TRUE, sep="\t", fill=TRUE,
na.strings="NA", comment.char="", stringsAsFactors = FALSE)
However, I encounter troubles in interpreting some "\t" separator.. A column
displays \t\t\t\t characters fusioning consecutive columns and rows?
see:
"Body;3UTR\t\t\t\t\t\t\t\t\t\n3\tcg00000109\t59755374\t59755374\t0.8353283\t2657\t7\t7\t122.8385\t241.5193\t354\t2303\t0.00\t0.005"
Is there any way to deal with such problem?
I would guess there is a problem with quotes in the file, so R thinks
that line is a quoted string.
Set quote="" to disable recognition of quotes. See ?read.table.
Duncan Murdoch
______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.