On May 31, 2009, at 12:03 PM, David Winsemius wrote:


On May 31, 2009, at 10:49 AM, Hongyuan Cao wrote:

x = read.table("hongyuan_5_30_forsafe.txt", sep = "\t")

> x = read.table("/Users/davidwinsemius/Downloads/hongyuan_5_30_forsafe.txt", sep = "\t")
> str(x)
'data.frame':   7131 obs. of  74 variables:
$ V1 : Factor w/ 7131 levels "","A28102_at",..: 1 934 120 122 118 126 124 130 128 134 ...
$ V2 : Factor w/ 5528 levels "","2-Sep","6-Mar",..: 1 5528 NA NA NA NA NA NA NA NA ...
$ V3 : Factor w/ 2375 levels "-1","-10","-100",..: 2374 2375 195 106 499 2262 287 483 1254 143 ...
$ V4 : Factor w/ 2326 levels "-1","-10","-100",..: 2325 2326 75 533 1 1442 231 357 299 117 ...

My guess (which became a conclusion after testing) is that you have encountered the stringsAsFactors pitfall: str() shows that even the numeric-looking columns V3 and V4 were read in as factors. It's probably in the R Inferno someplace, but my favorite quote is from Terry Therneau:

The default action of turning every character string into a factor is a plague on the S language.
   ---Terry Therneau,  from s-news, 2004
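
A minimal sketch of the pitfall and one way around it. The file contents below are made up to stand in for your data (the column names id and value are hypothetical), and I am assuming the text in the first row is what pushes the numeric columns into factors. stringsAsFactors = TRUE is written out explicitly because that was the read.table default until R 4.0.0 changed it to FALSE.

```r
# Hypothetical tab-separated file: a text label row sits above the numbers.
tmp <- tempfile(fileext = ".txt")
writeLines(c("id\tvalue", "A28102_at\t-100", "B_at\t-10", "C_at\t25"), tmp)

# No header declared and strings turned into factors:
# the label row is read as data, so both columns become factors.
x_fac <- read.table(tmp, sep = "\t", stringsAsFactors = TRUE)
str(x_fac)

# as.numeric() on a factor returns the internal level codes, not the values.
codes <- as.numeric(x_fac$V2)

# Declaring the header row and keeping strings as character
# lets the numeric column be read as numbers.
x_chr <- read.table(tmp, sep = "\t", header = TRUE, stringsAsFactors = FALSE)
str(x_chr)

# If a factor is already in hand, go through character first
# (the first element is dropped here because it is the label "value").
vals <- as.numeric(as.character(x_fac$V2[-1]))
```

Whether header = TRUE is right for the real file depends on how many label rows it actually has; the skip argument of read.table may be needed instead.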


--
David Winsemius
Heritage Laboratories
West Hartford, CT

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
