Marc Jekel wrote:
Hi R Fans,

I stumbled across a strange (I think) bug in R 2.9.1. I have read in a data file with 5934 rows and 9 columns with the commands:

daten = data.frame(read.table("C:/fussball.dat",header=TRUE))

Then I needed a subset of the data file:

newd = daten[daten[,1]!=daten[,2],]

--> two rows do not meet the logical condition and are dropped.

The strange thing about it: When I print newd in the R console, the output still shows 5934 rows. When I check the number of rows with NROW(newd), I get 5932 as output. When I print newd[5934, ], I get NAs. When I print newd[5932, ], I get the row that is listed in line 5934 when I just type in newd. This is totally crazy! Has anyone had the same problem? Thanks for a post.

You're confusing row names and row numbers. When you printed newd, did you actually count the number of lines? Thought so...

It isn't any stranger than this:

> data.frame(x=rnorm(6),y=rnorm(6))[-5,]
           x          y
1  0.9457385 -1.1398275
2 -1.1683732 -0.7269941
3  0.9942821  0.9310146
4 -2.0839580 -0.6261567
6  1.7225233  0.2457897
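
The same effect can be reproduced on a small scale. The sketch below uses a made-up six-row data frame (the names d, sub, and the values are illustrative assumptions, not the contents of fussball.dat) and the same kind of subsetting as in the question. It shows that the printed labels are row names carried over from the original data frame, while numeric indexing works by position. Resetting the row names is one possible way to make the two agree again, if that is wanted.

```r
# Hypothetical toy data standing in for the real file
d <- data.frame(a = c(1, 2, 3, 4, 5, 6),
                b = c(1, 0, 3, 0, 0, 0))

# Same pattern as in the question: keep rows where column 1 != column 2.
# Rows 1 and 3 have equal values and are dropped.
sub <- d[d[, 1] != d[, 2], ]

n_rows        <- nrow(sub)      # number of rows actually kept
labels_before <- rownames(sub)  # original labels survive the subsetting

by_name     <- sub["6", ]       # character index: looks up the row NAMED "6"
by_position <- sub[6, ]         # numeric index: 6th row by position, which
                                # does not exist here, so the result is all NA

# Optional: discard the old labels so names match positions again
rownames(sub) <- NULL
labels_after <- rownames(sub)
```

In the original case, newd["5934", ] (a character index) would look up the row by its name, whereas newd[5934, ] asks for a position beyond the 5932 rows that remain.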


--
   O__  ---- Peter Dalgaard             Øster Farimagsgade 5, Entr.B
  c/ /'_ --- Dept. of Biostatistics     PO Box 2099, 1014 Cph. K
 (*) \(*) -- University of Copenhagen   Denmark      Ph:  (+45) 35327918
~~~~~~~~~~ - (p.dalga...@biostat.ku.dk)              FAX: (+45) 35327907

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
