Re: [R] problem with factor levels

2012-12-04 Thread Jeremy.Shearman
Oh, your skepticism was spot on! I was using excel to check the output (silly, but I am still in the process of moving from excel to R) and there was a discrepancy in the number of output from R and excel. Turns out the problem was with excel and not with R at all. That's a relief. SOLVED -- V

[R] problem with factor levels

2012-12-04 Thread Jeremy.Shearman
Hi I have a data.frame with 371,718 obs. of 12 variables (see below for an str). My problem is with V1, a Factor w/ 93144 levels, there should actually be 93994 levels. Each entry looks like: comp[number]_c[number]_seq[number] for example comp215489_c0_seq40 R is grouping as though the last n