I have a dataset w/ 184K obs & 16 variables.  In SAS I proc sort nodupkey it
in seconds by 11 variables.
I tried to do the same thing in R using both the unique & then the
!duplicated functions but it just hangs there & I get no output.  Does
anyone know how to solve this?

This is how I tried to do it in R:


detail3 <-
[!duplicated(c(detail2$TDATE,detail2$FIRM,detail2$CM,detail2$BRANCH,
                             detail2$BEGTIME,
detail2$ENDTIME,detail2$OTYPE,detail2$OCOND,
                             detail2$ACCTYP
,detail2$OSIDE,detail2$SHARES,detail2$STOCKS,
                             detail2$STKFUL)),]

detail3 <-
unique(detail2[,c(detail2$TDATE,detail2$FIRM,detail2$CM,detail2$BRANCH,
          detail2$BEGTIME, detail2$ENDTIME,detail2$OTYPE,detail2$OCOND,
          detail2$ACCTYP ,detail2$OSIDE,detail2$SHARES,detail2$STOCKS,
          detail2$STKFUL)])




Thanks in advance



--
View this message in context: 
http://r.789695.n4.nabble.com/Deduping-in-R-by-multiple-variables-tp4641778.html
Sent from the R help mailing list archive at Nabble.com.

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to