On Fri, 8 May 2009, Rock Ouimet wrote:

I am new to R (ex SAS user) , and I cannot merge two data frames without
getting duplicated rows in the results. How to avoid this happening without
using the unique() function?

1. First data frame is called "tmv" with 6 variables and 239 rows:

tmv[1:10,]
     temps       nom        prenom sexe dist style
1  01:59:36       Cyr         Steve    H   45  free
2  02:09:55  Gosselin         Erick    H   45  free
3  02:12:18 Desfosses         Sacha    H   45  free
4  02:12:23  Lapointe     Sebastien    H   45  free
5  02:12:52    Labrie        Michel    H   45  free
6  02:12:54   Leblanc        Michel    H   45  free
7  02:13:02 Thibeault       Sylvain    H   45  free
8  02:13:49    Martel      Stephane    H   45  free
9  02:14:03    Lavoie Jean-Philippe    H   45  free
10 02:14:05    Boivin   Jean-Claude    H   45  free

Its structure is:
str(tmv)
'data.frame':   239 obs. of  6 variables:
$ temps :Class 'times'  atomic [1:239] 0.0831 0.0902 0.0919 0.0919 0.0923
...
 .. ..- attr(*, "format")= chr "h:m:s"
$ nom   : Factor w/ 167 levels "Aubut","Audy",..: 45 84 55 105 98 110 158
117 109 22 ...
$ prenom: Factor w/ 135 levels "Alain","Alexandre",..: 128 33 121 122 93 93
130 126 63 59 ...
$ sexe  : Factor w/ 2 levels "F","H": 2 2 2 2 2 2 2 2 2 2 ...
$ dist  : int  45 45 45 45 45 45 45 45 45 45 ...
$ style : Factor w/ 2 levels "clas","free": 2 2 2 2 2 2 2 2 2 2 ...


2. The second data frame is called "meil2" with 4 variables and 16 rows;
meil2[1:10,]
  dist sexe style     meil
1    38    F  clas 02:43:17
2    38    F  free 02:24:46
3    38    H  clas 02:37:36
4    38    H  free 01:59:35
5    45    F  clas 03:46:15
6    45    F  free 02:20:15
7    45    H  clas 02:30:07
8    45    H  free 01:59:36
9    38    F  clas 02:43:17
10   38    F  free 02:24:46


Lines 9 and 1 appear to be the same in meil2, as do 2 and 10.  If the 16 rows 
consist of two repeats of 8 rows that would explain why you are getting two 
copies of each individual in the output. unique(meil2) would have just the 
distinct rows.

     -thomas

Thomas Lumley                   Assoc. Professor, Biostatistics
tlum...@u.washington.edu        University of Washington, Seattle

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to