Thanks to Dirk for pointing it out in my blog : "thomas" and "tlumley" are the same person. Also "martyn" and "plummer", and "paul" and "murrell" ... updated number of commits per author below

> sort( table( simple$author ), decreasing=T )

     ripley    maechler      hornik          pd     murdoch       iacus
      20523       10207        6744        4655        1370         915
     leisch         jmc        luke       ihaka     murrell       bates
        743         643         618         481         427         350
    tlumley    rgentlem    urbaneks (no author)      duncan           r
        337         299         274         203         172          96
    plummer      lyndon       guido    deepayan      falcon        mike
         65          54          39          20          18           1


Not quite sure who "r" is (96 commits) and "mike" (1 commit). Also there are 203 commits that have "(no author)"

The other thing is that it does not include data from the pre-svn era.

Romain

On 10/09/2009 03:41 PM, Romain Francois wrote:

Hello,

Today, Brian Ripley commited the revision 50000 of R's svn repository.

I took this as an opportunity to do some data analysis of the log and
posted some code and graphics on my blog:
http://romainfrancois.blog.free.fr/index.php?post/2009/10/09/celebrating-R-commit-50000


The plots of the number of commits per day (and per month) indicates a
peak in 2002.

Here are the top 20 modified files :

 > most.updated <- head( sort( table( data$file ), decreasing=TRUE ), 20 )
 > cat( sprintf( "%40s : %6d", names(most.updated), most.updated ), sep
="\n" )
/trunk/NEWS : 5449
/trunk/date-stamp : 2219
/trunk/BUGS : 1485
/trunk/src/gnuwin32/CHANGES : 994
/trunk/configure : 979
/trunk/doc/manual/R-FAQ.texi : 948
/trunk/doc/manual/R-exts.texi : 814
/trunk/doc/manual/R-admin.texi : 666
/trunk/src/main/names.c : 634
/trunk/tests/reg-tests-1.R : 621
/trunk/src/gnuwin32/Makefile : 542
/trunk/FAQ : 471
/trunk/doc/html/faq.html : 458
/trunk/src/include/Defn.h : 455
/trunk/configure.ac : 447
/trunk/src/scripts/check.in : 433
/trunk/src/library/tools/R/QC.R : 397
/branches/unlabeled-1.1139.4/date-stamp : 386
/trunk/src/main/connections.c : 349
/trunk/src/main/plot.c : 325


The most modified file extensions (the second element indicates files
without extensions) :

 > head( sort( table( data$extension ), decreasing=T ) , 24 )

Rd c R po in mo h
36766 33999 29849 25221 10215 8806 6280 6147
texi gmo html save pot f win m4
5274 3705 1920 1668 1285 1148 1060 1041
pl ac afm pm mac y packages sh
910 527 482 461 402 384 311 215

Many thanks to the R core team for these 50 000 commits.

Romain



--
Romain Francois
Professional R Enthusiast
+33(0) 6 28 91 30 30
http://romainfrancois.blog.free.fr
|- http://tr.im/BcPw : celebrating R commit #50000
|- http://tr.im/ztCu : RGG #158:161: examples of package IDPmisc
`- http://tr.im/yw8E : New R package : sos

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to