On 8/13/2018 5:54 PM, Jeff King wrote:
So I try not to think too hard on metrics, and just use them to get a
rough view on who is active.

I've been very interested in measuring community involvement, with the knowledge that any metric is flawed and we should not ever say "this metric is how we measure the quality of a contributor". It can be helpful, though, to track some metrics and their change over time.

Here are a few measurements we can make:

1. Number of (non-merge) commit author tag-lines.

    using git repo:

  > git shortlog --no-merges --since 2017 -sne junio/next | head -n 20
   284  Nguyễn Thái Ngọc Duy <[email protected]>
   257  Jeff King <[email protected]>
   206  Stefan Beller <[email protected]>
   192  brian m. carlson <[email protected]>
   159  Brandon Williams <[email protected]>
   149  Junio C Hamano <[email protected]>
   137  Elijah Newren <[email protected]>
   116  René Scharfe <[email protected]>
   112  Johannes Schindelin <[email protected]>
   105  Ævar Arnfjörð Bjarmason <[email protected]>
    96  Jonathan Tan <[email protected]>
    93  SZEDER Gábor <[email protected]>
    78  Derrick Stolee <[email protected]>
    76  Martin Ågren <[email protected]>
    66  Michael Haggerty <[email protected]>
    61  Eric Sunshine <[email protected]>
    46  Christian Couder <[email protected]>
    36  Phillip Wood <[email protected]>
    35  Jonathan Nieder <[email protected]>
    33  Thomas Gummerer <[email protected]>

2. Number of other commit tag-lines (Reviewed-By, Helped-By, Reported-By, etc.).

    Using git repo:

    $ git log --since=2018-01-01 junio/next|grep by:|grep -v Signed-off-by:|sort|uniq -c|sort -nr|head -n 20

     66     Reviewed-by: Stefan Beller <[email protected]>
     22     Reviewed-by: Jeff King <[email protected]>
     19     Reviewed-by: Jonathan Tan <[email protected]>
     12     Helped-by: Eric Sunshine <[email protected]>
     11     Helped-by: Junio C Hamano <[email protected]>
      9     Helped-by: Jeff King <[email protected]>
      8     Reviewed-by: Elijah Newren <[email protected]>
      7     Reported-by: Ramsay Jones <[email protected]>
      7     Acked-by: Johannes Schindelin <[email protected]>
      7     Acked-by: Brandon Williams <[email protected]>
      6     Reviewed-by: Eric Sunshine <[email protected]>
      6     Helped-by: Johannes Schindelin <[email protected]>
      5     Mentored-by: Christian Couder <[email protected]>
      5     Acked-by: Johannes Schindelin <[email protected]>
      4     Reviewed-by: Jonathan Nieder <[email protected]>
      4     Reviewed-by: Johannes Schindelin <[email protected]>
      4     Helped-by: Stefan Beller <[email protected]>
      4     Helped-by: René Scharfe <[email protected]>
      3     Reviewed-by: Martin Ågren <[email protected]>
      3     Reviewed-by: Lars Schneider <[email protected]>

    (There does not appear to be enough density here to make a useful metric.)

3. Number of email messages sent.

    Using mailing list repo:

$ git shortlog --since 2017 -sne | head -n 20
  3749  Junio C Hamano <[email protected]>
  2213  Stefan Beller <[email protected]>
  2112  Jeff King <[email protected]>
  1106  Nguyễn Thái Ngọc Duy <[email protected]>
  1028  Johannes Schindelin <[email protected]>
   965  Ævar Arnfjörð Bjarmason <[email protected]>
   956  Brandon Williams <[email protected]>
   947  Eric Sunshine <[email protected]>
   890  Elijah Newren <[email protected]>
   753  brian m. carlson <[email protected]>
   677  Duy Nguyen <[email protected]>
   646  Jonathan Nieder <[email protected]>
   629  Derrick Stolee <[email protected]>
   545  Christian Couder <[email protected]>
   515  Jonathan Tan <[email protected]>
   425  Johannes Schindelin <[email protected]>
   425  Martin Ågren <[email protected]>
   420  Jeff Hostetler <[email protected]>
   420  SZEDER Gábor <[email protected]>
   363  Phillip Wood <[email protected]>

3. Number of threads started by user.

    (For this and the measurements below, I imported emails into a SQL table with columns [commit, author, date, message-id, in-reply-to, subject] and ran queries)

SELECT TOP 20
       COUNT(*) as NumSent
      ,[Author]
  FROM [git].[dbo].[mailing-list]
  WHERE [In-Reply-To] = ''
        AND CONVERT(DATETIME,[Date]) > CONVERT(DATETIME, '01-01-2018 00:00')
GROUP BY [Author]
ORDER BY NumSent DESC

| NumSent | Author                     |
|---------|----------------------------|
| 76      | Junio C Hamano             |
| 64      | Stefan Beller              |
| 54      | Philip Oakley              |
| 50      | Nguyá»…n Thái Ngọc Duy   |
| 49      | Robert P. J. Day           |
| 47      | Christian Couder           |
| 36      | Ramsay Jones               |
| 34      | Elijah Newren              |
| 34      | SZEDER Gábor              |
| 33      | Johannes Schindelin        |
| 31      | Jeff King                  |
| 30      | Ævar Arnfjörð Bjarmason |
| 24      | Jonathan Tan               |
| 22      | Alban Gruin                |
| 22      | brian m. carlson           |
| 18      | Randall S. Becker          |
| 15      | Paul-Sebastian Ungureanu   |
| 15      | Jeff Hostetler             |
| 15      | Brandon Williams           |
| 15      | Luke Diamand               |

4. Number of threads where the user participated

(This is measured by completing the transitive closure of In-Reply-To edges into a new 'BaseMessage' column.)

SELECT TOP 20
       COUNT(BaseMessage) as NumResponded
      ,Author
  FROM [git].[dbo].[mailing-list]
  WHERE [In-Reply-To] <> ''
        AND CONVERT(DATETIME,[Date]) > CONVERT(DATETIME, '01-01-2018 00:00')
GROUP BY Author
ORDER BY NumResponded DESC

| NumResponded | Author                     |
|--------------|----------------------------|
| 2084         | Junio C Hamano             |
| 1596         | Stefan Beller              |
| 1211         | Jeff King                  |
| 1120         | Johannes Schindelin        |
| 1021         | Nguyá»…n Thái Ngọc Duy   |
| 799          | Eric Sunshine              |
| 797          | Ævar Arnfjörð Bjarmason |
| 693          | Brandon Williams           |
| 654          | Duy Nguyen                 |
| 600          | Elijah Newren              |
| 593          | brian m. carlson           |
| 591          | Derrick Stolee             |
| 318          | SZEDER Gábor              |
| 299          | Jonathan Tan               |
| 286          | Christian Couder           |
| 263          | Jonathan Nieder            |
| 257          | Phillip Wood               |
| 256          | Derrick Stolee             |
| 238          | Taylor Blau                |
| 216          | Martin Ã…gren              |

(Note, some names have not been de-duplicated across multiple email addresses, but the email addresses are removed from these tables since I'm using a markdown generator that strips the emails in < >.)

If you have other ideas for fun measurements, then please let me know.

Thanks,

-Stolee


Reply via email to