On 8/13/2018 5:54 PM, Jeff King wrote:
> So I try not to think too hard on metrics, and just use them to get a
> rough view on who is active.
I've been very interested in measuring community involvement, with the
knowledge that any metric is flawed and we should not ever say "this
metric is how we measure the quality of a contributor". It can be
helpful, though, to track some metrics and their change over time.
Here are a few measurements we can make:
1. Number of (non-merge) commit author tag-lines.
Using git repo:
$ git shortlog --no-merges --since 2017 -sne junio/next | head -n 20
284 Nguyễn Thái Ngọc Duy <[email protected]>
257 Jeff King <[email protected]>
206 Stefan Beller <[email protected]>
192 brian m. carlson <[email protected]>
159 Brandon Williams <[email protected]>
149 Junio C Hamano <[email protected]>
137 Elijah Newren <[email protected]>
116 René Scharfe <[email protected]>
112 Johannes Schindelin <[email protected]>
105 Ævar Arnfjörð Bjarmason <[email protected]>
96 Jonathan Tan <[email protected]>
93 SZEDER Gábor <[email protected]>
78 Derrick Stolee <[email protected]>
76 Martin Ågren <[email protected]>
66 Michael Haggerty <[email protected]>
61 Eric Sunshine <[email protected]>
46 Christian Couder <[email protected]>
36 Phillip Wood <[email protected]>
35 Jonathan Nieder <[email protected]>
33 Thomas Gummerer <[email protected]>
2. Number of other commit tag-lines (Reviewed-By, Helped-By,
Reported-By, etc.).
Using git repo:
$ git log --since=2018-01-01 junio/next|grep by:|grep -v Signed-off-by:|sort|uniq -c|sort -nr|head -n 20
66 Reviewed-by: Stefan Beller <[email protected]>
22 Reviewed-by: Jeff King <[email protected]>
19 Reviewed-by: Jonathan Tan <[email protected]>
12 Helped-by: Eric Sunshine <[email protected]>
11 Helped-by: Junio C Hamano <[email protected]>
9 Helped-by: Jeff King <[email protected]>
8 Reviewed-by: Elijah Newren <[email protected]>
7 Reported-by: Ramsay Jones <[email protected]>
7 Acked-by: Johannes Schindelin <[email protected]>
7 Acked-by: Brandon Williams <[email protected]>
6 Reviewed-by: Eric Sunshine <[email protected]>
6 Helped-by: Johannes Schindelin <[email protected]>
5 Mentored-by: Christian Couder <[email protected]>
5 Acked-by: Johannes Schindelin <[email protected]>
4 Reviewed-by: Jonathan Nieder <[email protected]>
4 Reviewed-by: Johannes Schindelin <[email protected]>
4 Helped-by: Stefan Beller <[email protected]>
4 Helped-by: René Scharfe <[email protected]>
3 Reviewed-by: Martin Ågren <[email protected]>
3 Reviewed-by: Lars Schneider <[email protected]>
(There does not appear to be enough density here to make a useful
metric.)
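One possible way to get more density out of these trailers, offered only as a sketch, is to pool every trailer type per person instead of counting each "Type: Person" pair separately. The function name `pooled_trailer_counts` and the sample names and addresses are placeholders, and the input is assumed to be the plain text printed by the `git log` command above.

```python
import re
from collections import Counter

# Matches lines such as "    Reviewed-by: A U Thor <author@example.com>".
TRAILER_RE = re.compile(
    r"^\s*([A-Za-z-]+-by):\s*(.+?)\s*(?:<[^>]*>)?\s*$", re.IGNORECASE
)


def pooled_trailer_counts(log_text, skip=("signed-off-by",)):
    """Count '*-by:' trailers per person name, ignoring the trailer type."""
    counts = Counter()
    for line in log_text.splitlines():
        match = TRAILER_RE.match(line)
        if match and match.group(1).lower() not in skip:
            counts[match.group(2)] += 1
    return counts


# Placeholder log excerpt; real input would come from `git log`.
sample = """\
    Reviewed-by: A U Thor <author@example.com>
    Helped-by: A U Thor <other@example.com>
    Signed-off-by: C O Mitter <committer@example.com>
    Reported-by: C O Mitter <committer@example.com>
"""
counts = pooled_trailer_counts(sample)
```

Pooling hides the difference between a review and a bug report, so the result is at best a rough activity signal.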
3. Number of email messages sent.
Using mailing list repo:
$ git shortlog --since 2017 -sne | head -n 20
3749 Junio C Hamano <[email protected]>
2213 Stefan Beller <[email protected]>
2112 Jeff King <[email protected]>
1106 Nguyễn Thái Ngọc Duy <[email protected]>
1028 Johannes Schindelin <[email protected]>
965 Ævar Arnfjörð Bjarmason <[email protected]>
956 Brandon Williams <[email protected]>
947 Eric Sunshine <[email protected]>
890 Elijah Newren <[email protected]>
753 brian m. carlson <[email protected]>
677 Duy Nguyen <[email protected]>
646 Jonathan Nieder <[email protected]>
629 Derrick Stolee <[email protected]>
545 Christian Couder <[email protected]>
515 Jonathan Tan <[email protected]>
425 Johannes Schindelin <[email protected]>
425 Martin Ågren <[email protected]>
420 Jeff Hostetler <[email protected]>
420 SZEDER Gábor <[email protected]>
363 Phillip Wood <[email protected]>
4. Number of threads started by user.
(For this and the measurements below, I imported emails into a SQL
table with columns [commit, author, date, message-id, in-reply-to,
subject] and ran queries)
SELECT TOP 20
COUNT(*) as NumSent
,[Author]
FROM [git].[dbo].[mailing-list]
WHERE [In-Reply-To] = ''
AND CONVERT(DATETIME,[Date]) > CONVERT(DATETIME, '01-01-2018 00:00')
GROUP BY [Author]
ORDER BY NumSent DESC
| NumSent | Author |
|---------|----------------------------|
| 76 | Junio C Hamano |
| 64 | Stefan Beller |
| 54 | Philip Oakley |
| 50 | Nguyễn Thái Ngọc Duy |
| 49 | Robert P. J. Day |
| 47 | Christian Couder |
| 36 | Ramsay Jones |
| 34 | Elijah Newren |
| 34 | SZEDER Gábor |
| 33 | Johannes Schindelin |
| 31 | Jeff King |
| 30 | Ævar Arnfjörð Bjarmason |
| 24 | Jonathan Tan |
| 22 | Alban Gruin |
| 22 | brian m. carlson |
| 18 | Randall S. Becker |
| 15 | Paul-Sebastian Ungureanu |
| 15 | Jeff Hostetler |
| 15 | Brandon Williams |
| 15 | Luke Diamand |
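The import into a table with columns [commit, author, date, message-id, in-reply-to, subject] could look roughly like the sketch below. It assumes the raw text of each message is available alongside its commit id in the mailing list repo; the function name `message_to_row` and the sample message are hypothetical, and the actual import may have been done differently.

```python
from email import message_from_string
from email.utils import parseaddr, parsedate_to_datetime


def message_to_row(commit_id, raw_message):
    """Turn one raw email message into a row with the six columns."""
    msg = message_from_string(raw_message)
    name, address = parseaddr(msg.get("From", ""))
    date_header = msg.get("Date")
    return {
        "commit": commit_id,
        # Fall back to the address when the From header has no display name.
        "author": name or address,
        "date": parsedate_to_datetime(date_header) if date_header else None,
        "message-id": (msg.get("Message-ID") or "").strip(),
        # Thread starters have no In-Reply-To header; store '' so that a
        # filter like [In-Reply-To] = '' selects them.
        "in-reply-to": (msg.get("In-Reply-To") or "").strip(),
        "subject": msg.get("Subject", ""),
    }


# Placeholder message; the commit id and addresses are made up.
raw = (
    "From: A U Thor <author@example.com>\n"
    "Date: Mon, 13 Aug 2018 17:54:00 -0400\n"
    "Message-ID: <root@example.com>\n"
    "Subject: [PATCH] example\n"
    "\n"
    "body\n"
)
row = message_to_row("abc123", raw)
```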
5. Number of threads where the user participated
(This is measured by completing the transitive closure of In-Reply-To
edges into a new 'BaseMessage' column.)
SELECT TOP 20
COUNT(BaseMessage) as NumResponded
,Author
FROM [git].[dbo].[mailing-list]
WHERE [In-Reply-To] <> ''
AND CONVERT(DATETIME,[Date]) > CONVERT(DATETIME, '01-01-2018 00:00')
GROUP BY Author
ORDER BY NumResponded DESC
| NumResponded | Author |
|--------------|----------------------------|
| 2084 | Junio C Hamano |
| 1596 | Stefan Beller |
| 1211 | Jeff King |
| 1120 | Johannes Schindelin |
| 1021 | Nguyễn Thái Ngọc Duy |
| 799 | Eric Sunshine |
| 797 | Ævar Arnfjörð Bjarmason |
| 693 | Brandon Williams |
| 654 | Duy Nguyen |
| 600 | Elijah Newren |
| 593 | brian m. carlson |
| 591 | Derrick Stolee |
| 318 | SZEDER Gábor |
| 299 | Jonathan Tan |
| 286 | Christian Couder |
| 263 | Jonathan Nieder |
| 257 | Phillip Wood |
| 256 | Derrick Stolee |
| 238 | Taylor Blau |
| 216 | Martin Ågren |
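The transitive closure of In-Reply-To edges can be sketched as below: each message is walked up its reply chain until a thread starter is reached, and that starter becomes its 'BaseMessage'. The function names and sample rows are hypothetical. Two assumptions are made that the original does not state: a reply whose parent is missing from the archive is treated as its own root, and participation is counted as distinct thread roots per author, whereas COUNT(BaseMessage) in the query counts reply rows.

```python
from collections import defaultdict


def find_base_messages(in_reply_to):
    """Map each message-id to the root message-id of its thread.

    in_reply_to maps message-id -> parent message-id ('' for starters).
    """
    base = {}
    for start in in_reply_to:
        path = []
        current = start
        # Walk up until we reach an already-resolved message or a cycle.
        while current not in base and current not in path:
            path.append(current)
            parent = in_reply_to.get(current, "")
            if not parent or parent not in in_reply_to:
                break  # thread starter, or parent missing from the archive
            current = parent
        root = base.get(current, current)
        for message_id in path:
            base[message_id] = root
    return base


def threads_participated(rows):
    """rows: iterable of (author, message_id, in_reply_to) tuples."""
    rows = list(rows)
    base = find_base_messages({mid: parent for _, mid, parent in rows})
    threads = defaultdict(set)
    for author, mid, parent in rows:
        if parent:  # replies only, like WHERE [In-Reply-To] <> ''
            threads[author].add(base[mid])
    return {author: len(roots) for author, roots in threads.items()}


# Placeholder data: two threads rooted at <1> and <5>.
rows = [
    ("A", "<1>", ""), ("B", "<2>", "<1>"), ("A", "<3>", "<2>"),
    ("B", "<4>", "<3>"), ("C", "<5>", ""), ("B", "<6>", "<5>"),
]
base = find_base_messages({mid: parent for _, mid, parent in rows})
participation = threads_participated(rows)
```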
(Note, some names have not been de-duplicated across multiple email
addresses, but the email addresses are removed from these tables since
I'm using a markdown generator that strips the emails in < >.)
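A minimal sketch of one way to de-duplicate such entries is to sum the counts of `git shortlog -sne` lines that share the same display name. The function name `merge_by_name` is hypothetical, and the addresses in the sample are placeholders rather than real ones.

```python
from collections import Counter


def merge_by_name(shortlog_lines):
    """Sum `git shortlog -sne` counts for entries with the same name."""
    totals = Counter()
    for line in shortlog_lines:
        line = line.strip()
        if not line:
            continue
        count, identity = line.split(None, 1)
        # Drop the "<address>" part and keep only the display name.
        name = identity.split("<", 1)[0].strip()
        totals[name] += int(count)
    return totals.most_common()


# Counts taken from the list above; addresses are placeholders.
sample = [
    "  1028 Johannes Schindelin <one@example.com>",
    "   425 Johannes Schindelin <two@example.com>",
    "   677 Duy Nguyen <three@example.com>",
]
merged = merge_by_name(sample)
```

This only merges identical spellings. Names written differently for the same person (for example "Duy Nguyen" and "Nguyễn Thái Ngọc Duy") would still need an explicit alias table, such as entries in a .mailmap file, which git shortlog honors.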
If you have other ideas for fun measurements, then please let me know.
Thanks,
-Stolee