Package: debian-i18n
Severity: normal
Dear all
It seems that the script that parses the l10n-language mailing list archives and
creates the databases in l10n.debian.org/coordination/00data/status.XX stops
(without errors) when there is a month in which no mails were sent to the list
(thinking that we arrived the current month/year).
See for example:
https://l10n.debian.org/coordination/turkish/tr.by_date.html
I believe this pages are generated using the data here:
https://l10n.debian.org/coordination/00data/
the status.tr file there stops parsing at year 2016 month 6:
Package:
Date: 2018-09-18
Year: 2016
Month: 6
Message: 0
Page: 1
This behavior can be easily reproduced in this way:
sudo apt install dl10n
git clone https://salsa.debian.org/l10n-team/dl10n.git
cd dl10n/
mkdir data
touch data/status.tr
./dl10n-spider --year 2016 --month 01 --message 0 tr
(it begins to parse and stops in 2016/06)
./dl10n-spider --year 2016 --month 07 --message 0 tr
(it continues to parse and stops in 2017/05)
./dl10n-spider --year 2017 --month 07 --message 0 tr
(it continues to parse and stops in 2017/11, etc.)
If you look at https://lists.debian.org/debian-l10n-turkish/ the script stops
when it arrives a year/month without mails, but can continue if we provide a
starting point after that empty month.
I've had a look at the code and I think that we should modify the code of the
file
https://salsa.debian.org/l10n-team/dl10n/blob/master/lib/Debian/L10n/Spider.pm ,
sub spider($$$$$@) (Line 173 on). I'm afraid I cannot provide a patch, the code
is difficult for me to understand and I'm not sure what should I change and how.
I hope somebody else can look at this...
Thanks
--
Laura Arjona Reina
https://wiki.debian.org/LauraArjona