Robin H. Johnson wrote:
> On Sun, Jul 05, 2009 at 02:44:07AM +0200, Sebastian Pipping wrote:
>> When collecting information on the SYNC variable for my Summer of Code
>> gentoo stats project I'd like to check if the URL in SYNC is publically
>> known or some private/secret rsync mirror.  The page behind
>>   http://mirrorstats.gentoo.org/rsync/
> Mirrorstats is known to be out of date, because somebody needs to sit
> down and integrate it with the datasources, so manual updates aren't
> needed. Even better, would be hooking it into bouncer2 for the sentry
> output.

What are these datasources?

What kind of integration are you thinking of?


> It needs somebody to update it and hook at into the SOURCE of this data:
> http://www.gentoo.org/main/en/mirrors3.xml
> 
> But wait, you say, that page is distfiles mirrors? Mirror-admin have a
> common data source, non-published as it contains private contact details
> for each administrator. From that data source, mirrors3 and rsync
> mirrors gets updated.

I see.


> mirrors.xml - old page, only used by mirrorselect now, manually updated.
> mirrors3.xml - new page, generated from internal dataset.
> mirrors2.xml - not a real page (See
> http://www.gentoo.org/main/en/mirrors2.xml?passthru=1 and the magic
> <mirrorlist> element.

Compared to

   [..]/mirrors.xml?passthru=1

it seems to me that on mirror3

   [..]/mirrors3.xml?passthru=1

passthru= is working in the opposite direction:

  1 turns style sheets on
  0 turns them off (default)

The one for mirrors3 makes less sense to me.
Is this inconsistency intended?


> Relatedly, the original author of mirrorselect retired from Gentoo
> several years ago. The tools-portage team maintain it now, so you should
> co-operate with them. It would be nice if they implemented the mirrors3
> usage too, I think mirror-admin asked them more than a year ago, but I
> can't find the bug.

I agree that would be a good idea and another reason to touch
mirrorselect.  Does it have a source repo somewhere?, not seen any.


> In the meantime, for your original question:
>> is the URL in SYNC public or private
> Simply check by matching against gentoo.org$ in the hostname part of the
> field.

Good idea, now implemented:

http://git.goodpoint.de/?p=smolt-gentoo.git;a=commitdiff;h=aeb14433e7c29a6045fb702775a3455ebb61aa1d


> P.S. Please report empty SYNC variables too ;-). These turn up when
> users/devs have their tree coming from a VCS instead of rsync.

Good point.  Now also implemented, same commit as above.



Sebastian


Reply via email to