On 29/03/15 at 17:47 +0200, Mattia Rizzolo wrote:
> package: qa.debian.org
> user: qa.debian....@packages.debian.org
> usertag: udd
> 
> From #debian-qa@OFTC
> <mapreri> lucas: might I ask you to provide udd dumps more often than daily?
>           like, every 6 hours? this would greatly help the public udd mirror 
> to
>           stay in sync (and I guess it could also help others, to avoid doing
>           things like what formorer complained the other day)
> <lucas>   mapreri: yes, but it would also be useful to switch to another dump
>           format
> <lucas>   mapreri: could you ask by mail? that way I could Cc the other users
>           of the UDD dumps, and see if we can come up with something that 
> works
>           for everyone
> 
> 
> So, here it is.
> Other than more frequent dumps I'd also welcome a different dump format, maybe
> smaller, to avoid downloading nearly 900MB every time I have to test 
> something.

Hi,

Currently three dumps are generated every day:
- usql.sql.gz, with all the data except the ldap, really_active_dds, and
  pts relations, which are considered "private" data and not suitable for
  wide exposure.
- udd-bugs.sql.xz, with only the bugs data (both archived and
  unarchived -- Andreas Tille was the main user of that -- is this still
  needed?)
- udd-popcon.sql.xz, with only the popcon data (codesearch.d.n needed
  that -- is this still needed?)

Before changing that, I'd like to understand:

1) what is the rationale for the public UDD mirror. Is there a way this
   could be provided from Debian infrastructure, for example by
   whitelisting specific hosts that need UDD access? Is there something
   here that could be acceptable for DSA (Cced)?

2) what is the rationale for the more frequent dumps. It's currently
   being dumped once a day. It's never going to be "in sync" with the
   live instance, unfortunately.

3) Would dumps in "custom format" (pg_dump -Fc) work for you? they allow
   parallel restore with pg_restore.

4) Could some tables be excluded from the dumps?

5) Couldn't you trigger the dumps from your side, by calling pg_dump
   inside an SSH connection to ullmann.d.o?

Lucas

Attachment: signature.asc
Description: Digital signature

Reply via email to