I am not in any way affiliated with Apache or Sonatype. So take my opinion with a grain of salt :)
Trying to mirror the entire Maven Central repository will unfortunately get you automatically banned. To circumvent that I would suggest you setup your own Maven Central mirror first. [1] [1] https://maven.apache.org/guides/mini/guide-mirror-settings.html#creating-your-own-mirror On Mon, Jun 14, 2021, 12:12 Boris Baldassari <[email protected]> wrote: > Hiho good people, > > I am currently developing a Maven repositories connector for the > Software Heritage Foundation [1]. In a nutshell, the SWH aims to > archive all existing source code in the world, and provides useful > publicly available services and related tools (unique IDs/DOIs, search, > datasets, graph tools..). It's all open-source, and many large forges > and software systems have already been archived (GitHub, GitLab, npm, > pypi, debian packages, CRAN..) [2]. Now we would like to archive the > Maven ecosystem. > > [1] https://www.softwareheritage.org/ > [2] https://archive.softwareheritage.org/ > > I'm reaching out to ask for wisdom and start a discussion about how this > could be achieved without impacting anybody, i.e. neither Maven > repositories maintainers nor the users. Our plan for now is to use the > maven indexer indexes for the listing, and then download poms and source > jars, in a way that we see as the most efficient and fair. We of course > respect all rate-limiting policies (and http error codes), and we are > polite and patient (although tenacious). > > So, here are my questions: > > * Who should we talk to to achieve that? i.e. are there maven repository > maintainers on the list, or do you know of a better place to ask? > > * Although we believe the above mentioned process is the most efficient > and fair one, maybe there is a better way to list, and archive artefact > sources? Any feedback or mere thoughts are welcome. > > > Thanks in advance, have a wonderful day! > > > -- > Boris Baldassari > Castalia Solutions -- Elegant Software Engineering > Web: http://castalia.solutions > Tel: +33 6 48 03 82 89 > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [email protected] > For additional commands, e-mail: [email protected] > >
