On Fri, Oct 7, 2011 at 1:46 PM, sebb <seb...@gmail.com> wrote: > On 7 October 2011 16:44, Gary Gregory <garydgreg...@gmail.com> wrote: > > On Fri, Oct 7, 2011 at 8:51 AM, sebb <seb...@gmail.com> wrote: > > > >> On 7 October 2011 13:44, Gary Gregory <garydgreg...@gmail.com> wrote: > >> > On Fri, Oct 7, 2011 at 2:01 AM, Henri Yandell <flame...@gmail.com> > >> wrote: > >> > > >> >> wget doesn't seem to like the url. curl is happy to do it, but it > >> >> doesn't do -r afaik. > >> >> > >> > > >> > Here is what I get with wget. How do I make it get the embedded URLs? > I > >> > don't care if it's curl, wget, or foobar, I just want instructions > that > >> > work. After figuring out all the Maven nonsense, now this. Sigh. > >> > >> It's easy enough to loop around the non-Maven files in the directory > >> if you cannot get the index parsing to work. > >> > >> Or even use Lynx on p.a.o and browse to the directory, and download from > >> there. > >> > >> If you cannot get it to work, let me know and I can help later (about > >> to be busy). > >> > > > > Yes please. :( It's this kind of ridiculous hoop jumping that makes me > put > > this task on the back burner, the one that's in the shed, deep in the > woods. > > Problem seems to be that the Nexus server has a robots.txt which does > not allow downloads from that directory. > > The following works for me: > > wget -r -l 1 -np -nH -nd -nv -e robots=off --wait 10 --no-check-certificate > URL >
Thank you! It's now downloading. I'll update the Wiki... Gary > > -r recursive > -l 1 1 level > -np no parent > -nH don't create host directories > -nd don't create directories > -nv quiet > -e robots=off ignore robots.txt > --wait 10 wait between retrievals > > > Gary > > > > > >> > >> >> wget -np -r --no-check-certificate > >> > > >> > https://repository.apache.org/content/repositories/orgapachecommons-027/commons-io/commons-io/2.1/ > >> > --2011-10-07 12:41:31-- > >> > > >> > https://repository.apache.org/content/repositories/orgapachecommons-027/commons-io/commons-io/2.1/ > >> > Resolving repository.apache.org... 140.211.11.57 > >> > Connecting to repository.apache.org|140.211.11.57|:443... connected. > >> > WARNING: cannot verify repository.apache.org's certificate, issued by > >> > `/C=US/ST=Arizona/L=Scottsdale/O=GoDaddy.com, Inc./OU= > >> > http://certificates.godaddy.com/repo > >> > sitory/CN=Go Daddy Secure Certification > Authority/serialNumber=07969287': > >> > Self-signed certificate encountered. > >> > HTTP request sent, awaiting response... 200 OK > >> > Length: unspecified [text/html] > >> > Saving to: ` > >> > > >> > repository.apache.org/content/repositories/orgapachecommons-027/commons-io/commons-io/2.1/index.html > >> > ' > >> > > >> > [ > >> > <=> > >> > ] 27,475 --.-K/s in 0.001s > >> > > >> > 2011-10-07 12:41:31 (22.9 MB/s) - ` > >> > > >> > repository.apache.org/content/repositories/orgapachecommons-027/commons-io/commons-io/2.1/index.html > >> ' > >> > saved [27475] > >> > > >> > Loading robots.txt; please ignore errors. > >> > --2011-10-07 12:41:31-- https://repository.apache.org/robots.txt > >> > Connecting to repository.apache.org|140.211.11.57|:443... connected. > >> > WARNING: cannot verify repository.apache.org's certificate, issued by > >> > `/C=US/ST=Arizona/L=Scottsdale/O=GoDaddy.com, Inc./OU= > >> > http://certificates.godaddy.com/repo > >> > sitory/CN=Go Daddy Secure Certification > Authority/serialNumber=07969287': > >> > Self-signed certificate encountered. > >> > HTTP request sent, awaiting response... 200 OK > >> > Length: unspecified [text/plain] > >> > Saving to: `repository.apache.org/robots.txt' > >> > > >> > [ > >> > <=> > >> > ] 86 --.-K/s in 0s > >> > > >> > 2011-10-07 12:41:31 (2.28 MB/s) - `repository.apache.org/robots.txt' > >> saved > >> > [86] > >> > > >> > FINISHED --2011-10-07 12:41:31-- > >> > Downloaded: 2 files, 27K in 0.001s (22.3 MB/s) > >> >> > >> > > >> > Gary > >> > > >> > > >> >> I used to use the grab_releases.sh script in > >> >> committers/tools/releases/, but it's based on the Apache web server > >> >> autoindex and needs changing to work with Nexus' format. > >> >> > >> >> Hen > >> >> > >> >> On Thu, Oct 6, 2011 at 5:58 PM, Gary Gregory <garydgreg...@gmail.com > > > >> >> wrote: > >> >> > Hi All, > >> >> > > >> >> > The instruction on https://wiki.apache.org/commons/UsingNexus say: > >> >> > > >> >> > wget -np -r > >> >> > > >> >> > >> > https://repository.apache.org/content/repositories/orgapachecommons-098/org/apache/commons/commons-foo/1.1/ > >> >> > > >> >> > Which for IO 2.1 means: > >> >> > > >> >> > wget -np -r --no-check-certificate > >> >> > > >> >> > >> > https://repository.apache.org/content/repositories/orgapachecommons-027/commons-io/commons-io/2.1/ > >> >> > > >> >> > When I do that from my home dir on p.a.o I get the index.html and > >> that's > >> >> it. > >> >> > uh? > >> >> > > >> >> > Are these instructions up to date? > >> >> > > >> >> > -- > >> >> > E-Mail: garydgreg...@gmail.com | ggreg...@apache.org > >> >> > JUnit in Action, 2nd Ed: <http://goog_1249600977> > http://bit.ly/ECvg0 > >> >> > Spring Batch in Action: <http://s.apache.org/HOq> > http://bit.ly/bqpbCK > >> >> > Blog: http://garygregory.wordpress.com > >> >> > Home: http://garygregory.com/ > >> >> > Tweet! http://twitter.com/GaryGregory > >> >> > > >> >> > >> >> --------------------------------------------------------------------- > >> >> To unsubscribe, e-mail: dev-unsubscr...@commons.apache.org > >> >> For additional commands, e-mail: dev-h...@commons.apache.org > >> >> > >> >> > >> > > >> > > >> > -- > >> > E-Mail: garydgreg...@gmail.com | ggreg...@apache.org > >> > JUnit in Action, 2nd Ed: <http://goog_1249600977>http://bit.ly/ECvg0 > >> > Spring Batch in Action: <http://s.apache.org/HOq>http://bit.ly/bqpbCK > >> > Blog: http://garygregory.wordpress.com > >> > Home: http://garygregory.com/ > >> > Tweet! http://twitter.com/GaryGregory > >> > > >> > >> --------------------------------------------------------------------- > >> To unsubscribe, e-mail: dev-unsubscr...@commons.apache.org > >> For additional commands, e-mail: dev-h...@commons.apache.org > >> > >> > > > > > > -- > > E-Mail: garydgreg...@gmail.com | ggreg...@apache.org > > JUnit in Action, 2nd Ed: <http://goog_1249600977>http://bit.ly/ECvg0 > > Spring Batch in Action: <http://s.apache.org/HOq>http://bit.ly/bqpbCK > > Blog: http://garygregory.wordpress.com > > Home: http://garygregory.com/ > > Tweet! http://twitter.com/GaryGregory > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: dev-unsubscr...@commons.apache.org > For additional commands, e-mail: dev-h...@commons.apache.org > > -- E-Mail: garydgreg...@gmail.com | ggreg...@apache.org JUnit in Action, 2nd Ed: <http://goog_1249600977>http://bit.ly/ECvg0 Spring Batch in Action: <http://s.apache.org/HOq>http://bit.ly/bqpbCK Blog: http://garygregory.wordpress.com Home: http://garygregory.com/ Tweet! http://twitter.com/GaryGregory