Why bother with a script? Just wget a few high traffic websites (slashdot, yahoo, dailykos, google news) or similar into a file every now and again.
Tony. > -----Original Message----- > From: Johan Corveleyn [mailto:jcor...@gmail.com] > Sent: 13 January 2011 14:26 > To: krueger, Andreas (Andreas Krüger, DV-RATIO); > users@subversion.apache.org > Subject: Re: Trival merge of big text file: Dismal > performance, 540x faster if binary. > > On Thu, Jan 13, 2011 at 2:07 PM, Stefan Sperling > <s...@elego.de> wrote: > > On Thu, Jan 13, 2011 at 01:55:58PM +0100, Johan Corveleyn wrote: > >> Textual merging in svn makes use of a variant of the standard diff > >> algorithm, namely diff3. Just a couple of days ago, I finally > >> succeeded in making diff3 take advantage of those performance > >> improvements (haven't committed this to the branch yet, but maybe > >> I'll get to it tonight). > >> > >> Would you be able to build an svn client from source? If so, could > >> you perhaps build a client from > >> > http://svn.apache.org/repos/asf/subversion/branches/diff-optimization > >> s-bytes > >> ? > > > > Hey Johan, > > > > I would be interested in doing testing and reviewing the changes on > > your branch. There might still be enough time to get them into 1.7. > > Thanks, that would be great (btw, danielsh also expressed an > interest in reviewing the branch). I will try to give an > status update on the dev-list after I've committed the > changes for diff3. > > > I don't have any suitably large XML files though. > > If you and/or Andreas could provide some that would be great. > > I was thinking of writing a python script (as philip already > suggested) that can generate several variants of large files > with semi-random data. I have some prototype code for this > lying around, so if I find the time, I'll try to wrap this up > and send it to the dev list. OTOH, real-world examples are > probably even better. > > Cheers, > -- > Johan > > ______________________________________________________________________ > This email has been scanned by the MessageLabs Email Security System. > For more information please visit > http://www.messagelabs.com/email > ______________________________________________________________________ >