OK, I've written a quick & dirty utility to do this (collect an
increment's files into a single tarball) after the fact. I've posted it
in the wiki at the bottom of ContribScripts,
but as I noted there it's probably very dicey to use and it needs real
testing. Anyone care to give it a go? ;)
It's my hope that this can one day be an alternative to
"--remove-older-than" --- perhaps "--move-older-than".
An interesting thing about the output tarballs from my script: if I
rdiff two of them, the first tarball plus the delta file is
significantly smaller than the two tarballs together (presumably
because increments from different days are nonetheless similar).* This
is probably very dependent on what kind of data is being backed up, but
it may point to a way to make increment storage even more efficient
(though also more fragile, since a restore would take two levels of
merging). It's also quite possible that this is a clear sign that I've
done something very wrong in my script and am duplicating data across
what are supposed to be separate increments. Further testing is
required ;)
*Example from my test set: a collected increment from 2008-10-04 is
49MB, and the one from 2008-10-05 is also 49MB (total 98MB). An rdiff
delta file to turn 2008-10-04 into 2008-10-05 is only 18MB, so
2008-10-04 plus the delta file is 67MB. Another delta to turn 2008-10-05
into 2008-10-06 is also only 18MB, so the three of them together are
85MB instead of 147MB. Again, this is probably highly dependent on the
kind of data that's in these increments, but I'm surprised it works as
well as it does given that I'm tarring some already-gzipped files together.
~Felix.
On 07/03/09 15:03, Marcel (Felix) Giannelia wrote:
Is there any way of making rdiff-backup produce single files as
increments (say, by zipping them together when it produces them),
instead of thousands of itty bitty files? One file per increment would
make the task of moving old ones onto archive DVDs a lot easier, and
would be much easier on the target machine's filesystem, too. It
probably wouldn't slow down restores all that much, as
accessing an archive file's directory structure is likely faster than
doing the same in a part of the filesystem containing many thousands
of files per directory.
Presently, I'm trying to do a du -s on our backup directories, and it
has sat there for over an hour without printing the size of the first
one. According to top, du is using 50% of the total memory. I
know that there are statistics files which I could add together, but
in this case I want to use du to be sure because there's a chance that
there might be stray files in our rdiff-backup-data directory. Also,
creating so many files that commands like du cannot even function is,
in my opinion, incorrect behaviour.
~Felix.
_______________________________________________
rdiff-backup-users mailing list at [email protected]
http://lists.nongnu.org/mailman/listinfo/rdiff-backup-users
Wiki URL:
http://rdiff-backup.solutionsfirst.com.au/index.php/RdiffBackupWiki