On Sat, Nov 9, 2024 at 5:30 AM ralfconn <mentaden...@yahoo.com> wrote:
>
> But I have a backup, no problem... till I realize the cron job had
> already run so it had overwritten the old files with the new, corrupt
> versions.
>

I highly recommend having multiple backups to avoid this sort of problem.

If you aren't wedded to rsync, then restic seems to be the platform of
choice these days.  Not sure offhand if it handles ntfs very well.  I
use duplicati for backing up windows hosts to an S3 backend and that
works great, but that is more of a windows solution (VSS and so on).
I imagine that restic doesn't care much about the filesystem if it is
running on linux and everything is mounted.

If you are wedded to the rsync approach where your backups are just a
big directory tree that you can easily access, then I suggest using
rsnapshot.  It is basically a wrapper around rsync that maintains a
backup history, in a very clean way.  Basically it does a hard-link
copy of your entire backup set to a new directory tree named by the
timestamp, and then it runs rsync to sync that new tree.  The result
is that you get file-level deduplication effectively, and otherwise
get what looks like a nice big full copy of the backup source in each
directory.  It probably won't be as efficient as something like restic
since I'm guessing that can do deduplication below the file level (I
think it can also deduplicate across multiple hosts/etc if you're
using it that way).  Most of these modern tools still use librsync
under the hood so the actual data transfer is just as efficient.

rsync by itself is nice for its simplicity, but it just isn't a very
elegant backup solution.  You can tell it to preserve old file
versions, but those end up stored next to the original files with
different filenames and that can be a real mess to restore if you
don't want to end up with all those old versions.  With
restic/rsnapshot you can go back in time but still get a clean
restore, and you can still extract individual files from various
snapshots.

If you really are running rsync at any kind of scale also consider
rclone, which is often faster since it can transfer multiple files in
parallel, which is useful if you're more bound by latency than disk
IOPS (often the case on solid state drives over networks).

-- 
Rich

Reply via email to