Stock RH 7 version, rsync-3.1.2-6.el7_6.1.x86_64. We've tried a number of recompiles, with both gcc and the Intel compiler. The only difference between otherwise identical builds was MD4 vs. MD5.

/bin/rsync -lptgoDAH -v --numeric-ids -d --relative --delete --delete-after --files-from=...

I'm not asking for help, just whether anyone has attempted to swap the algorithm for something much faster.

I refer you to the xxHash project, https://cyan4973.github.io/xxHash/, which has a table of speeds. Regardless of what anyone might speculate, we are pursuing this route of swapping out the algorithm. Maybe it's all for naught. Maybe it isn't. But in a few weeks we'll hopefully have determined which.
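
For anyone who wants a rough feel for the MD4 vs. MD5 gap on their own hardware, a minimal sketch along these lines might help. It only times the digests exposed by Python's hashlib (which wraps the local crypto library), not rsync's own checksum code, so the numbers are indicative at best. The function names `throughput_mb_s` and `compare` and the buffer size are made up for illustration, and MD4 may be unavailable on newer OpenSSL builds, in which case it is reported as missing.

```python
import hashlib
import time


def throughput_mb_s(name, payload, rounds=20):
    """Approximate hashing throughput in MiB/s for one hashlib algorithm.

    Raises ValueError if the local hashlib/OpenSSL build lacks the algorithm.
    """
    start = time.perf_counter()
    for _ in range(rounds):
        hashlib.new(name, payload).digest()
    # Guard against a zero reading on very coarse clocks.
    elapsed = max(time.perf_counter() - start, 1e-9)
    return (len(payload) * rounds) / (1024 * 1024) / elapsed


def compare(names=("md4", "md5"), size=8 * 1024 * 1024):
    """Time each named algorithm on the same zero-filled buffer.

    Returns {name: MiB/s}, with None for algorithms that are unavailable.
    """
    payload = bytes(size)  # illustrative buffer; real file data would also do
    results = {}
    for name in names:
        try:
            results[name] = throughput_mb_s(name, payload)
        except ValueError:
            results[name] = None  # e.g. md4 disabled in OpenSSL 3.x
    return results


if __name__ == "__main__":
    for name, rate in compare().items():
        if rate is None:
            print(f"{name}: not available in this build")
        else:
            print(f"{name}: {rate:.0f} MiB/s")
```

xxHash is not in the Python standard library, so it is left out here; a third-party binding would be needed to add it to the comparison.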

Thanks all,
Bill

On 6/18/19 9:02 AM, Ellis H. Wilson III wrote:
On 6/18/19 6:59 AM, Bill Wichser wrote:
Just for clarity here, we are NOT using the -c option.  The checksums happen whenever there is a transfer between the rsync source and the rsyncd on the other end.
...snip...
This is not some trivial rsync running at the top level.  There is code we wrote as well as integration with Jenkins.  When we recompiled rsync using MD4 instead of MD5, we saw a 20% increase in performance across the board.  This is what sparked my question.

We need more details to be of much use:

1. Specific rsync version and command line used.

2. Compilation options both normally and with your md4 changes.

_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing
To change your subscription (digest mode or unsubscribe) visit 
https://beowulf.org/cgi-bin/mailman/listinfo/beowulf
