Re: [PATCH 5/5] crypto: arm/blake2b - add NEON-optimized BLAKE2b implementation

2020-12-16 Thread Eric Biggers
On Tue, Dec 15, 2020 at 03:47:08PM -0800, Eric Biggers wrote: > +// Execute one round of BLAKE2b by updating the state matrix v[0..15] in the > +// NEON registers q0-q7. The message block is in q8..q15. The stack pointer > +// points to a 32-byte aligned buffer containing a copy of q8 and q9, so

[PATCH 5/5] crypto: arm/blake2b - add NEON-optimized BLAKE2b implementation

2020-12-15 Thread Eric Biggers
From: Eric Biggers Add a NEON-optimized implementation of BLAKE2b. On Cortex-A7 (which these days is the most common ARM processor that doesn't have the ARMv8 Crypto Extensions), this is over twice as fast as SHA-256, and slightly faster than SHA-1. It is also almost three times as fast as the