On Wed, Jul 08, 2020 at 12:11:18PM +0300, Ard Biesheuvel wrote:
> Due to the fact that the x86 port does not support allocating objects
> on the stack with an alignment that exceeds 8 bytes, we have a rather
> ugly hack in the x86 code for ChaCha to ensure that the state array is
> aligned to 16 bytes, allowing the SSSE3 implementation of the algorithm
> to use aligned loads.
> 
> Given that the performance benefit of using aligned loads appears to
> be limited (~0.25% for 1k blocks using tcrypt on a Core i7-8650U), and
> that this hack has leaked into generic ChaCha code, let's just remove
> it.
> 
> Cc: Martin Willi <mar...@strongswan.org>
> Cc: Herbert Xu <herb...@gondor.apana.org.au>
> Cc: Eric Biggers <ebigg...@kernel.org>
> Signed-off-by: Ard Biesheuvel <a...@kernel.org>
> ---
>  arch/x86/crypto/chacha-ssse3-x86_64.S | 16 ++++++++--------
>  arch/x86/crypto/chacha_glue.c         | 17 ++---------------
>  include/crypto/chacha.h               |  4 ----
>  3 files changed, 10 insertions(+), 27 deletions(-)
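
For anyone reading this in the archive without the diff at hand, the kind
of hack described above can be sketched roughly as follows. This is a
user-space illustration only, not the code the patch removes: the names
ptr_align_up(), copy_to_aligned_state(), STATE_WORDS and STATE_ALIGN are
hypothetical, and it assumes only that the stack slot may be less than
16-byte aligned. The buffer is over-allocated by the worst-case padding
and the working pointer is rounded up to the next 16-byte boundary.

```c
#include <stdint.h>
#include <string.h>

#define STATE_WORDS 16  /* ChaCha state: 16 words of 32 bits */
#define STATE_ALIGN 16  /* alignment needed for aligned SSE loads */

/* Round a pointer up to the next multiple of align (a power of two). */
static void *ptr_align_up(void *p, uintptr_t align)
{
	return (void *)(((uintptr_t)p + align - 1) & ~(align - 1));
}

/*
 * Hypothetical helper: stage src in a 16-byte-aligned slot carved out
 * of an over-sized stack buffer, copy it back out to out, and return 1
 * if the staged slot really was 16-byte aligned.
 */
static int copy_to_aligned_state(const uint32_t src[STATE_WORDS],
				 uint32_t out[STATE_WORDS])
{
	/* Assumption: buf itself may land on any byte boundary, so add
	 * STATE_ALIGN - 1 bytes of slack for the round-up. */
	uint8_t buf[STATE_WORDS * sizeof(uint32_t) + STATE_ALIGN - 1];
	uint8_t *state = ptr_align_up(buf, STATE_ALIGN);

	/* memcpy keeps the sketch free of aliasing problems. */
	memcpy(state, src, STATE_WORDS * sizeof(uint32_t));
	memcpy(out, state, STATE_WORDS * sizeof(uint32_t));

	return ((uintptr_t)state % STATE_ALIGN) == 0;
}
```

With the hack gone, the state can be an ordinary local array and the
assembly has to use unaligned loads, which is the trade-off the ~0.25%
figure above refers to.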

Patch applied.  Thanks.
-- 
Email: Herbert Xu <herb...@gondor.apana.org.au>
Home Page: http://gondor.apana.org.au/~herbert/
PGP Key: http://gondor.apana.org.au/~herbert/pubkey.txt