https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111354
Richard Biener <rguenth at gcc dot gnu.org> changed: What |Removed |Added ---------------------------------------------------------------------------- Resolution|--- |INVALID Status|UNCONFIRMED |RESOLVED Target| |x86_64-*-* --- Comment #4 from Richard Biener <rguenth at gcc dot gnu.org> --- On a Zen4 machine the code produced by GCC 12 (which btw matches what the source intrinsics do) is faster. Btw, both src and dst are aligned so that shouldn't be the issue here.