https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120201
Andrew Pinski <pinskia at gcc dot gnu.org> changed: What |Removed |Added ---------------------------------------------------------------------------- Keywords| |wrong-code Ever confirmed|0 |1 Status|UNCONFIRMED |NEW Last reconfirmed| |2025-05-09 --- Comment #3 from Andrew Pinski <pinskia at gcc dot gnu.org> --- Partition 0: size 384 align 16 transforms That looks correct. But this looks wrong: ;; [t.cc:17:20] _34 = [t.cc:17:20] MEM <vector(64) unsigned char> [(char * {ref-all})&transforms]; (insn 98 97 0 (set (reg:V64QI 111 [ _34 ]) (mem/c:V64QI (plus:DI (reg:DI 899) (const_int -448 [0xfffffffffffffe40])) [0 [t.cc:17:20] MEM <vector(64) unsigned char> [(char * {ref-all})&transforms]+0 S64 A512])) "t.cc":17:20 -1 (nil)) Notice the 512 bit alignment. But the alignment is correct in gimple: _34 = __MEM <unsigned char [[gnu::vector_size(64)]], 8> ((char * {ref-all})&transforms); Note the 8 which is 128 bit aligned. Note this needs -march=znver4 as -march=skylake-avx512 does not unroll the loop (I have not looked why though).