https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101296
--- Comment #4 from Richard Biener <rguenth at gcc dot gnu.org> --- Disabling vectorization for mult_su3_nn (the one with the vaddsubpd instructions) still reproduces 433.milc 9180 126 73.1 * 9180 133 69.2 * and thus a 5% slowdown.