https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103781
Andrew Pinski <pinskia at gcc dot gnu.org> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|UNCONFIRMED |NEW Last reconfirmed| |2024-01-26 Ever confirmed|0 |1 --- Comment #6 from Andrew Pinski <pinskia at gcc dot gnu.org> --- Confirmed. Note if sve is turned on, we get: ``` .L2: ldr q30, [x1], 16 ldr q29, [x2], 16 mul z29.d, z30.d, z29.d add v31.2d, v31.2d, v29.2d cmp x1, x3 bne .L2 ``` For the inner loop on the trunk which is 100% what you want as then it is vectorized.