https://gcc.gnu.org/bugzilla/show_bug.cgi?id=86504
Joel Hutton <joel.hutton at arm dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |joel.hutton at arm dot com

--- Comment #8 from Joel Hutton <joel.hutton at arm dot com> ---
(In reply to Richard Biener from comment #3)

Hi Richard,

> So the vectorization issue would be that basic-block vectorization doesn't
> catch this in a very nice way - on x86 we pull out the invariant computation
> and have a vectorized (outer) loop storing to d.

Just a small clarification: do you mean to say that there is a difference
between the way x86 and aarch64 handle this? As far as I can see, they handle
it in the same way.