https://gcc.gnu.org/bugzilla/show_bug.cgi?id=37150
--- Comment #16 from Richard Biener <rguenth at gcc dot gnu.org> ---
(In reply to Joost VandeVondele from comment #15)
> Created attachment 29738 [details]
> maybe smaller testcase version ?
>
> Attached is what I think is roughly the smallest kernel of this type that we
> have in the code. I checked this is at least partially vectorized with
> ifort, but not so with gfortran trunk. It is still not such a small
> testcase, I'm afraid.

With the BB vectorization enhancement this still doesn't vectorize, because
the four remaining stores in the function are not grouped. The non-reduced
testcase has the same issue.

I suppose too much scalarization has happened, and we don't consider
candidates that do not end in a vector write to memory.
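
For readers unfamiliar with the term, "grouped" stores are stores to adjacent memory locations, which basic-block (SLP) vectorization uses as its starting point. The following is only a rough sketch in C of that distinction. It is not derived from attachment 29738, and the function names are hypothetical.

```c
/* Hypothetical illustration, not the kernel from the attached testcase. */

/* Four stores to adjacent elements of out[]: together they form one
   store group, which basic-block SLP can use as a seed. */
void grouped_stores(double *restrict out, const double *restrict a,
                    const double *restrict b)
{
    out[0] = a[0] * b[0];
    out[1] = a[1] * b[1];
    out[2] = a[2] * b[2];
    out[3] = a[3] * b[3];
}

/* The same arithmetic, but each result is written to an unrelated
   location, so the four stores do not form a group. */
void scattered_stores(double *restrict r0, double *restrict r1,
                      double *restrict r2, double *restrict r3,
                      const double *restrict a, const double *restrict b)
{
    *r0 = a[0] * b[0];
    *r1 = a[1] * b[1];
    *r2 = a[2] * b[2];
    *r3 = a[3] * b[3];
}
```

Compiling both with something like `gcc -O3 -fopt-info-vec` should show whether either function is vectorized; the outcome depends on the GCC version and target.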