https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102062
--- Comment #7 from Segher Boessenkool <segher at gcc dot gnu.org> --- Btw, -ftree-loop-vectorize -fvect-cost-model=cheap makes this 8 vectors per iteration (and very-cheap doesn't vectorise it). Maybe overkill, esp. when you look at the tail code, but that 8 vector core sure looks tight :-)