https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119298
--- Comment #6 from Richard Biener <rguenth at gcc dot gnu.org> --- So this is likely a Zen5 tuning thing that makes the vectorization profitable. Though since this just transforms stores this cannot be a STLF fail, instead it's likely the vector(4) long unsigned int builds from scalars that cause this. It's also not possible the add/sub insn cost change causes this. Could it be that this is in the end sth similar as we see with LBM?