https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101908
--- Comment #14 from Richard Biener <rguenth at gcc dot gnu.org> ---
(In reply to hubicka from comment #12)
> > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101908
> >
> > --- Comment #11 from Richard Biener <rguenth at gcc dot gnu.org> ---
> > -mtune-ctrl=^sse_unaligned_load_optimal fixes the observed regression.
> Interesting. I suppose we may want to run specs with generic model
> changed this way to see if it cures other stlf problems? I can do that
> if that makes sense.

It will only help for V2DF I think, so no, not really.  But an IPA idea
of whether there's cross-call STLF issues might be nice.

Generally doing wider stores is fine but of course if structs end up
"misaligned" then doing wide loads tends to run into these issues.

In theory the backend should have good enough knowledge to split the
wide loads of the argument area near to the prologue because it should
know how we stored to it.

But then - just fix the CPUs :P
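
For readers unfamiliar with the pattern, here is a minimal sketch of the access shape under discussion: two narrow stores into a 16-byte slot, followed either by one wide load of the whole slot or by two loads that match the stores. This is not the testcase from this PR. The names `struct pair`, `store_narrow`, `sum_wide` and `sum_split` are hypothetical, and the `memcpy` calls only mirror the shape of the accesses; which instructions the compiler actually emits depends on the target and tuning flags. On many x86 cores a load that spans two smaller in-flight stores cannot be satisfied by store-to-load forwarding (STLF), which is the stall the wide variant risks.

```c
#include <assert.h>
#include <string.h>

/* Hypothetical 16-byte aggregate standing in for an argument passed in memory. */
struct pair { double lo, hi; };

static_assert(sizeof(struct pair) == 2 * sizeof(double), "no padding expected");

/* Two separate 8-byte stores, as a caller might emit when it fills the
   argument slot member by member. */
static void store_narrow(unsigned char *slot, double lo, double hi)
{
    memcpy(slot, &lo, sizeof lo);
    memcpy(slot + sizeof lo, &hi, sizeof hi);
}

/* One 16-byte read of the whole slot, the shape of a V2DF load.  If the
   two stores above are still in flight, this is the STLF-unfriendly case. */
static double sum_wide(const unsigned char *slot)
{
    struct pair p;
    memcpy(&p, slot, sizeof p);
    return p.lo + p.hi;
}

/* Two 8-byte reads that match the earlier stores one to one, the shape a
   split load would have. */
static double sum_split(const unsigned char *slot)
{
    double lo, hi;
    memcpy(&lo, slot, sizeof lo);
    memcpy(&hi, slot + sizeof lo, sizeof hi);
    return lo + hi;
}
```

Both variants compute the same value; the difference is only in how the loads line up with the preceding stores, which is the information the backend would need in order to decide where splitting a wide load near the prologue pays off.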