https://gcc.gnu.org/bugzilla/show_bug.cgi?id=77730
--- Comment #1 from Andrew Pinski <pinskia at gcc dot gnu.org> --- One thing I noticed is the generic vector cost model sucks for almost all cores. It causes to emit ld2/ld3 in cases where it would be more expensive than if doing it scalar.