https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65409
--- Comment #6 from Jakub Jelinek <jakub at gcc dot gnu.org> ---
Doesn't it generate worse code though? I mean, for BLKmode PARALLEL, forcing it into some very wide pseudo (TImode in this case, perhaps OImode for even larger ones?) and then storing might be more expensive than what my patch does.