https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115304
--- Comment #3 from ro at CeBiTec dot Uni-Bielefeld.DE <ro at CeBiTec dot Uni-Bielefeld.DE> ---
> --- Comment #2 from Richard Biener <rguenth at gcc dot gnu.org> ---
> It should only need vect32 - basically I assumed the target can compose the
> 64bit vector from two 32bit elements. But it might be that for this to work
> the loads would need to be aligned.
>
> What is needed is char-to-short unpacking and vector composition. Either
> composing V2SImode or V8QImode from two V4QImode vectors.
>
> Does the following help?

Unfortunately not: makes no difference AFAICS.
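
For readers following along, the two operations named in comment #2 (composing
a 64-bit vector from two 32-bit halves, then char-to-short unpacking) can be
sketched at source level with GCC's generic vector extensions. This is only an
illustration, not the testcase or the patch under discussion: the typedef
names are borrowed from the machine-mode names, the helper functions are
hypothetical, and the loads go through memcpy so that no alignment is assumed.

```c
#include <stdint.h>
#include <string.h>

/* Generic vector types (GNU C extension).  The typedef names merely echo
   the machine modes mentioned in the comment; they are illustrative.  */
typedef uint8_t  v4qi __attribute__ ((vector_size (4)));   /* 4 x 8 bit  */
typedef uint8_t  v8qi __attribute__ ((vector_size (8)));   /* 8 x 8 bit  */
typedef uint16_t v8hi __attribute__ ((vector_size (16)));  /* 8 x 16 bit */

/* Hypothetical helper: load four chars from a possibly unaligned address.  */
static v4qi
load_v4qi (const uint8_t *p)
{
  v4qi v;
  memcpy (&v, p, sizeof v);
  return v;
}

/* Hypothetical helper: compose one 64-bit vector from two 32-bit halves.
   LO supplies elements 0-3, HI supplies elements 4-7.  */
static v8qi
compose_v8qi (v4qi lo, v4qi hi)
{
  v8qi r;
  memcpy (&r, &lo, sizeof lo);
  memcpy ((uint8_t *) &r + sizeof lo, &hi, sizeof hi);
  return r;
}

/* Hypothetical helper: char-to-short unpacking, zero-extending each
   element.  Written element by element using vector subscripting.  */
static v8hi
unpack_v8qi (v8qi v)
{
  v8hi r;
  for (int i = 0; i < 8; i++)
    r[i] = v[i];
  return r;
}

/* Combine the steps: two 32-bit loads, one composition, one unpack.  */
static v8hi
widen_two_halves (const uint8_t *a, const uint8_t *b)
{
  return unpack_v8qi (compose_v8qi (load_v4qi (a), load_v4qi (b)));
}
```

Whether a given target can perform the composition step directly on vector
registers is exactly the open question in the quoted comment; the sketch
only shows the intended data flow.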