https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117031

Richard Biener <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |rguenth at gcc dot gnu.org

--- Comment #2 from Richard Biener <rguenth at gcc dot gnu.org> ---
(In reply to Tamar Christina from comment #0)
> GCC seems to miss that there is no gap between the group accesses and that
> stride == 1.
> test3 is vectorized linearly by GCC, so it seems this is missed optimization
> in data ref analysis?

The load-lanes look fine, so it must be the code generation for the HI to DI
via SI conversions using unpacks that you are complaining about?  Using
load-lanes is natural here.

This isn't about permutes due to VF or so, is it?