https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66115
--- Comment #2 from Andrew Pinski <pinskia at gcc dot gnu.org> --- I think this is invalid as __m128i as an alignment requirement of 16byte but a+1 is not aligned to 16byte boundary. It just happens to work for -O2.