https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106146
--- Comment #3 from Andrew Pinski <pinskia at gcc dot gnu.org> --- So I see svadd_z directly emits the instruction, not leaving any way to optimize away the _z part before hand. I am not sure how to fix this though.