https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113079
Richard Biener <rguenth at gcc dot gnu.org> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|UNCONFIRMED |NEW Ever confirmed|0 |1 Last reconfirmed| |2023-12-19 --- Comment #2 from Richard Biener <rguenth at gcc dot gnu.org> --- It's includes a lane reduction so we need to have those zero at least on GIMPLE (if we'd do it there) because the tree codes do not specify which lanes are reduced. The actual x86 instruction is probably fine so if you make V8QI operation available in the backend that should work without zeroing.