https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82426
Richard Biener <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Target|                            |x86_64-*-*

--- Comment #5 from Richard Biener <rguenth at gcc dot gnu.org> ---
x86 actually does have V2SF. The issue is that there's an opportunity for
V4SF vectorization and one for V2SF arriving at the same load groups, and
that causes a conflict (there are other PRs about this general issue), so we
kill one part:

t.C:18:12: missed: desired vector type conflicts with earlier one for _2 = b_35(D)->m11;
t.C:18:12: note: removing SLP instance operations starting from: <retval>.dx = _27;

Also, we have a bunch of live lanes off the remaining vectorized piece, which
makes the code a bit awkward.

Unfortunately we have no way to force 64-bit vectors (V2SF) here to see
whether splitting up the V4SFmode partition would help (I guess it would, as
can be seen from using 'double').
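For illustration only, here is a minimal sketch of the kind of code that could produce such a conflict. It is not the PR's actual t.C: the struct name `matrix_t`, the function `multiply`, and every field other than `m11` and `dx` (which appear in the diagnostics above) are assumptions. The four `m*` stores form a natural four-lane float group (V4SF), the `dx`/`dy` stores form a two-lane group (V2SF), and both groups read the same `b->m11 .. b->m22` loads.

```cpp
// Hypothetical reconstruction, NOT the testcase attached to the PR.
// Only the field names m11 and dx are taken from the dump lines above.
struct matrix_t {
    float m11, m12, m21, m22, dx, dy;
};

matrix_t multiply(const matrix_t *a, const matrix_t *b)
{
    matrix_t r;
    // Four adjacent float stores: a candidate V4SF SLP instance.
    r.m11 = a->m11 * b->m11 + a->m12 * b->m21;
    r.m12 = a->m11 * b->m12 + a->m12 * b->m22;
    r.m21 = a->m21 * b->m11 + a->m22 * b->m21;
    r.m22 = a->m21 * b->m12 + a->m22 * b->m22;
    // Two adjacent float stores: a candidate V2SF SLP instance that
    // reuses the loads of b->m11, b->m12, b->m21 and b->m22 from above.
    r.dx = a->dx * b->m11 + a->dy * b->m21 + b->dx;
    r.dy = a->dx * b->m12 + a->dy * b->m22 + b->dy;
    return r;
}
```

Compiling something like this with `g++ -O3 -fopt-info-vec-missed` is one way to look for "missed:" notes of the sort quoted above. Whether this exact sketch triggers the same message depends on the GCC version and target.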