https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95905
--- Comment #1 from Gabriel Ravier <gabravier at gmail dot com> ---
The same pattern with _mm_unpacklo_epi16/32 and the corresponding SSE4 intrinsics can also be optimized in the same way.
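
For illustration, here is a minimal sketch of the equivalence this seems to rely on. The comment does not spell the pattern out, so this assumes it is "unpack the low half against an all-zero vector", which produces the same bits as the SSE4.1 zero-extending conversions (_mm_cvtepu16_epi32 and _mm_cvtepu32_epi64). The function names below are hypothetical, and the target attribute is only there so the sketch builds with GCC or Clang without -msse4.1.

```c
#include <assert.h>
#include <immintrin.h>
#include <string.h>

/* Hypothetical helper: bitwise comparison of two 128-bit vectors. */
static int same_bits(__m128i a, __m128i b)
{
    return memcmp(&a, &b, sizeof a) == 0;
}

/* Assumed pattern, 16-bit case: interleaving the low four 16-bit lanes
   with zeros yields the same bits as zero-extending them to 32 bits. */
__attribute__((target("sse4.1")))
int unpacklo16_equals_cvtepu16(void)
{
    const __m128i x = _mm_setr_epi16(1, -2, 3, -4, 5, -6, 7, -8);
    const __m128i via_unpack = _mm_unpacklo_epi16(x, _mm_setzero_si128());
    const __m128i via_zext = _mm_cvtepu16_epi32(x);
    return same_bits(via_unpack, via_zext);
}

/* Assumed pattern, 32-bit case: interleaving the low two 32-bit lanes
   with zeros yields the same bits as zero-extending them to 64 bits. */
__attribute__((target("sse4.1")))
int unpacklo32_equals_cvtepu32(void)
{
    const __m128i x = _mm_setr_epi32(1, -2, 3, -4);
    const __m128i via_unpack = _mm_unpacklo_epi32(x, _mm_setzero_si128());
    const __m128i via_zext = _mm_cvtepu32_epi64(x);
    return same_bits(via_unpack, via_zext);
}

/* Hypothetical driver that checks both cases. */
void check_unpack_zero_equivalence(void)
{
    assert(unpacklo16_equals_cvtepu16());
    assert(unpacklo32_equals_cvtepu32());
}
```

The negative inputs are deliberate: they set the top bit of each lane, so a sign-extending conversion would give a different result from the unpack-with-zero form.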