https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111874
Andrew Pinski changed:
What|Removed |Added
Last reconfirmed||2023-11-12
Severity|normal
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111874
--- Comment #3 from Hongtao.liu ---
> For the case of conditional (or loop masked) fold-left reductions the scalar
> fallback isn't implemented. But AVX512 has vpcompress that could be used
> to implement a more efficient sequence for a masked
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111874
--- Comment #2 from Richard Biener ---
(In reply to Hongtao.liu from comment #1)
> For integer, We have _mm512_mask_reduce_add_epi32 defined as
>
> extern __inline int
> __attribute__ ((__gnu_inline__, __always_inline__, __artificial__))
> _mm5
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111874
--- Comment #1 from Hongtao.liu ---
For integer, We have _mm512_mask_reduce_add_epi32 defined as
extern __inline int
__attribute__ ((__gnu_inline__, __always_inline__, __artificial__))
_mm512_mask_reduce_add_epi32 (__mmask16 __U, __m512i __A)
{