[Bug target/111874] Missed mask_fold_left_plus with AVX512

2023-11-12 Thread pinskia at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111874 Andrew Pinski changed: What|Removed |Added Last reconfirmed||2023-11-12 Severity|normal

[Bug target/111874] Missed mask_fold_left_plus with AVX512

2023-10-23 Thread crazylht at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111874 --- Comment #3 from Hongtao.liu --- > For the case of conditional (or loop masked) fold-left reductions the scalar > fallback isn't implemented. But AVX512 has vpcompress that could be used > to implement a more efficient sequence for a masked

[Bug target/111874] Missed mask_fold_left_plus with AVX512

2023-10-19 Thread rguenth at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111874 --- Comment #2 from Richard Biener --- (In reply to Hongtao.liu from comment #1) > For integer, We have _mm512_mask_reduce_add_epi32 defined as > > extern __inline int > __attribute__ ((__gnu_inline__, __always_inline__, __artificial__)) > _mm5

[Bug target/111874] Missed mask_fold_left_plus with AVX512

2023-10-19 Thread crazylht at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111874 --- Comment #1 from Hongtao.liu --- For integer, We have _mm512_mask_reduce_add_epi32 defined as extern __inline int __attribute__ ((__gnu_inline__, __always_inline__, __artificial__)) _mm512_mask_reduce_add_epi32 (__mmask16 __U, __m512i __A) {