https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79481
--- Comment #4 from Wenzel Jakob <wen...@mitsuba-renderer.org> --- I think that's right. Clang e.g. also does this: #define _mm512_prefetch_i32gather_ps(index, addr, scale, hint) ({\ __builtin_ia32_gatherpfdps((__mmask16) -1, \ (__v16si)(__m512i)(index), (int const *)(addr), \ (int)(scale), (int)(hint)); })