https://gcc.gnu.org/bugzilla/show_bug.cgi?id=97203
--- Comment #5 from Tom de Vries <vries at gcc dot gnu.org> --- FWIW, another aspect here is convergence (as usual). Looking at the SASS code for main$_omp_fn$0$impl, I don't find evidence for the usual divergence/convergence ops (SSY/SYNC), which might mean that the following shfl is executed in divergent mode, so, even if we would not get the memory access error, we would not get correct results.