https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96932
Tobias Burnus <burnus at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |burnus at gcc dot gnu.org

--- Comment #3 from Tobias Burnus <burnus at gcc dot gnu.org> ---
Crossref: PR100497 - fails on Volta without membar.sys; before atom.global.exch.b32

Unfortunately, compared to pre-Volta, it is very slow; membar.gl is still slow, but a bit less so.

Using fence.sys / fence.gpu (which require sm_70) instead of fence.sc.{sys,gpu} (= membar.{sys,gl} on >= sm_70) does not seem to make a performance difference for PR100497.
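
For readers less familiar with the two fence flavours compared above, here is a rough host-side C++ analogy. It is not PTX and not code from either PR. A sequentially consistent fence loosely plays the role of fence.sc (membar on >= sm_70), an acquire-release fence plays the role of the plain fence, and a relaxed exchange stands in for atom.global.exch.b32. The names `Mailbox`, `publish` and `try_consume` are hypothetical, and the mapping between the C++ and PTX memory models is an assumption made for illustration only.

```cpp
#include <atomic>

// Hypothetical mailbox: a plain payload guarded by a flag that is
// updated with an atomic exchange.
struct Mailbox {
    int payload = 0;
    std::atomic<int> flag{0};
};

// Writer side: store the payload, issue a fence, then exchange the flag.
// fence_order is the knob being compared:
//   std::memory_order_seq_cst  ~ fence.sc   (assumed analogy)
//   std::memory_order_acq_rel  ~ plain fence (assumed analogy)
inline void publish(Mailbox& box, int value, std::memory_order fence_order) {
    box.payload = value;
    std::atomic_thread_fence(fence_order);
    box.flag.exchange(1, std::memory_order_relaxed);
}

// Reader side: if the flag is set, issue an acquire fence and read the payload.
// Returns false when nothing has been published yet.
inline bool try_consume(Mailbox& box, int& out) {
    if (box.flag.load(std::memory_order_relaxed) == 0) {
        return false;
    }
    std::atomic_thread_fence(std::memory_order_acquire);
    out = box.payload;
    return true;
}
```

Either fence order is enough to make the payload visible to a reader that observes the flag. The stronger seq_cst fence additionally takes part in a single total order of all such fences, which is usually where the extra cost comes from.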