https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96932

Tobias Burnus <burnus at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |burnus at gcc dot gnu.org

--- Comment #3 from Tobias Burnus <burnus at gcc dot gnu.org> ---
Crossref: PR100497 - fails on Volta without
  membar.sys;
before
  atom.global.exch.b32

Unfortunately, compared to pre-Volta, it is very slow - membar.gl is still slow
but a bit less.  Using (→ sm_70) fence.sys / fence.gnu instead of
fence.sc.{sys,gnu} (= membar.{sys,gl} on >= sm_70) does not seem to make a
performance difference for PR100497.

Reply via email to