https://gcc.gnu.org/g:ec03b6cec89e125eb5c955ca7180f06910f3527d

commit ec03b6cec89e125eb5c955ca7180f06910f3527d
Author: Thomas Schwinge <tho...@codesourcery.com>
Date:   Fri Mar 24 15:14:57 2023 +0100

    libgomp: Document OpenMP 'pinned' memory
    
            libgomp/
            * libgomp.texi (AMD Radeon, nvptx): Document OpenMP 'pinned'
            memory.

Diff:
---
 libgomp/ChangeLog.omp | 5 +++++
 libgomp/libgomp.texi  | 8 ++++++++
 2 files changed, 13 insertions(+)

diff --git a/libgomp/ChangeLog.omp b/libgomp/ChangeLog.omp
index 1b8f3dbaaf2..afb7dd62bc1 100644
--- a/libgomp/ChangeLog.omp
+++ b/libgomp/ChangeLog.omp
@@ -1,3 +1,8 @@
+2023-04-03  Thomas Schwinge  <tho...@codesourcery.com>
+
+       * libgomp.texi (AMD Radeon, nvptx): Document OpenMP 'pinned'
+       memory.
+
 2024-06-06  Jakub Jelinek  <ja...@redhat.com>
 
        * libgomp.texi (OpenMP 5.1 status): Mark Loop transformation constructs
diff --git a/libgomp/libgomp.texi b/libgomp/libgomp.texi
index 526e936ef52..bc105464208 100644
--- a/libgomp/libgomp.texi
+++ b/libgomp/libgomp.texi
@@ -6357,6 +6357,9 @@ The implementation remark:
 @item OpenMP code that has a @code{requires} directive with
       @code{unified_shared_memory} will remove any GCN device from the list of
       available devices (``host fallback'').
+@item OpenMP @emph{pinned} memory (@code{omp_atk_pinned},
+      @code{ompx_pinned_mem_alloc}, for example)
+      is allocated via @code{mmap}, @code{mlock}.
 @item The available stack size can be changed using the @code{GCN_STACK_SIZE}
       environment variable; the default is 32 kiB per thread.
 @item Low-latency memory (@code{omp_low_lat_mem_space}) is supported when the
@@ -6434,6 +6437,11 @@ The implementation remark:
 @item OpenMP code that has a @code{requires} directive with
       @code{unified_shared_memory} will remove any nvptx device from the
       list of available devices (``host fallback'').
+@item OpenMP @emph{pinned} memory (@code{omp_atk_pinned},
+      @code{ompx_pinned_mem_alloc}, for example)
+      is allocated via @code{cuMemHostAlloc} (CUDA Driver API).
+      This potentially helps optimization of host <-> device data
+      transfers.
 @item The default per-warp stack size is 128 kiB; see also @code{-msoft-stack}
       in the GCC manual.
 @item The OpenMP routines @code{omp_target_memcpy_rect} and

Reply via email to