On 07/04/2017 03:05 AM, Tom de Vries wrote: > On 07/03/2017 04:24 PM, Tom de Vries wrote: >> On 07/03/2017 04:08 PM, Thomas Schwinge wrote: >>> Hi! >>> >>> On Mon, 26 Jun 2017 17:29:11 +0200, Jakub Jelinek <ja...@redhat.com> >>> wrote: >>>> On Mon, Jun 26, 2017 at 03:26:57PM +0000, Joseph Myers wrote: >>>>> On Mon, 26 Jun 2017, Tom de Vries wrote: >>>>> >>>>>>> 2. Handle GOMP_OPENACC_NVPTX_{DISASM,SAVE_TEMPS} in libgomp nvptx >>>>>>> plugin >>>>>> >>>>>> This patch adds handling of: >>>>>> - GOMP_OPENACC_NVPTX_SAVE_TEMPS=[01], and >>>>>> - GOMP_OPENACC_NVPTX_DISASM=[01] >>> >>> Why the "OPENACC" in these names? >> >> I took the format from 'GOMP_OPENACC_DIM'. >> >>> Doesn't this debugging aid apply to >>> any variant of offloading? >> >> I guess you're right. These environment variables would also be >> applicable for f.i. offloading via openmp on nvptx. I'll strip the >> 'OPENACC_' bit from the variables. >> >>>>>> The filename used for dumping the module is plugin-nvptx.<pid>.cubin. >>> >>> Also, I suggest to make these names similar to their controlling >>> options, >>> that is: "gomp-nvptx*", for example. >>> >> >> Makes sense, will do. > > Changes in the patch series: > - removed OPENACC_ from environment variable names > - made temp files use gomp-nvptx prefix. > - fixed build error due to missing _GNU_SOURCE in libgomp-nvptx.c. > - merged the three GOMP_NVPTX_JIT patches into one > - rewrote GOMP_NVPTX_JIT to add no extra flags to the JIT compiler > invocation if GOMP_NVPTX_JIT if not defined, removing the need for > hardcoding default values > - added CU_JIT_TARGET to plugin/cuda/cuda.h > > Build on x86_64 with nvptx offloading enabled (using plugin/cuda/cuda.h). > > The patch series now looks like: > 1. Handle GOMP_NVPTX_{DISASM,SAVE_TEMPS} in libgomp nvptx plugin > 2. Handle GOMP_NVPTX_PTXRW in libgomp nvptx plugin > 3. Handle GOMP_NVPTX_JIT={-O[0-4],-ori,-arch=<n>} in libgomp nvptx > plugin > > I'll repost the patch series in reply to this email.
Ping. Can we get this patch series into trunk and og7? The ability to easily modify PTX code, via GOMP_NVPTX_PTXRW, is extremely helpful. It helped me isolate one problem already. Thanks, Cesar