r331938 - [CUDA] Added -f[no-]cuda-short-ptr option

2018-05-09 Thread Artem Belevich via cfe-commits
Author: tra Date: Wed May 9 16:10:09 2018 New Revision: 331938 URL: http://llvm.org/viewvc/llvm-project?rev=331938&view=rev Log: [CUDA] Added -f[no-]cuda-short-ptr option The option enables use of 32-bit pointers for accessing const/local/shared memory. The feature is disabled by default. Diffe

r333098 - [CUDA] Fixed the list of GPUs supported by CUDA-9.

2018-05-23 Thread Artem Belevich via cfe-commits
Author: tra Date: Wed May 23 09:45:23 2018 New Revision: 333098 URL: http://llvm.org/viewvc/llvm-project?rev=333098&view=rev Log: [CUDA] Fixed the list of GPUs supported by CUDA-9. Differential Revision: https://reviews.llvm.org/D47268 Modified: cfe/trunk/lib/Driver/ToolChains/Cuda.cpp Modi

r314129 - [CUDA] Fix names of __nvvm_vote* intrinsics.

2017-09-25 Thread Artem Belevich via cfe-commits
Author: tra Date: Mon Sep 25 10:55:26 2017 New Revision: 314129 URL: http://llvm.org/viewvc/llvm-project?rev=314129&view=rev Log: [CUDA] Fix names of __nvvm_vote* intrinsics. Also fixed a syntax error in activemask(). Differential Revision: https://reviews.llvm.org/D38188 Modified: cfe/trun

r314135 - [NVPTX] added match.{any, all}.sync instructions, intrinsics & builtins.

2017-09-25 Thread Artem Belevich via cfe-commits
Author: tra Date: Mon Sep 25 11:53:57 2017 New Revision: 314135 URL: http://llvm.org/viewvc/llvm-project?rev=314135&view=rev Log: [NVPTX] added match.{any,all}.sync instructions, intrinsics & builtins. Differential Revision: https://reviews.llvm.org/D38191 Modified: cfe/trunk/include/clang/B

r314223 - [NVPTX] added match.{any, all}.sync instructions, intrinsics & builtins.

2017-09-26 Thread Artem Belevich via cfe-commits
Author: tra Date: Tue Sep 26 10:07:23 2017 New Revision: 314223 URL: http://llvm.org/viewvc/llvm-project?rev=314223&view=rev Log: [NVPTX] added match.{any,all}.sync instructions, intrinsics & builtins. Differential Revision: https://reviews.llvm.org/D38191 Modified: cfe/trunk/include/clang/B

r314334 - [CUDA] Work around conflicting function definitions in CUDA-9 headers.

2017-09-27 Thread Artem Belevich via cfe-commits
Author: tra Date: Wed Sep 27 12:07:15 2017 New Revision: 314334 URL: http://llvm.org/viewvc/llvm-project?rev=314334&view=rev Log: [CUDA] Work around conflicting function definitions in CUDA-9 headers. Differential Revision: https://reviews.llvm.org/D38326 Modified: cfe/trunk/lib/Headers/__cl

r315624 - [CUDA] Added __hmma_m16n16k16_* builtins to support mma instructions on sm_70

2017-10-12 Thread Artem Belevich via cfe-commits
Author: tra Date: Thu Oct 12 14:32:19 2017 New Revision: 315624 URL: http://llvm.org/viewvc/llvm-project?rev=315624&view=rev Log: [CUDA] Added __hmma_m16n16k16_* builtins to support mma instructions on sm_70 Differential Revision: https://reviews.llvm.org/D38742 Added: cfe/trunk/test/CodeGen

r341115 - Reverted the "[CUDA/OpenMP] Define only some host macros during device compilation"

2018-08-30 Thread Artem Belevich via cfe-commits
Author: tra Date: Thu Aug 30 13:43:39 2018 New Revision: 341115 URL: http://llvm.org/viewvc/llvm-project?rev=341115&view=rev Log: Reverted the "[CUDA/OpenMP] Define only some host macros during device compilation" The changes were breaking CUDA compilation. Reverted revisions: r340681 D50845

r341118 - Revert the tests that should've been reverted in rL341115

2018-08-30 Thread Artem Belevich via cfe-commits
Author: tra Date: Thu Aug 30 13:53:15 2018 New Revision: 341118 URL: http://llvm.org/viewvc/llvm-project?rev=341118&view=rev Log: Revert the tests that should've been reverted in rL341115 Removed: cfe/trunk/test/Preprocessor/aux-triple.c Modified: cfe/trunk/test/Preprocessor/predefined-ma

r337587 - [CUDA] Provide integer SIMD functions for CUDA-9.2

2018-07-20 Thread Artem Belevich via cfe-commits
Author: tra Date: Fri Jul 20 10:44:34 2018 New Revision: 337587 URL: http://llvm.org/viewvc/llvm-project?rev=337587&view=rev Log: [CUDA] Provide integer SIMD functions for CUDA-9.2 CUDA-9.2 made all integer SIMD functions into compiler builtins, so clang no longer has access to the implementation

r325626 - [CUDA] Added missing __threadfence_system() function for CUDA9.

2018-02-20 Thread Artem Belevich via cfe-commits
Author: tra Date: Tue Feb 20 13:25:30 2018 New Revision: 325626 URL: http://llvm.org/viewvc/llvm-project?rev=325626&view=rev Log: [CUDA] Added missing __threadfence_system() function for CUDA9. Modified: cfe/trunk/lib/Headers/__clang_cuda_device_functions.h Modified: cfe/trunk/lib/Headers/__

r325814 - [CUDA] Added missing functions.

2018-02-22 Thread Artem Belevich via cfe-commits
Author: tra Date: Thu Feb 22 10:40:52 2018 New Revision: 325814 URL: http://llvm.org/viewvc/llvm-project?rev=325814&view=rev Log: [CUDA] Added missing functions. Initial commit missed sincos(float), llabs() and few atomics that we used to pull in from device_functions.hpp, which we no longer incl

r321326 - [CUDA] More fixes for __shfl_* intrinsics.

2017-12-21 Thread Artem Belevich via cfe-commits
Author: tra Date: Thu Dec 21 15:52:09 2017 New Revision: 321326 URL: http://llvm.org/viewvc/llvm-project?rev=321326&view=rev Log: [CUDA] More fixes for __shfl_* intrinsics. * __shfl_{up,down}* uses unsigned int for the third parameter. * added [unsigned] long overloads for non-sync shuffles. Dif

r334108 - [CUDA] Replace 'nv_weak' attributes in CUDA headers with 'weak'.

2018-06-06 Thread Artem Belevich via cfe-commits
Author: tra Date: Wed Jun 6 10:52:55 2018 New Revision: 334108 URL: http://llvm.org/viewvc/llvm-project?rev=334108&view=rev Log: [CUDA] Replace 'nv_weak' attributes in CUDA headers with 'weak'. Differential Revision: https://reviews.llvm.org/D47804 Modified: cfe/trunk/lib/Headers/__clang_cu

r334143 - [CUDA] Check initializers of instantiated template variables.

2018-06-06 Thread Artem Belevich via cfe-commits
Author: tra Date: Wed Jun 6 15:37:25 2018 New Revision: 334143 URL: http://llvm.org/viewvc/llvm-project?rev=334143&view=rev Log: [CUDA] Check initializers of instantiated template variables. We were already performing checks on non-template variables, but the checks on templated ones were missin

r335168 - [CUDA] Removed unused __nvvm_* builtins with non-generic pointers.

2018-06-20 Thread Artem Belevich via cfe-commits
Author: tra Date: Wed Jun 20 13:34:04 2018 New Revision: 335168 URL: http://llvm.org/viewvc/llvm-project?rev=335168&view=rev Log: [CUDA] Removed unused __nvvm_* builtins with non-generic pointers. They were hot even hooked into CGBuiltin's machinery. Even if they were, CUDA does not support AS-sp

r348662 - [CUDA] Added missing 'inline' for functions defined in a header.

2018-12-07 Thread Artem Belevich via cfe-commits
Author: tra Date: Fri Dec 7 14:20:53 2018 New Revision: 348662 URL: http://llvm.org/viewvc/llvm-project?rev=348662&view=rev Log: [CUDA] Added missing 'inline' for functions defined in a header. Modified: cfe/trunk/lib/Headers/cuda_wrappers/new Modified: cfe/trunk/lib/Headers/cuda_wrappers/n

r359248 - [CUDA] Implemented _[bi]mma* builtins.

2019-04-25 Thread Artem Belevich via cfe-commits
Author: tra Date: Thu Apr 25 15:28:09 2019 New Revision: 359248 URL: http://llvm.org/viewvc/llvm-project?rev=359248&view=rev Log: [CUDA] Implemented _[bi]mma* builtins. These builtins provide access to the new integer and sub-integer variants of MMA (matrix multiply-accumulate) instructions provi

r359838 - [CUDA] Do not pass deprecated option fo fatbinary

2019-05-02 Thread Artem Belevich via cfe-commits
Author: tra Date: Thu May 2 15:37:19 2019 New Revision: 359838 URL: http://llvm.org/viewvc/llvm-project?rev=359838&view=rev Log: [CUDA] Do not pass deprecated option fo fatbinary CUDA 10.1 tools deprecated some command line options. fatbinary no longer needs --cuda. Differential Revision: https

[clang-tools-extra] r342514 - [clang-tidy] Replace redundant checks with an assert().

2018-09-18 Thread Artem Belevich via cfe-commits
Author: tra Date: Tue Sep 18 14:51:02 2018 New Revision: 342514 URL: http://llvm.org/viewvc/llvm-project?rev=342514&view=rev Log: [clang-tidy] Replace redundant checks with an assert(). findStyleKind is only called if D is an explicit identifier with a name, so the checks for operators will never

r342749 - [CUDA] Ignore uncallable functions when we check for usual deallocators.

2018-09-21 Thread Artem Belevich via cfe-commits
Author: tra Date: Fri Sep 21 10:29:33 2018 New Revision: 342749 URL: http://llvm.org/viewvc/llvm-project?rev=342749&view=rev Log: [CUDA] Ignore uncallable functions when we check for usual deallocators. Previously clang considered function variants from both sides of compilation and that resulted

r342752 - [CUDA] Fixed parsing of optional template-argument-list.

2018-09-21 Thread Artem Belevich via cfe-commits
Author: tra Date: Fri Sep 21 10:46:28 2018 New Revision: 342752 URL: http://llvm.org/viewvc/llvm-project?rev=342752&view=rev Log: [CUDA] Fixed parsing of optional template-argument-list. We need to consider all tokens that start with '>' when we're checking for the end of an empty template argume

r342924 - [CUDA] Added basic support for compiling with CUDA-10.0

2018-09-24 Thread Artem Belevich via cfe-commits
Author: tra Date: Mon Sep 24 16:10:44 2018 New Revision: 342924 URL: http://llvm.org/viewvc/llvm-project?rev=342924&view=rev Log: [CUDA] Added basic support for compiling with CUDA-10.0 Modified: cfe/trunk/include/clang/Basic/Cuda.h cfe/trunk/lib/Basic/Cuda.cpp cfe/trunk/lib/Basic/Tar

r343875 - [CUDA] Use all 64 bits of GUID in __nv_module_id

2018-10-05 Thread Artem Belevich via cfe-commits
Author: tra Date: Fri Oct 5 11:39:58 2018 New Revision: 343875 URL: http://llvm.org/viewvc/llvm-project?rev=343875&view=rev Log: [CUDA] Use all 64 bits of GUID in __nv_module_id getGUID() returns an uint64_t and "%x" only prints 32 bits of it. Use PRIx64 format string to print all 64 bits. Diff

r352798 - [CUDA] Propagate detected version of CUDA to cc1

2019-01-31 Thread Artem Belevich via cfe-commits
Author: tra Date: Thu Jan 31 13:32:24 2019 New Revision: 352798 URL: http://llvm.org/viewvc/llvm-project?rev=352798&view=rev Log: [CUDA] Propagate detected version of CUDA to cc1 ..and use it to control that parts of CUDA compilation that depend on the specific version of CUDA SDK. This patch ha

r352799 - [CUDA] add support for the new kernel launch API in CUDA-9.2+.

2019-01-31 Thread Artem Belevich via cfe-commits
Author: tra Date: Thu Jan 31 13:34:03 2019 New Revision: 352799 URL: http://llvm.org/viewvc/llvm-project?rev=352799&view=rev Log: [CUDA] add support for the new kernel launch API in CUDA-9.2+. Instead of calling CUDA runtime to arrange function arguments, the new API constructs arguments in a loc

r353232 - Basic CUDA-10 support.

2019-02-05 Thread Artem Belevich via cfe-commits
Author: tra Date: Tue Feb 5 14:38:58 2019 New Revision: 353232 URL: http://llvm.org/viewvc/llvm-project?rev=353232&view=rev Log: Basic CUDA-10 support. Differential Revision: https://reviews.llvm.org/D57771 Modified: cfe/trunk/include/clang/Basic/Cuda.h cfe/trunk/lib/Basic/Cuda.cpp

r312734 - [CUDA] Added rudimentary support for CUDA-9 and sm_70.

2017-09-07 Thread Artem Belevich via cfe-commits
Author: tra Date: Thu Sep 7 11:14:32 2017 New Revision: 312734 URL: http://llvm.org/viewvc/llvm-project?rev=312734&view=rev Log: [CUDA] Added rudimentary support for CUDA-9 and sm_70. For now CUDA-9 is not included in the list of CUDA versions clang searches for, so the path to CUDA-9 must be ex

r313369 - [CUDA] Work around a new quirk in CUDA9 headers.

2017-09-15 Thread Artem Belevich via cfe-commits
Author: tra Date: Fri Sep 15 10:30:53 2017 New Revision: 313369 URL: http://llvm.org/viewvc/llvm-project?rev=313369&view=rev Log: [CUDA] Work around a new quirk in CUDA9 headers. In CUDA-9 some of device-side math functions that we need are conditionally defined within '#if _GLIBCXX_MATH_H'. We n

r313820 - [NVPTX] Implemented shfl.sync instruction and supporting intrinsics/builtins.

2017-09-20 Thread Artem Belevich via cfe-commits
Author: tra Date: Wed Sep 20 14:23:07 2017 New Revision: 313820 URL: http://llvm.org/viewvc/llvm-project?rev=313820&view=rev Log: [NVPTX] Implemented shfl.sync instruction and supporting intrinsics/builtins. Differential Revision: https://reviews.llvm.org/D38090 Added: cfe/trunk/test/CodeGen

r313898 - [NVPTX] Implemented bar.warp.sync, barrier.sync, and vote{.sync} instructions/intrinsics/builtins.

2017-09-21 Thread Artem Belevich via cfe-commits
Author: tra Date: Thu Sep 21 11:44:49 2017 New Revision: 313898 URL: http://llvm.org/viewvc/llvm-project?rev=313898&view=rev Log: [NVPTX] Implemented bar.warp.sync, barrier.sync, and vote{.sync} instructions/intrinsics/builtins. Differential Revision: https://reviews.llvm.org/D38148 Modified:

r313899 - [CUDA] Fixed order of words in the names of shfl builtins.

2017-09-21 Thread Artem Belevich via cfe-commits
Author: tra Date: Thu Sep 21 11:46:39 2017 New Revision: 313899 URL: http://llvm.org/viewvc/llvm-project?rev=313899&view=rev Log: [CUDA] Fixed order of words in the names of shfl builtins. Differential Revision: https://reviews.llvm.org/D38147 Modified: cfe/trunk/lib/Headers/__clang_cuda_int

r349087 - [CUDA] Make all host-side shadows of device-side variables undef.

2018-12-13 Thread Artem Belevich via cfe-commits
Author: tra Date: Thu Dec 13 13:43:04 2018 New Revision: 349087 URL: http://llvm.org/viewvc/llvm-project?rev=349087&view=rev Log: [CUDA] Make all host-side shadows of device-side variables undef. The host-side code can't (and should not) access the values that may only exist on the device side. E

r349981 - [CUDA] Treat extern global variable shadows same as regular extern vars.

2018-12-21 Thread Artem Belevich via cfe-commits
Author: tra Date: Fri Dec 21 17:11:09 2018 New Revision: 349981 URL: http://llvm.org/viewvc/llvm-project?rev=349981&view=rev Log: [CUDA] Treat extern global variable shadows same as regular extern vars. This fixes compiler crash when we attempted to compile this code: extern __device__ int data;

r330280 - [CUDA] added missing __ldg(const signed char *)

2018-04-18 Thread Artem Belevich via cfe-commits
Author: tra Date: Wed Apr 18 11:33:43 2018 New Revision: 330280 URL: http://llvm.org/viewvc/llvm-project?rev=330280&view=rev Log: [CUDA] added missing __ldg(const signed char *) Differential Revision: https://reviews.llvm.org/D45780 Modified: cfe/trunk/lib/Headers/__clang_cuda_intrinsics.h

r330296 - [NVPTX, CUDA] Added support for m8n32k16 and m32n8k16 variants of wmma instructions.

2018-04-18 Thread Artem Belevich via cfe-commits
Author: tra Date: Wed Apr 18 14:51:48 2018 New Revision: 330296 URL: http://llvm.org/viewvc/llvm-project?rev=330296&view=rev Log: [NVPTX, CUDA] Added support for m8n32k16 and m32n8k16 variants of wmma instructions. The new instructions were added added for sm_70+ GPUs in CUDA-9.1. Differential

r330753 - [CUDA] Enable CUDA compilation with CUDA-9.2

2018-04-24 Thread Artem Belevich via cfe-commits
Author: tra Date: Tue Apr 24 11:23:19 2018 New Revision: 330753 URL: http://llvm.org/viewvc/llvm-project?rev=330753&view=rev Log: [CUDA] Enable CUDA compilation with CUDA-9.2 Differential Revision: https://reviews.llvm.org/D45827 Modified: cfe/trunk/include/clang/Basic/Cuda.h cfe/trunk/l

r335763 - [CUDA] Use atexit() to call module destructor.

2018-06-27 Thread Artem Belevich via cfe-commits
Author: tra Date: Wed Jun 27 11:32:51 2018 New Revision: 335763 URL: http://llvm.org/viewvc/llvm-project?rev=335763&view=rev Log: [CUDA] Use atexit() to call module destructor. This matches the way NVCC does it. Doing module cleanup at global destructor phase used to work, but is, apparently, too

Re: r335740 - [analyzer] Allow registering custom statically-linked analyzer checkers

2018-06-27 Thread Artem Belevich via cfe-commits
FYI, This commit breaks clang tests. It appears that StaticAnalysisTests misses dependency on clangFrontend. --Artem [60/134] Linking CXX executable tools/clang/unittests/StaticAnalyzer/StaticAnalysisTests FAILED: tools/clang/unittests/StaticAnalyzer/StaticAnalysisTests : && /usr/local/google/ho

r335880 - [CUDA] Place all CUDA sections in __NV_CUDA segment on Mac.

2018-06-28 Thread Artem Belevich via cfe-commits
Author: tra Date: Thu Jun 28 10:15:52 2018 New Revision: 335880 URL: http://llvm.org/viewvc/llvm-project?rev=335880&view=rev Log: [CUDA] Place all CUDA sections in __NV_CUDA segment on Mac. That's where CUDA binaries appear to put them. Differential Revision: https://reviews.llvm.org/D48615 Mod

Re: [clang] 8527c1e - Added constraints on cl-options.cu test

2020-04-06 Thread Artem Belevich via cfe-commits
uot; > > $ "c:\src\llvm_package_64c23127\build32_stage0\bin\filecheck.exe" > > "-check-prefix=GS-default" > > "C:\src\llvm_package_64c23127\llvm-project\clang\test\Driver\ > cl-options.cu" > > # command stderr: > &

[clang] d2e498b - [CUDA] Improve testing of libdevice detection.

2020-04-08 Thread Artem Belevich via cfe-commits
Author: Artem Belevich Date: 2020-04-08T11:19:45-07:00 New Revision: d2e498b1725dd7b792c061387ff76df71cd7728a URL: https://github.com/llvm/llvm-project/commit/d2e498b1725dd7b792c061387ff76df71cd7728a DIFF: https://github.com/llvm/llvm-project/commit/d2e498b1725dd7b792c061387ff76df71cd7728a.diff

[clang] a9627b7 - [CUDA] Add partial support for recent CUDA versions.

2020-04-08 Thread Artem Belevich via cfe-commits
Author: Artem Belevich Date: 2020-04-08T11:19:44-07:00 New Revision: a9627b7ea7e2b47488188cb1d80a23b24a8066b2 URL: https://github.com/llvm/llvm-project/commit/a9627b7ea7e2b47488188cb1d80a23b24a8066b2 DIFF: https://github.com/llvm/llvm-project/commit/a9627b7ea7e2b47488188cb1d80a23b24a8066b2.diff

[clang] 33386b2 - [CUDA] Simplify GPU variant handling. NFC.

2020-04-08 Thread Artem Belevich via cfe-commits
Author: Artem Belevich Date: 2020-04-08T11:19:43-07:00 New Revision: 33386b20aa82c73e28b871cfa35c89a3808c9f92 URL: https://github.com/llvm/llvm-project/commit/33386b20aa82c73e28b871cfa35c89a3808c9f92 DIFF: https://github.com/llvm/llvm-project/commit/33386b20aa82c73e28b871cfa35c89a3808c9f92.diff

[clang] 6ed88af - [CUDA] Accept -x cu to indicate language is CUDA, transfer CUDA language flag to header-file arguments

2020-04-09 Thread Artem Belevich via cfe-commits
Author: ADRA Date: 2020-04-09T13:08:41-07:00 New Revision: 6ed88afd780cc2cd04e50e25c5d3ffafc07b1c1b URL: https://github.com/llvm/llvm-project/commit/6ed88afd780cc2cd04e50e25c5d3ffafc07b1c1b DIFF: https://github.com/llvm/llvm-project/commit/6ed88afd780cc2cd04e50e25c5d3ffafc07b1c1b.diff LOG: [CU

[clang] 8c635ba - [CUDA] Fix missed CUDA version mappings.

2020-04-13 Thread Artem Belevich via cfe-commits
Author: Artem Belevich Date: 2020-04-13T15:54:12-07:00 New Revision: 8c635ba4a84f8b5dbde1984fe2d7eeefaf827ffe URL: https://github.com/llvm/llvm-project/commit/8c635ba4a84f8b5dbde1984fe2d7eeefaf827ffe DIFF: https://github.com/llvm/llvm-project/commit/8c635ba4a84f8b5dbde1984fe2d7eeefaf827ffe.diff

[clang] d700237 - [CUDA,HIP] Use VFS for SDK detection.

2020-06-15 Thread Artem Belevich via cfe-commits
Author: Artem Belevich Date: 2020-06-15T12:54:44-07:00 New Revision: d700237f1aa1bc05d584a0f50fdad89370e17987 URL: https://github.com/llvm/llvm-project/commit/d700237f1aa1bc05d584a0f50fdad89370e17987 DIFF: https://github.com/llvm/llvm-project/commit/d700237f1aa1bc05d584a0f50fdad89370e17987.diff

[clang] ac20150 - [CUDA] make the test more hermetic

2020-06-17 Thread Artem Belevich via cfe-commits
Author: Artem Belevich Date: 2020-06-17T15:22:45-07:00 New Revision: ac20150e299a41ade860f432741c1b8557ac8058 URL: https://github.com/llvm/llvm-project/commit/ac20150e299a41ade860f432741c1b8557ac8058 DIFF: https://github.com/llvm/llvm-project/commit/ac20150e299a41ade860f432741c1b8557ac8058.diff

[clang] 8e20516 - [CUDA] Define __CUDACC__ before standard library headers

2020-04-17 Thread Artem Belevich via cfe-commits
Author: Raul Tambre Date: 2020-04-17T12:56:13-07:00 New Revision: 8e20516540444618ad32dd11e835c05804053697 URL: https://github.com/llvm/llvm-project/commit/8e20516540444618ad32dd11e835c05804053697 DIFF: https://github.com/llvm/llvm-project/commit/8e20516540444618ad32dd11e835c05804053697.diff L

[clang] 7d057ef - [CUDA] Work around a bug in rint/nearbyint caused by a broken implementation provided by CUDA.

2020-08-05 Thread Artem Belevich via cfe-commits
Author: Artem Belevich Date: 2020-08-05T13:13:48-07:00 New Revision: 7d057efddc00ba7d03e6e684f23dd9b09fbd0527 URL: https://github.com/llvm/llvm-project/commit/7d057efddc00ba7d03e6e684f23dd9b09fbd0527 DIFF: https://github.com/llvm/llvm-project/commit/7d057efddc00ba7d03e6e684f23dd9b09fbd0527.diff

[clang] 9c8ae40 - [ARM] Speed up arm-cortex-cpus.c test

2020-08-10 Thread Artem Belevich via cfe-commits
Author: Artem Belevich Date: 2020-08-10T14:27:19-07:00 New Revision: 9c8ae40860311e94de0a898101818f706228e958 URL: https://github.com/llvm/llvm-project/commit/9c8ae40860311e94de0a898101818f706228e958 DIFF: https://github.com/llvm/llvm-project/commit/9c8ae40860311e94de0a898101818f706228e958.diff

[clang] ec5f793 - [OpenMP] split execution of a long test into smaller parts.

2020-08-11 Thread Artem Belevich via cfe-commits
Author: Artem Belevich Date: 2020-08-11T11:52:40-07:00 New Revision: ec5f793996f4dc86d339db88c0836e0cf4e8abea URL: https://github.com/llvm/llvm-project/commit/ec5f793996f4dc86d339db88c0836e0cf4e8abea DIFF: https://github.com/llvm/llvm-project/commit/ec5f793996f4dc86d339db88c0836e0cf4e8abea.diff

[clang] bf6a26b - Revert D77954 -- it breaks Eigen & Tensorflow.

2020-05-05 Thread Artem Belevich via cfe-commits
Author: Artem Belevich Date: 2020-05-05T14:07:31-07:00 New Revision: bf6a26b066382e0f41bf023c781d84061c542307 URL: https://github.com/llvm/llvm-project/commit/bf6a26b066382e0f41bf023c781d84061c542307 DIFF: https://github.com/llvm/llvm-project/commit/bf6a26b066382e0f41bf023c781d84061c542307.diff

[clang] 844096b - [CUDA] Make NVVM builtins available with CUDA-11/PTX6.5

2020-05-05 Thread Artem Belevich via cfe-commits
Author: Artem Belevich Date: 2020-05-05T15:43:32-07:00 New Revision: 844096b996a0b17d3f380af323614a5fe31b8a68 URL: https://github.com/llvm/llvm-project/commit/844096b996a0b17d3f380af323614a5fe31b8a68 DIFF: https://github.com/llvm/llvm-project/commit/844096b996a0b17d3f380af323614a5fe31b8a68.diff

[clang] 314f99e - [CUDA] Enable existing builtins for PTX7.0 as well.

2020-05-06 Thread Artem Belevich via cfe-commits
Author: Artem Belevich Date: 2020-05-06T14:24:21-07:00 New Revision: 314f99e7d42ded663386190a54b5831dc4a6f3c1 URL: https://github.com/llvm/llvm-project/commit/314f99e7d42ded663386190a54b5831dc4a6f3c1 DIFF: https://github.com/llvm/llvm-project/commit/314f99e7d42ded663386190a54b5831dc4a6f3c1.diff

[clang] ef649e8 - Revert "[CUDA][HIP] Workaround for resolving host device function against wrong-sided function"

2020-05-18 Thread Artem Belevich via cfe-commits
Author: Artem Belevich Date: 2020-05-18T12:22:55-07:00 New Revision: ef649e8fd5d1748764a9afca3ce0b80113a6a239 URL: https://github.com/llvm/llvm-project/commit/ef649e8fd5d1748764a9afca3ce0b80113a6a239 DIFF: https://github.com/llvm/llvm-project/commit/ef649e8fd5d1748764a9afca3ce0b80113a6a239.diff

[PATCH] D24944: [CUDA] Added __nvvm_atom_{sys|cta}_* builtins for sm_60 GPUs.

2016-09-26 Thread Artem Belevich via cfe-commits
tra created this revision. tra added a reviewer: jlebar. tra added a subscriber: cfe-commits. Herald added subscribers: jlebar, jholewinski. https://reviews.llvm.org/D24944 Files: include/clang/Basic/BuiltinsNVPTX.def lib/Basic/Targets.cpp lib/CodeGen/CGBuiltin.cpp test/CodeGen/builtins-n

[PATCH] D24946: [CUDA] Added support for CUDA-8

2016-09-26 Thread Artem Belevich via cfe-commits
tra created this revision. tra added a reviewer: jlebar. tra added a subscriber: cfe-commits. Herald added a subscriber: jlebar. https://reviews.llvm.org/D24946 Files: lib/Driver/ToolChains.cpp lib/Headers/__clang_cuda_runtime_wrapper.h test/Driver/Inputs/CUDA/usr/local/cuda/nvvm/libdevice

Re: [PATCH] D24975: [CUDA] Add #pragma clang force_cuda_host_device_{begin, end} pragmas.

2016-09-27 Thread Artem Belevich via cfe-commits
tra added a comment. LGTM. Should we add new pragma description to docs/LanguageExtensions.rst ? https://reviews.llvm.org/D24975 ___ cfe-commits mailing list cfe-commits@lists.llvm.org http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

Re: [PATCH] D24977: [CUDA] Declare our __device__ math functions in the same inline namespace as our standard library.

2016-09-27 Thread Artem Belevich via cfe-commits
tra accepted this revision. tra added a comment. This revision is now accepted and ready to land. That is way too much knowledge about details of standard library implementation. If it changes, I suspect users will end up with a rather uninformative error. Is there a way to produce somewhat more s

Re: [PATCH] D24979: [CUDA] Support and std::min/max on the device.

2016-09-27 Thread Artem Belevich via cfe-commits
tra added a subscriber: echristo. tra added a comment. This looks like fix-includes and it may be somewhat shaky if users start messing with include paths. You may want to get @echristo's input on that. I' personally would prefer to force-include these files. I suspect it will not change things

Re: [PATCH] D24977: [CUDA] Declare our __device__ math functions in the same inline namespace as our standard library.

2016-09-27 Thread Artem Belevich via cfe-commits
tra added a comment. OK. https://reviews.llvm.org/D24977 ___ cfe-commits mailing list cfe-commits@lists.llvm.org http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

Re: [PATCH] D24946: [CUDA] Added support for CUDA-8

2016-09-27 Thread Artem Belevich via cfe-commits
tra marked an inline comment as done. Comment at: lib/Headers/__clang_cuda_runtime_wrapper.h:156 @@ +155,3 @@ +#endif +#endif + jlebar wrote: > I don't understand what we are doing here... > > We're saying, if __USE_FAST_MATH__ is defined, and if it's not equal t

Re: [PATCH] D24946: [CUDA] Added support for CUDA-8

2016-09-27 Thread Artem Belevich via cfe-commits
tra updated this revision to Diff 72707. tra added a comment. addressed Justin's comments. https://reviews.llvm.org/D24946 Files: lib/Driver/ToolChains.cpp lib/Headers/__clang_cuda_runtime_wrapper.h test/Driver/Inputs/CUDA/usr/local/cuda/nvvm/libdevice/libdevice.compute_20.10.bc test/

r282609 - [CUDA] added __nvvm_atom_{sys|cta}_* builtins.

2016-09-28 Thread Artem Belevich via cfe-commits
Author: tra Date: Wed Sep 28 12:47:35 2016 New Revision: 282609 URL: http://llvm.org/viewvc/llvm-project?rev=282609&view=rev Log: [CUDA] added __nvvm_atom_{sys|cta}_* builtins. These builtins are available on sm_60+ GPU only. Differential Revision: https://reviews.llvm.org/D24944 Modified:

r282610 - [CUDA] Added support for CUDA-8

2016-09-28 Thread Artem Belevich via cfe-commits
Author: tra Date: Wed Sep 28 12:47:40 2016 New Revision: 282610 URL: http://llvm.org/viewvc/llvm-project?rev=282610&view=rev Log: [CUDA] Added support for CUDA-8 Differential Revision: https://reviews.llvm.org/D24946 Added: cfe/trunk/test/Driver/Inputs/CUDA/usr/local/cuda/nvvm/libdevice/lib

Re: [PATCH] D24944: [CUDA] Added __nvvm_atom_{sys|cta}_* builtins for sm_60 GPUs.

2016-09-28 Thread Artem Belevich via cfe-commits
This revision was automatically updated to reflect the committed changes. Closed by commit rL282609: [CUDA] added __nvvm_atom_{sys|cta}_* builtins. (authored by tra). Changed prior to commit: https://reviews.llvm.org/D24944?vs=72584&id=72862#toc Repository: rL LLVM https://reviews.llvm.org/

Re: [PATCH] D24946: [CUDA] Added support for CUDA-8

2016-09-28 Thread Artem Belevich via cfe-commits
This revision was automatically updated to reflect the committed changes. Closed by commit rL282610: [CUDA] Added support for CUDA-8 (authored by tra). Changed prior to commit: https://reviews.llvm.org/D24946?vs=72707&id=72863#toc Repository: rL LLVM https://reviews.llvm.org/D24946 Files:

Re: [PATCH] D25036: [CUDA] Disallow exceptions in device code.

2016-09-28 Thread Artem Belevich via cfe-commits
tra accepted this revision. tra added a comment. This revision is now accepted and ready to land. One question, LGTM otherwise. Comment at: clang/lib/Sema/SemaExprCXX.cpp:688 @@ +687,3 @@ + if (getLangOpts().CUDA) +CheckCUDAExceptionExpr(OpLoc, "throw"); +

[PATCH] D25125: [CUDA] Disallow 'extern __shared__' variables.

2016-09-30 Thread Artem Belevich via cfe-commits
tra accepted this revision. tra added a reviewer: tra. tra added a comment. This revision is now accepted and ready to land. LGTM. https://reviews.llvm.org/D25125 ___ cfe-commits mailing list cfe-commits@lists.llvm.org http://lists.llvm.org/cgi-bin/

[PATCH] D25129: [CUDA] Disallow __constant__ local variables.

2016-09-30 Thread Artem Belevich via cfe-commits
tra accepted this revision. tra added a comment. This revision is now accepted and ready to land. LGTM. > DiagnosticSemaKinds.td:6727 > def err_cuda_extern_shared : Error<"__shared__ variable %0 cannot be > 'extern'">; > +def err_cuda_nonglobal_constant : Error<"__constant__ variables must be

[PATCH] D25129: [CUDA] Disallow __constant__ local variables.

2016-09-30 Thread Artem Belevich via cfe-commits
tra added inline comments. > jlebar wrote in DiagnosticSemaKinds.td:6727 > That's still a "global variable"? Or do you think calling it such will be > confusing? It's not clear whether you mean global storage class or global namespace. The code checks for global storage, but error message is c

r283964 - Added REQUIRED triples to the test that fails on some ARM buildbots.

2016-10-11 Thread Artem Belevich via cfe-commits
Author: tra Date: Tue Oct 11 21:08:08 2016 New Revision: 283964 URL: http://llvm.org/viewvc/llvm-project?rev=283964&view=rev Log: Added REQUIRED triples to the test that fails on some ARM buildbots. Modified: cfe/trunk/test/SemaCUDA/function-overload-hd.cu Modified: cfe/trunk/test/SemaCUDA/f

r319201 - [CUDA] Report "unsupported VLA" errors only on device side.

2017-11-28 Thread Artem Belevich via cfe-commits
Author: tra Date: Tue Nov 28 10:51:42 2017 New Revision: 319201 URL: http://llvm.org/viewvc/llvm-project?rev=319201&view=rev Log: [CUDA] Report "unsupported VLA" errors only on device side. This fixes erroneously reported CUDA compilation errors in host-side code during device-side compilation.

r319485 - [CUDA] Tweak CUDA wrappers to make cuda-9 work with libc++

2017-11-30 Thread Artem Belevich via cfe-commits
Author: tra Date: Thu Nov 30 14:22:21 2017 New Revision: 319485 URL: http://llvm.org/viewvc/llvm-project?rev=319485&view=rev Log: [CUDA] Tweak CUDA wrappers to make cuda-9 work with libc++ CUDA-9 headers check for specific libc++ version and ifdef out some of the definitions we need if LIBCPP_VER

r319908 - [CUDA] Added overloads for '[unsigned] long' variants of shfl builtins.

2017-12-06 Thread Artem Belevich via cfe-commits
Author: tra Date: Wed Dec 6 09:40:35 2017 New Revision: 319908 URL: http://llvm.org/viewvc/llvm-project?rev=319908&view=rev Log: [CUDA] Added overloads for '[unsigned] long' variants of shfl builtins. Differential Revision: https://reviews.llvm.org/D40871 Modified: cfe/trunk/lib/Headers/__c

r319909 - [NVPTX, CUDA] Added llvm.nvvm.fns intrinsic and matching __nvvm_fns builtin in clang.

2017-12-06 Thread Artem Belevich via cfe-commits
Author: tra Date: Wed Dec 6 09:50:05 2017 New Revision: 319909 URL: http://llvm.org/viewvc/llvm-project?rev=319909&view=rev Log: [NVPTX,CUDA] Added llvm.nvvm.fns intrinsic and matching __nvvm_fns builtin in clang. Differential Revision: https://reviews.llvm.org/D40872 Modified: cfe/trunk/

[clang] 54c47ff - [CUDA] Allow using -o with -fsyntax-only

2022-09-01 Thread Artem Belevich via cfe-commits
Author: Artem Belevich Date: 2022-09-01T15:52:36-07:00 New Revision: 54c47ff9398fbd5fa7e4120b3286adfb4f736ec8 URL: https://github.com/llvm/llvm-project/commit/54c47ff9398fbd5fa7e4120b3286adfb4f736ec8 DIFF: https://github.com/llvm/llvm-project/commit/54c47ff9398fbd5fa7e4120b3286adfb4f736ec8.diff

[clang] [llvm] [openmp] ReworkCtorDtor (PR #71739)

2023-11-08 Thread Artem Belevich via cfe-commits
@@ -95,7 +95,7 @@ using namespace llvm; static cl::opt LowerCtorDtor("nvptx-lower-global-ctor-dtor", cl::desc("Lower GPU ctor / dtors to globals on the device."), - cl::init(false), cl::Hidden); + cl::init(true), cl::Hidde

[openmp] [llvm] [clang] ReworkCtorDtor (PR #71739)

2023-11-08 Thread Artem Belevich via cfe-commits
@@ -95,7 +95,7 @@ using namespace llvm; static cl::opt LowerCtorDtor("nvptx-lower-global-ctor-dtor", cl::desc("Lower GPU ctor / dtors to globals on the device."), - cl::init(false), cl::Hidden); + cl::init(true), cl::Hidde

[clang] [CUDA][HIP] Make template implicitly host device (PR #70369)

2023-11-09 Thread Artem Belevich via cfe-commits
Artem-B wrote: Now that we're making an even larger class of functions implicitly HD, the last logical step would be to make *all* unattributed functions implicitly HD, too (in a separate patch). After all, a template is as GPU-portable (or not) as a regular function. Unlike constexpr or compi

[clang] [CUDA][HIP] Make template implicitly host device (PR #70369)

2023-11-09 Thread Artem Belevich via cfe-commits
https://github.com/Artem-B approved this pull request. https://github.com/llvm/llvm-project/pull/70369 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[compiler-rt] [clang] [llvm] [HIP] support 128 bit int division (PR #71978)

2023-11-10 Thread Artem Belevich via cfe-commits
Artem-B wrote: Would it be feasible to consider switching to the new offloading driver mode and really link with the library instead? It may be a conveniently isolated use case with little/no existing users that would disrupt. https://github.com/llvm/llvm-project/pull/71978 __

[compiler-rt] [clang] [llvm] [HIP] support 128 bit int division (PR #71978)

2023-11-13 Thread Artem Belevich via cfe-commits
Artem-B wrote: > I don't think we're in a position to actually enable that at this time. We > still don't have everything necessary to provide object linking, which this > seems to rely on OK. IR it is. https://github.com/llvm/llvm-project/pull/71978 __

[clang] [CUDA][Win32] Add `fma(long double,..)` to math forward declares. (PR #73756)

2023-12-04 Thread Artem Belevich via cfe-commits
@@ -70,6 +70,9 @@ __DEVICE__ double floor(double); __DEVICE__ float floor(float); __DEVICE__ double fma(double, double, double); __DEVICE__ float fma(float, float, float); +#ifdef _MSC_VER +__DEVICE__ long double fma(long double, long double, long double); Arte

[clang] [CUDA][HIP] Exclude external variables from constant promotion. (PR #73549)

2023-12-05 Thread Artem Belevich via cfe-commits
@@ -104,3 +106,14 @@ void fun() { (void) b; (void) var_host_only; } + +extern __global__ void external_func(); +extern void* const external_dep[] = { + (void*)(external_func) +}; +extern void* const external_arr[] = {}; + +void* host_fun() { + (void) external_dep; + (voi

[clang] [CUDA][HIP] Exclude external variables from constant promotion. (PR #73549)

2023-12-05 Thread Artem Belevich via cfe-commits
@@ -104,3 +106,14 @@ void fun() { (void) b; (void) var_host_only; } + +extern __global__ void external_func(); +extern void* const external_dep[] = { Artem-B wrote: This array is nomiannly host-only entity and should not be emitted on GPU at all, IMO. In

[clang] [CUDA][Win32] Add `fma(long double,..)` to math forward declares. (PR #73756)

2023-12-05 Thread Artem Belevich via cfe-commits
@@ -70,6 +70,9 @@ __DEVICE__ double floor(double); __DEVICE__ float floor(float); __DEVICE__ double fma(double, double, double); __DEVICE__ float fma(float, float, float); +#ifdef _MSC_VER +__DEVICE__ long double fma(long double, long double, long double); Arte

[clang] [CUDA][Win32] Add `fma(long double,..)` to math forward declares. (PR #73756)

2023-12-06 Thread Artem Belevich via cfe-commits
@@ -70,6 +70,9 @@ __DEVICE__ double floor(double); __DEVICE__ float floor(float); __DEVICE__ double fma(double, double, double); __DEVICE__ float fma(float, float, float); +#ifdef _MSC_VER +__DEVICE__ long double fma(long double, long double, long double); Arte

[clang] [CUDA][Win32] Add `fma(long double,..)` to math forward declares. (PR #73756)

2023-12-07 Thread Artem Belevich via cfe-commits
Artem-B wrote: I'm not familiar enough with MSVC. @rnk -- what's the best way to check for compilation with microsoft's stardard C++ library? https://github.com/llvm/llvm-project/pull/73756 ___ cfe-commits mailing list cfe-commits@lists.llvm.org htt

[clang] [llvm] [CUDA] Add support for CUDA-12.3 and sm_90a (PR #74895)

2023-12-08 Thread Artem Belevich via cfe-commits
https://github.com/Artem-B created https://github.com/llvm/llvm-project/pull/74895 None >From 3ce8e08b94e33480139e13ca9f0fd7b719ff2c3d Mon Sep 17 00:00:00 2001 From: Artem Belevich Date: Wed, 6 Dec 2023 12:11:38 -0800 Subject: [PATCH] [CUDA] Add support for CUDA-12.3 and sm_90a --- clang/doc

[clang] [llvm] [CUDA] Add support for CUDA-12.3 and sm_90a (PR #74895)

2023-12-08 Thread Artem Belevich via cfe-commits
Artem-B wrote: @ezhulenev FYI. https://github.com/llvm/llvm-project/pull/74895 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[clang] [llvm] [CUDA] Add support for CUDA-12.3 and sm_90a (PR #74895)

2023-12-08 Thread Artem Belevich via cfe-commits
https://github.com/Artem-B edited https://github.com/llvm/llvm-project/pull/74895 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[llvm] [clang] [CUDA] Add support for CUDA-12.3 and sm_90a (PR #74895)

2023-12-08 Thread Artem Belevich via cfe-commits
@@ -80,8 +85,10 @@ class NVPTXSubtarget : public NVPTXGenSubtargetInfo { bool allowFP16Math() const; bool hasMaskOperator() const { return PTXVersion >= 71; } bool hasNoReturn() const { return SmVersion >= 30 && PTXVersion >= 64; } - unsigned int getSmVersion() const { r

[llvm] [clang] [CUDA] Add support for CUDA-12.3 and sm_90a (PR #74895)

2023-12-08 Thread Artem Belevich via cfe-commits
https://github.com/Artem-B edited https://github.com/llvm/llvm-project/pull/74895 ___ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[llvm] [clang] [CUDA] Add support for CUDA-12.3 and sm_90a (PR #74895)

2023-12-08 Thread Artem Belevich via cfe-commits
https://github.com/Artem-B updated https://github.com/llvm/llvm-project/pull/74895 >From 3ce8e08b94e33480139e13ca9f0fd7b719ff2c3d Mon Sep 17 00:00:00 2001 From: Artem Belevich Date: Wed, 6 Dec 2023 12:11:38 -0800 Subject: [PATCH 1/2] [CUDA] Add support for CUDA-12.3 and sm_90a --- clang/docs/

[llvm] [clang] [CUDA] Add support for CUDA-12.3 and sm_90a (PR #74895)

2023-12-08 Thread Artem Belevich via cfe-commits
@@ -80,8 +85,10 @@ class NVPTXSubtarget : public NVPTXGenSubtargetInfo { bool allowFP16Math() const; bool hasMaskOperator() const { return PTXVersion >= 71; } bool hasNoReturn() const { return SmVersion >= 30 && PTXVersion >= 64; } - unsigned int getSmVersion() const { r

[llvm] [clang] [CUDA] Add support for CUDA-12.3 and sm_90a (PR #74895)

2023-12-08 Thread Artem Belevich via cfe-commits
https://github.com/Artem-B updated https://github.com/llvm/llvm-project/pull/74895 >From 3ce8e08b94e33480139e13ca9f0fd7b719ff2c3d Mon Sep 17 00:00:00 2001 From: Artem Belevich Date: Wed, 6 Dec 2023 12:11:38 -0800 Subject: [PATCH 1/3] [CUDA] Add support for CUDA-12.3 and sm_90a --- clang/docs/

[clang] [llvm] [CUDA] Add support for CUDA-12.3 and sm_90a (PR #74895)

2023-12-08 Thread Artem Belevich via cfe-commits
https://github.com/Artem-B updated https://github.com/llvm/llvm-project/pull/74895 >From 3ce8e08b94e33480139e13ca9f0fd7b719ff2c3d Mon Sep 17 00:00:00 2001 From: Artem Belevich Date: Wed, 6 Dec 2023 12:11:38 -0800 Subject: [PATCH 1/3] [CUDA] Add support for CUDA-12.3 and sm_90a --- clang/docs/

[compiler-rt] [clang] [clang-tools-extra] [flang] [llvm] [Legalizer] Expand fmaximum and fminimum (PR #67301)

2023-12-11 Thread Artem Belevich via cfe-commits
Artem-B wrote: What are the next steps needed to move this patch forward? It would be great if we could land it, as we already have real-world need for lowering fminimum/fmaximum on targets that do not have native support for it. https://github.com/llvm/llvm-project/pull/67301 ___

  1   2   3   4   5   6   7   8   9   10   >