Author: tra
Date: Wed May 9 16:10:09 2018
New Revision: 331938
URL: http://llvm.org/viewvc/llvm-project?rev=331938&view=rev
Log:
[CUDA] Added -f[no-]cuda-short-ptr option
The option enables use of 32-bit pointers for accessing
const/local/shared memory. The feature is disabled by default.
Diffe
Author: tra
Date: Wed May 23 09:45:23 2018
New Revision: 333098
URL: http://llvm.org/viewvc/llvm-project?rev=333098&view=rev
Log:
[CUDA] Fixed the list of GPUs supported by CUDA-9.
Differential Revision: https://reviews.llvm.org/D47268
Modified:
cfe/trunk/lib/Driver/ToolChains/Cuda.cpp
Modi
Author: tra
Date: Mon Sep 25 10:55:26 2017
New Revision: 314129
URL: http://llvm.org/viewvc/llvm-project?rev=314129&view=rev
Log:
[CUDA] Fix names of __nvvm_vote* intrinsics.
Also fixed a syntax error in activemask().
Differential Revision: https://reviews.llvm.org/D38188
Modified:
cfe/trun
Author: tra
Date: Mon Sep 25 11:53:57 2017
New Revision: 314135
URL: http://llvm.org/viewvc/llvm-project?rev=314135&view=rev
Log:
[NVPTX] added match.{any,all}.sync instructions, intrinsics & builtins.
Differential Revision: https://reviews.llvm.org/D38191
Modified:
cfe/trunk/include/clang/B
Author: tra
Date: Tue Sep 26 10:07:23 2017
New Revision: 314223
URL: http://llvm.org/viewvc/llvm-project?rev=314223&view=rev
Log:
[NVPTX] added match.{any,all}.sync instructions, intrinsics & builtins.
Differential Revision: https://reviews.llvm.org/D38191
Modified:
cfe/trunk/include/clang/B
Author: tra
Date: Wed Sep 27 12:07:15 2017
New Revision: 314334
URL: http://llvm.org/viewvc/llvm-project?rev=314334&view=rev
Log:
[CUDA] Work around conflicting function definitions in CUDA-9 headers.
Differential Revision: https://reviews.llvm.org/D38326
Modified:
cfe/trunk/lib/Headers/__cl
Author: tra
Date: Thu Oct 12 14:32:19 2017
New Revision: 315624
URL: http://llvm.org/viewvc/llvm-project?rev=315624&view=rev
Log:
[CUDA] Added __hmma_m16n16k16_* builtins to support mma instructions on sm_70
Differential Revision: https://reviews.llvm.org/D38742
Added:
cfe/trunk/test/CodeGen
Author: tra
Date: Thu Aug 30 13:43:39 2018
New Revision: 341115
URL: http://llvm.org/viewvc/llvm-project?rev=341115&view=rev
Log:
Reverted the "[CUDA/OpenMP] Define only some host macros during device
compilation"
The changes were breaking CUDA compilation.
Reverted revisions:
r340681 D50845
Author: tra
Date: Thu Aug 30 13:53:15 2018
New Revision: 341118
URL: http://llvm.org/viewvc/llvm-project?rev=341118&view=rev
Log:
Revert the tests that should've been reverted in rL341115
Removed:
cfe/trunk/test/Preprocessor/aux-triple.c
Modified:
cfe/trunk/test/Preprocessor/predefined-ma
Author: tra
Date: Fri Jul 20 10:44:34 2018
New Revision: 337587
URL: http://llvm.org/viewvc/llvm-project?rev=337587&view=rev
Log:
[CUDA] Provide integer SIMD functions for CUDA-9.2
CUDA-9.2 made all integer SIMD functions into compiler builtins,
so clang no longer has access to the implementation
Author: tra
Date: Tue Feb 20 13:25:30 2018
New Revision: 325626
URL: http://llvm.org/viewvc/llvm-project?rev=325626&view=rev
Log:
[CUDA] Added missing __threadfence_system() function for CUDA9.
Modified:
cfe/trunk/lib/Headers/__clang_cuda_device_functions.h
Modified: cfe/trunk/lib/Headers/__
Author: tra
Date: Thu Feb 22 10:40:52 2018
New Revision: 325814
URL: http://llvm.org/viewvc/llvm-project?rev=325814&view=rev
Log:
[CUDA] Added missing functions.
Initial commit missed sincos(float), llabs() and few atomics that we
used to pull in from device_functions.hpp, which we no longer incl
Author: tra
Date: Thu Dec 21 15:52:09 2017
New Revision: 321326
URL: http://llvm.org/viewvc/llvm-project?rev=321326&view=rev
Log:
[CUDA] More fixes for __shfl_* intrinsics.
* __shfl_{up,down}* uses unsigned int for the third parameter.
* added [unsigned] long overloads for non-sync shuffles.
Dif
Author: tra
Date: Wed Jun 6 10:52:55 2018
New Revision: 334108
URL: http://llvm.org/viewvc/llvm-project?rev=334108&view=rev
Log:
[CUDA] Replace 'nv_weak' attributes in CUDA headers with 'weak'.
Differential Revision: https://reviews.llvm.org/D47804
Modified:
cfe/trunk/lib/Headers/__clang_cu
Author: tra
Date: Wed Jun 6 15:37:25 2018
New Revision: 334143
URL: http://llvm.org/viewvc/llvm-project?rev=334143&view=rev
Log:
[CUDA] Check initializers of instantiated template variables.
We were already performing checks on non-template variables,
but the checks on templated ones were missin
Author: tra
Date: Wed Jun 20 13:34:04 2018
New Revision: 335168
URL: http://llvm.org/viewvc/llvm-project?rev=335168&view=rev
Log:
[CUDA] Removed unused __nvvm_* builtins with non-generic pointers.
They were hot even hooked into CGBuiltin's machinery. Even if they were,
CUDA does not support AS-sp
Author: tra
Date: Fri Dec 7 14:20:53 2018
New Revision: 348662
URL: http://llvm.org/viewvc/llvm-project?rev=348662&view=rev
Log:
[CUDA] Added missing 'inline' for functions defined in a header.
Modified:
cfe/trunk/lib/Headers/cuda_wrappers/new
Modified: cfe/trunk/lib/Headers/cuda_wrappers/n
Author: tra
Date: Thu Apr 25 15:28:09 2019
New Revision: 359248
URL: http://llvm.org/viewvc/llvm-project?rev=359248&view=rev
Log:
[CUDA] Implemented _[bi]mma* builtins.
These builtins provide access to the new integer and
sub-integer variants of MMA (matrix multiply-accumulate) instructions
provi
Author: tra
Date: Thu May 2 15:37:19 2019
New Revision: 359838
URL: http://llvm.org/viewvc/llvm-project?rev=359838&view=rev
Log:
[CUDA] Do not pass deprecated option fo fatbinary
CUDA 10.1 tools deprecated some command line options.
fatbinary no longer needs --cuda.
Differential Revision: https
Author: tra
Date: Tue Sep 18 14:51:02 2018
New Revision: 342514
URL: http://llvm.org/viewvc/llvm-project?rev=342514&view=rev
Log:
[clang-tidy] Replace redundant checks with an assert().
findStyleKind is only called if D is an explicit identifier with a name,
so the checks for operators will never
Author: tra
Date: Fri Sep 21 10:29:33 2018
New Revision: 342749
URL: http://llvm.org/viewvc/llvm-project?rev=342749&view=rev
Log:
[CUDA] Ignore uncallable functions when we check for usual deallocators.
Previously clang considered function variants from both sides of
compilation and that resulted
Author: tra
Date: Fri Sep 21 10:46:28 2018
New Revision: 342752
URL: http://llvm.org/viewvc/llvm-project?rev=342752&view=rev
Log:
[CUDA] Fixed parsing of optional template-argument-list.
We need to consider all tokens that start with '>' when
we're checking for the end of an empty template argume
Author: tra
Date: Mon Sep 24 16:10:44 2018
New Revision: 342924
URL: http://llvm.org/viewvc/llvm-project?rev=342924&view=rev
Log:
[CUDA] Added basic support for compiling with CUDA-10.0
Modified:
cfe/trunk/include/clang/Basic/Cuda.h
cfe/trunk/lib/Basic/Cuda.cpp
cfe/trunk/lib/Basic/Tar
Author: tra
Date: Fri Oct 5 11:39:58 2018
New Revision: 343875
URL: http://llvm.org/viewvc/llvm-project?rev=343875&view=rev
Log:
[CUDA] Use all 64 bits of GUID in __nv_module_id
getGUID() returns an uint64_t and "%x" only prints 32 bits of it.
Use PRIx64 format string to print all 64 bits.
Diff
Author: tra
Date: Thu Jan 31 13:32:24 2019
New Revision: 352798
URL: http://llvm.org/viewvc/llvm-project?rev=352798&view=rev
Log:
[CUDA] Propagate detected version of CUDA to cc1
..and use it to control that parts of CUDA compilation
that depend on the specific version of CUDA SDK.
This patch ha
Author: tra
Date: Thu Jan 31 13:34:03 2019
New Revision: 352799
URL: http://llvm.org/viewvc/llvm-project?rev=352799&view=rev
Log:
[CUDA] add support for the new kernel launch API in CUDA-9.2+.
Instead of calling CUDA runtime to arrange function arguments,
the new API constructs arguments in a loc
Author: tra
Date: Tue Feb 5 14:38:58 2019
New Revision: 353232
URL: http://llvm.org/viewvc/llvm-project?rev=353232&view=rev
Log:
Basic CUDA-10 support.
Differential Revision: https://reviews.llvm.org/D57771
Modified:
cfe/trunk/include/clang/Basic/Cuda.h
cfe/trunk/lib/Basic/Cuda.cpp
Author: tra
Date: Thu Sep 7 11:14:32 2017
New Revision: 312734
URL: http://llvm.org/viewvc/llvm-project?rev=312734&view=rev
Log:
[CUDA] Added rudimentary support for CUDA-9 and sm_70.
For now CUDA-9 is not included in the list of CUDA versions clang
searches for, so the path to CUDA-9 must be ex
Author: tra
Date: Fri Sep 15 10:30:53 2017
New Revision: 313369
URL: http://llvm.org/viewvc/llvm-project?rev=313369&view=rev
Log:
[CUDA] Work around a new quirk in CUDA9 headers.
In CUDA-9 some of device-side math functions that we need are conditionally
defined within '#if _GLIBCXX_MATH_H'. We n
Author: tra
Date: Wed Sep 20 14:23:07 2017
New Revision: 313820
URL: http://llvm.org/viewvc/llvm-project?rev=313820&view=rev
Log:
[NVPTX] Implemented shfl.sync instruction and supporting intrinsics/builtins.
Differential Revision: https://reviews.llvm.org/D38090
Added:
cfe/trunk/test/CodeGen
Author: tra
Date: Thu Sep 21 11:44:49 2017
New Revision: 313898
URL: http://llvm.org/viewvc/llvm-project?rev=313898&view=rev
Log:
[NVPTX] Implemented bar.warp.sync, barrier.sync, and vote{.sync}
instructions/intrinsics/builtins.
Differential Revision: https://reviews.llvm.org/D38148
Modified:
Author: tra
Date: Thu Sep 21 11:46:39 2017
New Revision: 313899
URL: http://llvm.org/viewvc/llvm-project?rev=313899&view=rev
Log:
[CUDA] Fixed order of words in the names of shfl builtins.
Differential Revision: https://reviews.llvm.org/D38147
Modified:
cfe/trunk/lib/Headers/__clang_cuda_int
Author: tra
Date: Thu Dec 13 13:43:04 2018
New Revision: 349087
URL: http://llvm.org/viewvc/llvm-project?rev=349087&view=rev
Log:
[CUDA] Make all host-side shadows of device-side variables undef.
The host-side code can't (and should not) access the values that may
only exist on the device side. E
Author: tra
Date: Fri Dec 21 17:11:09 2018
New Revision: 349981
URL: http://llvm.org/viewvc/llvm-project?rev=349981&view=rev
Log:
[CUDA] Treat extern global variable shadows same as regular extern vars.
This fixes compiler crash when we attempted to compile this code:
extern __device__ int data;
Author: tra
Date: Wed Apr 18 11:33:43 2018
New Revision: 330280
URL: http://llvm.org/viewvc/llvm-project?rev=330280&view=rev
Log:
[CUDA] added missing __ldg(const signed char *)
Differential Revision: https://reviews.llvm.org/D45780
Modified:
cfe/trunk/lib/Headers/__clang_cuda_intrinsics.h
Author: tra
Date: Wed Apr 18 14:51:48 2018
New Revision: 330296
URL: http://llvm.org/viewvc/llvm-project?rev=330296&view=rev
Log:
[NVPTX, CUDA] Added support for m8n32k16 and m32n8k16 variants of wmma
instructions.
The new instructions were added added for sm_70+ GPUs in CUDA-9.1.
Differential
Author: tra
Date: Tue Apr 24 11:23:19 2018
New Revision: 330753
URL: http://llvm.org/viewvc/llvm-project?rev=330753&view=rev
Log:
[CUDA] Enable CUDA compilation with CUDA-9.2
Differential Revision: https://reviews.llvm.org/D45827
Modified:
cfe/trunk/include/clang/Basic/Cuda.h
cfe/trunk/l
Author: tra
Date: Wed Jun 27 11:32:51 2018
New Revision: 335763
URL: http://llvm.org/viewvc/llvm-project?rev=335763&view=rev
Log:
[CUDA] Use atexit() to call module destructor.
This matches the way NVCC does it. Doing module cleanup at global
destructor phase used to work, but is, apparently, too
FYI,
This commit breaks clang tests. It appears that StaticAnalysisTests misses
dependency on clangFrontend.
--Artem
[60/134] Linking CXX executable
tools/clang/unittests/StaticAnalyzer/StaticAnalysisTests
FAILED: tools/clang/unittests/StaticAnalyzer/StaticAnalysisTests
: && /usr/local/google/ho
Author: tra
Date: Thu Jun 28 10:15:52 2018
New Revision: 335880
URL: http://llvm.org/viewvc/llvm-project?rev=335880&view=rev
Log:
[CUDA] Place all CUDA sections in __NV_CUDA segment on Mac.
That's where CUDA binaries appear to put them.
Differential Revision: https://reviews.llvm.org/D48615
Mod
uot;
> > $ "c:\src\llvm_package_64c23127\build32_stage0\bin\filecheck.exe"
> > "-check-prefix=GS-default"
> > "C:\src\llvm_package_64c23127\llvm-project\clang\test\Driver\
> cl-options.cu"
> > # command stderr:
> &
Author: Artem Belevich
Date: 2020-04-08T11:19:45-07:00
New Revision: d2e498b1725dd7b792c061387ff76df71cd7728a
URL:
https://github.com/llvm/llvm-project/commit/d2e498b1725dd7b792c061387ff76df71cd7728a
DIFF:
https://github.com/llvm/llvm-project/commit/d2e498b1725dd7b792c061387ff76df71cd7728a.diff
Author: Artem Belevich
Date: 2020-04-08T11:19:44-07:00
New Revision: a9627b7ea7e2b47488188cb1d80a23b24a8066b2
URL:
https://github.com/llvm/llvm-project/commit/a9627b7ea7e2b47488188cb1d80a23b24a8066b2
DIFF:
https://github.com/llvm/llvm-project/commit/a9627b7ea7e2b47488188cb1d80a23b24a8066b2.diff
Author: Artem Belevich
Date: 2020-04-08T11:19:43-07:00
New Revision: 33386b20aa82c73e28b871cfa35c89a3808c9f92
URL:
https://github.com/llvm/llvm-project/commit/33386b20aa82c73e28b871cfa35c89a3808c9f92
DIFF:
https://github.com/llvm/llvm-project/commit/33386b20aa82c73e28b871cfa35c89a3808c9f92.diff
Author: ADRA
Date: 2020-04-09T13:08:41-07:00
New Revision: 6ed88afd780cc2cd04e50e25c5d3ffafc07b1c1b
URL:
https://github.com/llvm/llvm-project/commit/6ed88afd780cc2cd04e50e25c5d3ffafc07b1c1b
DIFF:
https://github.com/llvm/llvm-project/commit/6ed88afd780cc2cd04e50e25c5d3ffafc07b1c1b.diff
LOG: [CU
Author: Artem Belevich
Date: 2020-04-13T15:54:12-07:00
New Revision: 8c635ba4a84f8b5dbde1984fe2d7eeefaf827ffe
URL:
https://github.com/llvm/llvm-project/commit/8c635ba4a84f8b5dbde1984fe2d7eeefaf827ffe
DIFF:
https://github.com/llvm/llvm-project/commit/8c635ba4a84f8b5dbde1984fe2d7eeefaf827ffe.diff
Author: Artem Belevich
Date: 2020-06-15T12:54:44-07:00
New Revision: d700237f1aa1bc05d584a0f50fdad89370e17987
URL:
https://github.com/llvm/llvm-project/commit/d700237f1aa1bc05d584a0f50fdad89370e17987
DIFF:
https://github.com/llvm/llvm-project/commit/d700237f1aa1bc05d584a0f50fdad89370e17987.diff
Author: Artem Belevich
Date: 2020-06-17T15:22:45-07:00
New Revision: ac20150e299a41ade860f432741c1b8557ac8058
URL:
https://github.com/llvm/llvm-project/commit/ac20150e299a41ade860f432741c1b8557ac8058
DIFF:
https://github.com/llvm/llvm-project/commit/ac20150e299a41ade860f432741c1b8557ac8058.diff
Author: Raul Tambre
Date: 2020-04-17T12:56:13-07:00
New Revision: 8e20516540444618ad32dd11e835c05804053697
URL:
https://github.com/llvm/llvm-project/commit/8e20516540444618ad32dd11e835c05804053697
DIFF:
https://github.com/llvm/llvm-project/commit/8e20516540444618ad32dd11e835c05804053697.diff
L
Author: Artem Belevich
Date: 2020-08-05T13:13:48-07:00
New Revision: 7d057efddc00ba7d03e6e684f23dd9b09fbd0527
URL:
https://github.com/llvm/llvm-project/commit/7d057efddc00ba7d03e6e684f23dd9b09fbd0527
DIFF:
https://github.com/llvm/llvm-project/commit/7d057efddc00ba7d03e6e684f23dd9b09fbd0527.diff
Author: Artem Belevich
Date: 2020-08-10T14:27:19-07:00
New Revision: 9c8ae40860311e94de0a898101818f706228e958
URL:
https://github.com/llvm/llvm-project/commit/9c8ae40860311e94de0a898101818f706228e958
DIFF:
https://github.com/llvm/llvm-project/commit/9c8ae40860311e94de0a898101818f706228e958.diff
Author: Artem Belevich
Date: 2020-08-11T11:52:40-07:00
New Revision: ec5f793996f4dc86d339db88c0836e0cf4e8abea
URL:
https://github.com/llvm/llvm-project/commit/ec5f793996f4dc86d339db88c0836e0cf4e8abea
DIFF:
https://github.com/llvm/llvm-project/commit/ec5f793996f4dc86d339db88c0836e0cf4e8abea.diff
Author: Artem Belevich
Date: 2020-05-05T14:07:31-07:00
New Revision: bf6a26b066382e0f41bf023c781d84061c542307
URL:
https://github.com/llvm/llvm-project/commit/bf6a26b066382e0f41bf023c781d84061c542307
DIFF:
https://github.com/llvm/llvm-project/commit/bf6a26b066382e0f41bf023c781d84061c542307.diff
Author: Artem Belevich
Date: 2020-05-05T15:43:32-07:00
New Revision: 844096b996a0b17d3f380af323614a5fe31b8a68
URL:
https://github.com/llvm/llvm-project/commit/844096b996a0b17d3f380af323614a5fe31b8a68
DIFF:
https://github.com/llvm/llvm-project/commit/844096b996a0b17d3f380af323614a5fe31b8a68.diff
Author: Artem Belevich
Date: 2020-05-06T14:24:21-07:00
New Revision: 314f99e7d42ded663386190a54b5831dc4a6f3c1
URL:
https://github.com/llvm/llvm-project/commit/314f99e7d42ded663386190a54b5831dc4a6f3c1
DIFF:
https://github.com/llvm/llvm-project/commit/314f99e7d42ded663386190a54b5831dc4a6f3c1.diff
Author: Artem Belevich
Date: 2020-05-18T12:22:55-07:00
New Revision: ef649e8fd5d1748764a9afca3ce0b80113a6a239
URL:
https://github.com/llvm/llvm-project/commit/ef649e8fd5d1748764a9afca3ce0b80113a6a239
DIFF:
https://github.com/llvm/llvm-project/commit/ef649e8fd5d1748764a9afca3ce0b80113a6a239.diff
tra created this revision.
tra added a reviewer: jlebar.
tra added a subscriber: cfe-commits.
Herald added subscribers: jlebar, jholewinski.
https://reviews.llvm.org/D24944
Files:
include/clang/Basic/BuiltinsNVPTX.def
lib/Basic/Targets.cpp
lib/CodeGen/CGBuiltin.cpp
test/CodeGen/builtins-n
tra created this revision.
tra added a reviewer: jlebar.
tra added a subscriber: cfe-commits.
Herald added a subscriber: jlebar.
https://reviews.llvm.org/D24946
Files:
lib/Driver/ToolChains.cpp
lib/Headers/__clang_cuda_runtime_wrapper.h
test/Driver/Inputs/CUDA/usr/local/cuda/nvvm/libdevice
tra added a comment.
LGTM. Should we add new pragma description to docs/LanguageExtensions.rst ?
https://reviews.llvm.org/D24975
___
cfe-commits mailing list
cfe-commits@lists.llvm.org
http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits
tra accepted this revision.
tra added a comment.
This revision is now accepted and ready to land.
That is way too much knowledge about details of standard library implementation.
If it changes, I suspect users will end up with a rather uninformative error.
Is there a way to produce somewhat more s
tra added a subscriber: echristo.
tra added a comment.
This looks like fix-includes and it may be somewhat shaky if users start
messing with include paths. You may want to get @echristo's input on that. I'
personally would prefer to force-include these files. I suspect it will not
change things
tra added a comment.
OK.
https://reviews.llvm.org/D24977
___
cfe-commits mailing list
cfe-commits@lists.llvm.org
http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits
tra marked an inline comment as done.
Comment at: lib/Headers/__clang_cuda_runtime_wrapper.h:156
@@ +155,3 @@
+#endif
+#endif
+
jlebar wrote:
> I don't understand what we are doing here...
>
> We're saying, if __USE_FAST_MATH__ is defined, and if it's not equal t
tra updated this revision to Diff 72707.
tra added a comment.
addressed Justin's comments.
https://reviews.llvm.org/D24946
Files:
lib/Driver/ToolChains.cpp
lib/Headers/__clang_cuda_runtime_wrapper.h
test/Driver/Inputs/CUDA/usr/local/cuda/nvvm/libdevice/libdevice.compute_20.10.bc
test/
Author: tra
Date: Wed Sep 28 12:47:35 2016
New Revision: 282609
URL: http://llvm.org/viewvc/llvm-project?rev=282609&view=rev
Log:
[CUDA] added __nvvm_atom_{sys|cta}_* builtins.
These builtins are available on sm_60+ GPU only.
Differential Revision: https://reviews.llvm.org/D24944
Modified:
Author: tra
Date: Wed Sep 28 12:47:40 2016
New Revision: 282610
URL: http://llvm.org/viewvc/llvm-project?rev=282610&view=rev
Log:
[CUDA] Added support for CUDA-8
Differential Revision: https://reviews.llvm.org/D24946
Added:
cfe/trunk/test/Driver/Inputs/CUDA/usr/local/cuda/nvvm/libdevice/lib
This revision was automatically updated to reflect the committed changes.
Closed by commit rL282609: [CUDA] added __nvvm_atom_{sys|cta}_* builtins.
(authored by tra).
Changed prior to commit:
https://reviews.llvm.org/D24944?vs=72584&id=72862#toc
Repository:
rL LLVM
https://reviews.llvm.org/
This revision was automatically updated to reflect the committed changes.
Closed by commit rL282610: [CUDA] Added support for CUDA-8 (authored by tra).
Changed prior to commit:
https://reviews.llvm.org/D24946?vs=72707&id=72863#toc
Repository:
rL LLVM
https://reviews.llvm.org/D24946
Files:
tra accepted this revision.
tra added a comment.
This revision is now accepted and ready to land.
One question, LGTM otherwise.
Comment at: clang/lib/Sema/SemaExprCXX.cpp:688
@@ +687,3 @@
+ if (getLangOpts().CUDA)
+CheckCUDAExceptionExpr(OpLoc, "throw");
+
tra accepted this revision.
tra added a reviewer: tra.
tra added a comment.
This revision is now accepted and ready to land.
LGTM.
https://reviews.llvm.org/D25125
___
cfe-commits mailing list
cfe-commits@lists.llvm.org
http://lists.llvm.org/cgi-bin/
tra accepted this revision.
tra added a comment.
This revision is now accepted and ready to land.
LGTM.
> DiagnosticSemaKinds.td:6727
> def err_cuda_extern_shared : Error<"__shared__ variable %0 cannot be
> 'extern'">;
> +def err_cuda_nonglobal_constant : Error<"__constant__ variables must be
tra added inline comments.
> jlebar wrote in DiagnosticSemaKinds.td:6727
> That's still a "global variable"? Or do you think calling it such will be
> confusing?
It's not clear whether you mean global storage class or global namespace.
The code checks for global storage, but error message is c
Author: tra
Date: Tue Oct 11 21:08:08 2016
New Revision: 283964
URL: http://llvm.org/viewvc/llvm-project?rev=283964&view=rev
Log:
Added REQUIRED triples to the test that fails on some ARM buildbots.
Modified:
cfe/trunk/test/SemaCUDA/function-overload-hd.cu
Modified: cfe/trunk/test/SemaCUDA/f
Author: tra
Date: Tue Nov 28 10:51:42 2017
New Revision: 319201
URL: http://llvm.org/viewvc/llvm-project?rev=319201&view=rev
Log:
[CUDA] Report "unsupported VLA" errors only on device side.
This fixes erroneously reported CUDA compilation errors
in host-side code during device-side compilation.
Author: tra
Date: Thu Nov 30 14:22:21 2017
New Revision: 319485
URL: http://llvm.org/viewvc/llvm-project?rev=319485&view=rev
Log:
[CUDA] Tweak CUDA wrappers to make cuda-9 work with libc++
CUDA-9 headers check for specific libc++ version and ifdef out
some of the definitions we need if LIBCPP_VER
Author: tra
Date: Wed Dec 6 09:40:35 2017
New Revision: 319908
URL: http://llvm.org/viewvc/llvm-project?rev=319908&view=rev
Log:
[CUDA] Added overloads for '[unsigned] long' variants of shfl builtins.
Differential Revision: https://reviews.llvm.org/D40871
Modified:
cfe/trunk/lib/Headers/__c
Author: tra
Date: Wed Dec 6 09:50:05 2017
New Revision: 319909
URL: http://llvm.org/viewvc/llvm-project?rev=319909&view=rev
Log:
[NVPTX,CUDA] Added llvm.nvvm.fns intrinsic and matching __nvvm_fns builtin in
clang.
Differential Revision: https://reviews.llvm.org/D40872
Modified:
cfe/trunk/
Author: Artem Belevich
Date: 2022-09-01T15:52:36-07:00
New Revision: 54c47ff9398fbd5fa7e4120b3286adfb4f736ec8
URL:
https://github.com/llvm/llvm-project/commit/54c47ff9398fbd5fa7e4120b3286adfb4f736ec8
DIFF:
https://github.com/llvm/llvm-project/commit/54c47ff9398fbd5fa7e4120b3286adfb4f736ec8.diff
@@ -95,7 +95,7 @@ using namespace llvm;
static cl::opt
LowerCtorDtor("nvptx-lower-global-ctor-dtor",
cl::desc("Lower GPU ctor / dtors to globals on the device."),
- cl::init(false), cl::Hidden);
+ cl::init(true), cl::Hidde
@@ -95,7 +95,7 @@ using namespace llvm;
static cl::opt
LowerCtorDtor("nvptx-lower-global-ctor-dtor",
cl::desc("Lower GPU ctor / dtors to globals on the device."),
- cl::init(false), cl::Hidden);
+ cl::init(true), cl::Hidde
Artem-B wrote:
Now that we're making an even larger class of functions implicitly HD, the last
logical step would be to make *all* unattributed functions implicitly HD, too
(in a separate patch). After all, a template is as GPU-portable (or not) as a
regular function. Unlike constexpr or compi
https://github.com/Artem-B approved this pull request.
https://github.com/llvm/llvm-project/pull/70369
___
cfe-commits mailing list
cfe-commits@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits
Artem-B wrote:
Would it be feasible to consider switching to the new offloading driver mode
and really link with the library instead? It may be a conveniently isolated use
case with little/no existing users that would disrupt.
https://github.com/llvm/llvm-project/pull/71978
__
Artem-B wrote:
> I don't think we're in a position to actually enable that at this time. We
> still don't have everything necessary to provide object linking, which this
> seems to rely on
OK. IR it is.
https://github.com/llvm/llvm-project/pull/71978
__
@@ -70,6 +70,9 @@ __DEVICE__ double floor(double);
__DEVICE__ float floor(float);
__DEVICE__ double fma(double, double, double);
__DEVICE__ float fma(float, float, float);
+#ifdef _MSC_VER
+__DEVICE__ long double fma(long double, long double, long double);
Arte
@@ -104,3 +106,14 @@ void fun() {
(void) b;
(void) var_host_only;
}
+
+extern __global__ void external_func();
+extern void* const external_dep[] = {
+ (void*)(external_func)
+};
+extern void* const external_arr[] = {};
+
+void* host_fun() {
+ (void) external_dep;
+ (voi
@@ -104,3 +106,14 @@ void fun() {
(void) b;
(void) var_host_only;
}
+
+extern __global__ void external_func();
+extern void* const external_dep[] = {
Artem-B wrote:
This array is nomiannly host-only entity and should not be emitted on GPU at
all, IMO. In
@@ -70,6 +70,9 @@ __DEVICE__ double floor(double);
__DEVICE__ float floor(float);
__DEVICE__ double fma(double, double, double);
__DEVICE__ float fma(float, float, float);
+#ifdef _MSC_VER
+__DEVICE__ long double fma(long double, long double, long double);
Arte
@@ -70,6 +70,9 @@ __DEVICE__ double floor(double);
__DEVICE__ float floor(float);
__DEVICE__ double fma(double, double, double);
__DEVICE__ float fma(float, float, float);
+#ifdef _MSC_VER
+__DEVICE__ long double fma(long double, long double, long double);
Arte
Artem-B wrote:
I'm not familiar enough with MSVC.
@rnk -- what's the best way to check for compilation with microsoft's stardard
C++ library?
https://github.com/llvm/llvm-project/pull/73756
___
cfe-commits mailing list
cfe-commits@lists.llvm.org
htt
https://github.com/Artem-B created
https://github.com/llvm/llvm-project/pull/74895
None
>From 3ce8e08b94e33480139e13ca9f0fd7b719ff2c3d Mon Sep 17 00:00:00 2001
From: Artem Belevich
Date: Wed, 6 Dec 2023 12:11:38 -0800
Subject: [PATCH] [CUDA] Add support for CUDA-12.3 and sm_90a
---
clang/doc
Artem-B wrote:
@ezhulenev FYI.
https://github.com/llvm/llvm-project/pull/74895
___
cfe-commits mailing list
cfe-commits@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits
https://github.com/Artem-B edited
https://github.com/llvm/llvm-project/pull/74895
___
cfe-commits mailing list
cfe-commits@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits
@@ -80,8 +85,10 @@ class NVPTXSubtarget : public NVPTXGenSubtargetInfo {
bool allowFP16Math() const;
bool hasMaskOperator() const { return PTXVersion >= 71; }
bool hasNoReturn() const { return SmVersion >= 30 && PTXVersion >= 64; }
- unsigned int getSmVersion() const { r
https://github.com/Artem-B edited
https://github.com/llvm/llvm-project/pull/74895
___
cfe-commits mailing list
cfe-commits@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits
https://github.com/Artem-B updated
https://github.com/llvm/llvm-project/pull/74895
>From 3ce8e08b94e33480139e13ca9f0fd7b719ff2c3d Mon Sep 17 00:00:00 2001
From: Artem Belevich
Date: Wed, 6 Dec 2023 12:11:38 -0800
Subject: [PATCH 1/2] [CUDA] Add support for CUDA-12.3 and sm_90a
---
clang/docs/
@@ -80,8 +85,10 @@ class NVPTXSubtarget : public NVPTXGenSubtargetInfo {
bool allowFP16Math() const;
bool hasMaskOperator() const { return PTXVersion >= 71; }
bool hasNoReturn() const { return SmVersion >= 30 && PTXVersion >= 64; }
- unsigned int getSmVersion() const { r
https://github.com/Artem-B updated
https://github.com/llvm/llvm-project/pull/74895
>From 3ce8e08b94e33480139e13ca9f0fd7b719ff2c3d Mon Sep 17 00:00:00 2001
From: Artem Belevich
Date: Wed, 6 Dec 2023 12:11:38 -0800
Subject: [PATCH 1/3] [CUDA] Add support for CUDA-12.3 and sm_90a
---
clang/docs/
https://github.com/Artem-B updated
https://github.com/llvm/llvm-project/pull/74895
>From 3ce8e08b94e33480139e13ca9f0fd7b719ff2c3d Mon Sep 17 00:00:00 2001
From: Artem Belevich
Date: Wed, 6 Dec 2023 12:11:38 -0800
Subject: [PATCH 1/3] [CUDA] Add support for CUDA-12.3 and sm_90a
---
clang/docs/
Artem-B wrote:
What are the next steps needed to move this patch forward?
It would be great if we could land it, as we already have real-world need for
lowering fminimum/fmaximum on targets that do not have native support for it.
https://github.com/llvm/llvm-project/pull/67301
___
1 - 100 of 1193 matches
Mail list logo