https://github.com/sergey-kozub created
https://github.com/llvm/llvm-project/pull/97402
Adds support for sm_100 (Blackwell), similar to
https://github.com/llvm/llvm-project/pull/74895
One important aspect is that `sm_100` is not compatible with `sm_90a`, only
with `sm_90` - note the defines i
sergey-kozub wrote:
@Artem-B, @andportnoy, you might want to take a look
https://github.com/llvm/llvm-project/pull/97402
___
cfe-commits mailing list
cfe-commits@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits
https://github.com/sergey-kozub converted_to_draft
https://github.com/llvm/llvm-project/pull/97402
https://github.com/sergey-kozub updated
https://github.com/llvm/llvm-project/pull/97402
>From 02e1acff6ffa1ddc3a26b0edc7e89923ac38978f Mon Sep 17 00:00:00 2001
From: Sergey Kozub
Date: Tue, 2 Jul 2024 02:44:56 -0700
Subject: [PATCH] [CUDA] Add support for CUDA-12.6 and sm_100
---
clang/docs/R
https://github.com/sergey-kozub closed
https://github.com/llvm/llvm-project/pull/97402
sergey-kozub wrote:
This PR is redundant, closing.
https://github.com/llvm/llvm-project/pull/97402
https://github.com/sergey-kozub created
https://github.com/llvm/llvm-project/pull/102969
PTX ISA 8.1 supports FP8 conversions:
https://docs.nvidia.com/cuda/parallel-thread-execution/#data-movement-and-conversion-instructions-cvt
This PR adds support for:
- cvt.rn.satfinite{.relu}.f8x2type.
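For readers unfamiliar with the fp8 formats named above, here is a minimal C++ sketch of what an e4m3 byte represents and what the `.satfinite` and `.relu` modifiers imply for out-of-range or negative inputs. It is not the PR's code: `decodeE4M3`, `clampForE4M3`, and `kE4M3Max` are hypothetical names, the bit layout assumed is the usual e4m3 encoding (1 sign bit, 4 exponent bits with bias 7, 3 mantissa bits, no infinities, a single NaN pattern), and the clamp is only an approximation of the modifier semantics applied before rounding.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>
#include <limits>

// Largest finite e4m3 magnitude: exponent field 15, mantissa 6 -> 1.75 * 2^8.
// (Assumed layout: 1 sign bit, 4 exponent bits with bias 7, 3 mantissa bits.)
constexpr float kE4M3Max = 448.0f;

// Hypothetical helper: decodes a single e4m3 byte into a float.
float decodeE4M3(std::uint8_t bits) {
  const float sign = (bits & 0x80) ? -1.0f : 1.0f;
  const int exponent = (bits >> 3) & 0xF;
  const int mantissa = bits & 0x7;
  // In this layout the all-ones pattern is the only NaN; there are no infinities.
  if (exponent == 0xF && mantissa == 0x7)
    return std::numeric_limits<float>::quiet_NaN();
  // Subnormal values: (mantissa / 8) * 2^-6.
  if (exponent == 0)
    return sign * std::ldexp(mantissa / 8.0f, -6);
  // Normal values: (1 + mantissa / 8) * 2^(exponent - 7).
  return sign * std::ldexp(1.0f + mantissa / 8.0f, exponent - 7);
}

// Hypothetical helper: approximates the range handling of .satfinite
// (clamp to the largest finite magnitude) and .relu (negative -> 0),
// applied to the input before it would be rounded to e4m3.
float clampForE4M3(float x, bool relu) {
  if (std::isnan(x))
    return x;
  if (relu && x < 0.0f)
    return 0.0f;
  return std::fmin(std::fmax(x, -kE4M3Max), kE4M3Max);
}
```

Under these assumptions, `decodeE4M3(0x38)` yields 1.0 and `clampForE4M3(1000.0f, false)` yields 448.0, the largest finite e4m3 value.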
https://github.com/sergey-kozub updated
https://github.com/llvm/llvm-project/pull/102969
>From 72b9a5ff64807bf4722a7168e1210f849bef7071 Mon Sep 17 00:00:00 2001
From: Sergey Kozub
Date: Mon, 12 Aug 2024 12:52:01 -0700
Subject: [PATCH] [NVPTX] Add conversion intrinsics from/to fp8 types (e4m3,
https://github.com/sergey-kozub updated
https://github.com/llvm/llvm-project/pull/102969
>From a696f131f97370bd8b9ec264d27555c6ace4d027 Mon Sep 17 00:00:00 2001
From: Sergey Kozub
Date: Mon, 12 Aug 2024 12:52:01 -0700
Subject: [PATCH] [NVPTX] Add conversion intrinsics from/to fp8 types (e4m3,
https://github.com/sergey-kozub updated
https://github.com/llvm/llvm-project/pull/102969
>From 7db2478f59c5a4f46df040ed4799da815b28bc43 Mon Sep 17 00:00:00 2001
From: Sergey Kozub
Date: Mon, 12 Aug 2024 12:52:01 -0700
Subject: [PATCH] [NVPTX] Add conversion intrinsics from/to fp8 types (e4m3,
@@ -722,6 +722,37 @@ let hasSideEffects = false in {
   defm CVT_f16x2 : CVT_FROM_FLOAT_V2_SM80<"f16x2", Int32Regs>;
   defm CVT_bf16x2 : CVT_FROM_FLOAT_V2_SM80<"bf16x2", Int32Regs>;
+
+  // FP8 conversions.
+  multiclass CVT_TO_F8X2 {
+    def _f32 :
+      NVPTXInst<(outs Int1
https://github.com/sergey-kozub updated
https://github.com/llvm/llvm-project/pull/102969
>From ee028cccb41964223b87edb1db88710bac89080a Mon Sep 17 00:00:00 2001
From: Sergey Kozub
Date: Mon, 12 Aug 2024 12:52:01 -0700
Subject: [PATCH] [NVPTX] Add conversion intrinsics from/to fp8 types (e4m3,
https://github.com/sergey-kozub updated
https://github.com/llvm/llvm-project/pull/102969
>From e74a0de37302baaf89bfe3230f561684ec5777db Mon Sep 17 00:00:00 2001
From: Sergey Kozub
Date: Mon, 12 Aug 2024 12:52:01 -0700
Subject: [PATCH] [NVPTX] Add conversion intrinsics from/to fp8 types (e4m3,
https://github.com/sergey-kozub updated
https://github.com/llvm/llvm-project/pull/102969
>From b74c8bc6009fb2f905089345594b13c8bc75ca36 Mon Sep 17 00:00:00 2001
From: Sergey Kozub
Date: Mon, 12 Aug 2024 12:52:01 -0700
Subject: [PATCH] [NVPTX] Add conversion intrinsics from/to fp8 types (e4m3,
sergey-kozub wrote:
What do I need to do to push this through? I'm still seeing the "1 workflow awaiting
approval" message.
Should I add other reviewers who are authorized to run workflows?
https://github.com/llvm/llvm-project/pull/102969
sergey-kozub wrote:
> Looks like it was the clang-format check that GitHub was waiting on an approval
> for. I've just clicked that button.
Now it's all green, thank you.
How do I actually upstream it? I don't see any buttons on this PR page that'd
allow me to submit it.
https://github.com/llvm
sergey-kozub wrote:
Thanks for pushing this. I'll continue by adding support to XLA for using
these instructions.
https://github.com/llvm/llvm-project/pull/102969
https://github.com/sergey-kozub closed
https://github.com/llvm/llvm-project/pull/123398
https://github.com/sergey-kozub closed
https://github.com/llvm/llvm-project/pull/123898
https://github.com/sergey-kozub updated
https://github.com/llvm/llvm-project/pull/123898
>From 1c4a581d45b622591f5062830f2ff1e33b159a64 Mon Sep 17 00:00:00 2001
From: Sergey Kozub
Date: Wed, 22 Jan 2025 07:38:55 +
Subject: [PATCH] Remove incorrect CUDA defines
---
clang/include/clang/Basi
@@ -89,12 +89,8 @@ CudaVersion getCudaVersion(uint32_t raw_version) {
     return CudaVersion::CUDA_125;
   if (raw_version < 12070)
     return CudaVersion::CUDA_126;
-  if (raw_version < 12080)
-    return CudaVersion::CUDA_127;
   if (raw_version < 12090)
     return CudaVersio
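The threshold chain in this hunk can be read as follows, in a minimal standalone C++ sketch. It assumes the raw version is encoded as major * 1000 + minor * 10 (so 12.8 is 12080). The enum and the function name `getCudaVersionSketch` are hypothetical stand-ins, not clang's definitions, and the `CUDA_128` result is an assumption, since the enumerator on the last line of the hunk is cut off.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical, trimmed-down stand-in for clang's CudaVersion enum.
enum class CudaVersion { CUDA_125, CUDA_126, CUDA_128, NEW };

// Maps a raw version (major * 1000 + minor * 10) to an enumerator using
// "first threshold that exceeds the raw value" logic, as in the hunk.
CudaVersion getCudaVersionSketch(std::uint32_t raw_version) {
  // Simplification: everything below 12.6 is folded into CUDA_125 here.
  if (raw_version < 12060)
    return CudaVersion::CUDA_125;
  if (raw_version < 12070)
    return CudaVersion::CUDA_126;
  // With the 12.7 branch removed, raw values 12070..12089 fall through to
  // the next threshold. CUDA_128 is an assumption; the source is truncated.
  if (raw_version < 12090)
    return CudaVersion::CUDA_128;
  return CudaVersion::NEW;
}
```

In this sketch a raw value of 12070 is no longer reported as a separate 12.7 release; it lands in the same bucket as 12080.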
https://github.com/sergey-kozub updated
https://github.com/llvm/llvm-project/pull/123398
>From dc43fbfbd29c1a088b8261cc2bfc7f6f7e5c7c2f Mon Sep 17 00:00:00 2001
From: Sergey Kozub
Date: Fri, 17 Jan 2025 21:00:49 +
Subject: [PATCH] Add support for PTX 8.6 and CUDA 12.6 (12.8)
---
clang/inc
https://github.com/sergey-kozub edited
https://github.com/llvm/llvm-project/pull/123398
https://github.com/sergey-kozub updated
https://github.com/llvm/llvm-project/pull/123398
>From 04313ede76d272ec391361b9828e55d8a27b4bda Mon Sep 17 00:00:00 2001
From: Sergey Kozub
Date: Fri, 17 Jan 2025 21:00:49 +
Subject: [PATCH] Add support for PTX 8.6 and CUDA 12.6 (12.8)
---
clang/inc
@@ -682,6 +688,9 @@ void NVPTX::getNVPTXTargetFeatures(const Driver &D, const llvm::Triple &Triple,
   case CudaVersion::CUDA_##CUDA_VER:                                           \
     PtxFeature = "+ptx" #PTX_VER;                                              \
     break;
+
https://github.com/sergey-kozub updated
https://github.com/llvm/llvm-project/pull/123398
>From 2f909002b89628e2bb83391b2287aa00a7ecaaf3 Mon Sep 17 00:00:00 2001
From: Sergey Kozub
Date: Fri, 17 Jan 2025 21:00:49 +
Subject: [PATCH] Add support for PTX 8.6 and CUDA 12.6 (12.8)
---
clang/inc
https://github.com/sergey-kozub edited
https://github.com/llvm/llvm-project/pull/123398
https://github.com/sergey-kozub created
https://github.com/llvm/llvm-project/pull/123398
CUDA 12.8 supports PTX 8.6, which enables the "sm_100a" architecture (supports
Blackwell-specific instructions).
CUDA 12.7 technically does not exist; map it to PTX 8.5 (same as 12.6).
>From 92e4b10e940e9c08606
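The mapping described in this PR summary can be written as a small lookup, shown here as a hedged C++ sketch. The function name `ptxFeatureForCuda` and the empty-string fallback are hypothetical. Only the three version pairs stated above are encoded, and the "+ptxNN" string shape follows the `PtxFeature = "+ptx" #PTX_VER` pattern visible in the patch.

```cpp
#include <cassert>
#include <string>

// Hypothetical helper: returns the PTX feature string for a CUDA release,
// covering only the releases mentioned in the PR description.
std::string ptxFeatureForCuda(int major, int minor) {
  if (major != 12)
    return "";  // Not covered by this sketch.
  switch (minor) {
  case 6:
  case 7:  // 12.7 was never released; treated the same as 12.6.
    return "+ptx85";
  case 8:
    return "+ptx86";
  default:
    return "";  // Not covered by this sketch.
  }
}
```

For example, `ptxFeatureForCuda(12, 8)` returns "+ptx86", while both 12.6 and 12.7 return "+ptx85".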
sergey-kozub wrote:
It's not clear from the logs why the "buildkite" checks have failed. Running
the "check-all" target locally succeeds.
Please advise.
https://github.com/llvm/llvm-project/pull/123398
sergey-kozub wrote:
This is now resolved by fixing the NVPTX test.
https://github.com/llvm/llvm-project/pull/123398