[llvm-branch-commits] [llvm] release/19.x: [LICM] allow MemoryAccess creation failure (#116813) (PR #117082)

2024-11-21 Thread Nikita Popov via llvm-branch-commits
https://github.com/nikic approved this pull request. LGTM https://github.com/llvm/llvm-project/pull/117082 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [llvm] release/19.x: [LICM] allow MemoryAccess creation failure (#116813) (PR #117082)

2024-11-21 Thread via llvm-branch-commits
https://github.com/DianQK updated https://github.com/llvm/llvm-project/pull/117082 >From d7c9977e092ee48d8bee2a2787af0d23b75cfee5 Mon Sep 17 00:00:00 2001 From: DianQK Date: Wed, 20 Nov 2024 19:52:51 +0800 Subject: [PATCH] [LICM] allow MemoryAccess creation failure (#116813) Fixes #116809. Af

[llvm-branch-commits] [llvm] release/19.x: [LICM] allow MemoryAccess creation failure (#116813) (PR #117082)

2024-11-21 Thread via llvm-branch-commits
@@ -192,6 +192,12 @@ class MemorySSAUpdater { const BasicBlock *BB, MemorySSA::InsertionPlace Point); + MemoryAccess *createMemoryAccessInBB2(Instruction *I, +

[llvm-branch-commits] [llvm] release/19.x: [LICM] allow MemoryAccess creation failure (#116813) (PR #117082)

2024-11-21 Thread Nikita Popov via llvm-branch-commits
@@ -192,6 +192,12 @@ class MemorySSAUpdater { const BasicBlock *BB, MemorySSA::InsertionPlace Point); + MemoryAccess *createMemoryAccessInBB2(Instruction *I, +

[llvm-branch-commits] [flang] [MLIR][OpenMP] Add Lowering support for OpenMP Declare Mapper directive (PR #117046)

2024-11-21 Thread Tom Eccles via llvm-branch-commits
@@ -21,7 +21,7 @@ subroutine declare_mapper_1 type (my_type2):: t real :: x, y(nvals) !$omp declare mapper (my_type :: var) map (var, var%values (1:var%num_vals)) -!CHECK: not yet implemented: OpenMPDeclareMapperConstruct +!CHECK: not yet impleme

[llvm-branch-commits] [flang] [MLIR][OpenMP] Add Lowering support for OpenMP Declare Mapper directive (PR #117046)

2024-11-21 Thread Tom Eccles via llvm-branch-commits
@@ -2701,7 +2702,39 @@ static void genOMP(lower::AbstractConverter &converter, lower::SymMap &symTable, semantics::SemanticsContext &semaCtx, lower::pft::Evaluation &eval, const parser::OpenMPDeclareMapperConstruct &declareMapperConstruct) { - TODO(converter.getC

[llvm-branch-commits] [llvm] release/19.x: [SCEV] Fix sext handling for `getConstantMultiple` (#117093) (PR #117136)

2024-11-21 Thread via llvm-branch-commits
llvmbot wrote: @llvm/pr-subscribers-llvm-transforms Author: None (llvmbot) Changes Backport 458dfbd855806461b4508bf8845cafe0411dbfd4 Requested by: @dtcxzyw --- Full diff: https://github.com/llvm/llvm-project/pull/117136.diff 3 Files Affected: - (modified) llvm/lib/Analysis/ScalarEvolu

[llvm-branch-commits] [llvm] release/19.x: [ConstraintElim] Bail out on non-dedicated exits when adding exiting conditions (#116627) (PR #117137)

2024-11-21 Thread via llvm-branch-commits
llvmbot wrote: @llvm/pr-subscribers-llvm-transforms Author: None (llvmbot) Changes Backport 52361d0368b79841be12156bf03cf8c1851e5df7 Requested by: @antoniofrighetto --- Full diff: https://github.com/llvm/llvm-project/pull/117137.diff 2 Files Affected: - (modified) llvm/lib/Transforms/

[llvm-branch-commits] [llvm] release/19.x: [ConstraintElim] Bail out on non-dedicated exits when adding exiting conditions (#116627) (PR #117137)

2024-11-21 Thread via llvm-branch-commits
https://github.com/llvmbot milestoned https://github.com/llvm/llvm-project/pull/117137 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [llvm] release/19.x: [ConstraintElim] Bail out on non-dedicated exits when adding exiting conditions (#116627) (PR #117137)

2024-11-21 Thread via llvm-branch-commits
https://github.com/llvmbot created https://github.com/llvm/llvm-project/pull/117137 Backport 52361d0368b79841be12156bf03cf8c1851e5df7 Requested by: @antoniofrighetto >From 4e3f5191928641fdf7298ee21fdf09ab0f17a53e Mon Sep 17 00:00:00 2001 From: Yingwei Zheng Date: Mon, 18 Nov 2024 23:41:04 +08

[llvm-branch-commits] [llvm] release/19.x: [ConstraintElim] Bail out on non-dedicated exits when adding exiting conditions (#116627) (PR #117137)

2024-11-21 Thread via llvm-branch-commits
llvmbot wrote: @fhahn What do you think about merging this PR to the release branch? https://github.com/llvm/llvm-project/pull/117137 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinf

[llvm-branch-commits] [llvm] [LoongArch] Fix GOT usage for `non-dso_local` function calls in large code model (PR #117134)

2024-11-21 Thread via llvm-branch-commits
https://github.com/wangleiat milestoned https://github.com/llvm/llvm-project/pull/117134 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [llvm] release/19.x: [SCEV] Fix sext handling for `getConstantMultiple` (#117093) (PR #117136)

2024-11-21 Thread via llvm-branch-commits
https://github.com/llvmbot created https://github.com/llvm/llvm-project/pull/117136 Backport 458dfbd855806461b4508bf8845cafe0411dbfd4 Requested by: @dtcxzyw >From f6c67ad7a20fe7bb535242c78b8f06cacc48d521 Mon Sep 17 00:00:00 2001 From: Yingwei Zheng Date: Thu, 21 Nov 2024 17:23:04 +0800 Subjec

[llvm-branch-commits] [llvm] release/19.x: [SCEV] Fix sext handling for `getConstantMultiple` (#117093) (PR #117136)

2024-11-21 Thread via llvm-branch-commits
llvmbot wrote: @antoniofrighetto What do you think about merging this PR to the release branch? https://github.com/llvm/llvm-project/pull/117136 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mail

[llvm-branch-commits] [llvm] release/19.x: [SCEV] Fix sext handling for `getConstantMultiple` (#117093) (PR #117136)

2024-11-21 Thread via llvm-branch-commits
https://github.com/llvmbot milestoned https://github.com/llvm/llvm-project/pull/117136 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [llvm] release/19.x: [SCEV] Fix sext handling for `getConstantMultiple` (#117093) (PR #117136)

2024-11-21 Thread via llvm-branch-commits
llvmbot wrote: @llvm/pr-subscribers-llvm-analysis Author: None (llvmbot) Changes Backport 458dfbd855806461b4508bf8845cafe0411dbfd4 Requested by: @dtcxzyw --- Full diff: https://github.com/llvm/llvm-project/pull/117136.diff 3 Files Affected: - (modified) llvm/lib/Analysis/ScalarEvoluti

[llvm-branch-commits] [llvm] release/19.x: [SCEV] Fix sext handling for `getConstantMultiple` (#117093) (PR #117136)

2024-11-21 Thread Nikita Popov via llvm-branch-commits
https://github.com/nikic approved this pull request. https://github.com/llvm/llvm-project/pull/117136 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [llvm] release/19.x: [LICM] allow MemoryAccess creation failure (#116813) (PR #117082)

2024-11-21 Thread Nikita Popov via llvm-branch-commits
@@ -190,7 +190,8 @@ class MemorySSAUpdater { /// inaccessible and it *must* have removeMemoryAccess called on it. MemoryAccess *createMemoryAccessInBB(Instruction *I, MemoryAccess *Definition, const BasicBlock *BB, -

[llvm-branch-commits] [lld] [llvm] release/19.x: [MC][LoongArch] Change default cpu in `MCSubtargetInfo`. (#114922) (PR #117105)

2024-11-21 Thread via llvm-branch-commits
heiher wrote: > Some tests need to be fixed. > > ``` > Failed Tests (3): > LLVM :: CodeGen/LoongArch/e_flags.ll > lld :: ELF/emulation-loongarch.s > lld :: ELF/loongarch-interlink.test > ``` Fixed. https://github.com/llvm/llvm-project/pull/117105 _

[llvm-branch-commits] [llvm] [LoongArch] Fix GOT usage for `non-dso_local` function calls in large code model (PR #117134)

2024-11-21 Thread via llvm-branch-commits
https://github.com/wangleiat created https://github.com/llvm/llvm-project/pull/117134 This commit fixes an issue in the large code model where non-dso_local function calls did not use the GOT as expected in PIC mode. Instead, direct PC-relative access was incorrectly applied, leading to linker

[llvm-branch-commits] [llvm] [LoongArch] Fix GOT usage for `non-dso_local` function calls in large code model (PR #117134)

2024-11-21 Thread via llvm-branch-commits
llvmbot wrote: @llvm/pr-subscribers-backend-loongarch Author: wanglei (wangleiat) Changes This commit fixes an issue in the large code model where non-dso_local function calls did not use the GOT as expected in PIC mode. Instead, direct PC-relative access was incorrectly applied, leading

[llvm-branch-commits] [llvm] [LoongArch] Fix GOT usage for `non-dso_local` function calls in large code model (PR #117134)

2024-11-21 Thread via llvm-branch-commits
https://github.com/wangleiat edited https://github.com/llvm/llvm-project/pull/117134 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [llvm] release/19.x: [LICM] allow MemoryAccess creation failure (#116813) (PR #117082)

2024-11-21 Thread via llvm-branch-commits
https://github.com/DianQK updated https://github.com/llvm/llvm-project/pull/117082 >From e3364b6e56999488106d990b5f0f907823afa42c Mon Sep 17 00:00:00 2001 From: DianQK Date: Wed, 20 Nov 2024 19:52:51 +0800 Subject: [PATCH] [LICM] allow MemoryAccess creation failure (#116813) Fixes #116809. Af

[llvm-branch-commits] [llvm] release/19.x: [ConstraintElim] Bail out on non-dedicated exits when adding exiting conditions (#116627) (PR #117137)

2024-11-21 Thread Florian Hahn via llvm-branch-commits
https://github.com/fhahn approved this pull request. LGTM to cherry pick, thanks! https://github.com/llvm/llvm-project/pull/117137 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/l

[llvm-branch-commits] [clang-tools-extra] 2a4a50d - Revert "[NFC] Explicitly pass a VFS when creating DiagnosticsEngine (#115852)"

2024-11-21 Thread via llvm-branch-commits
Author: Sylvestre Ledru Date: 2024-11-21T07:04:23-05:00 New Revision: 2a4a50d85689bb2ac51258c485fceb64dfb6cd73 URL: https://github.com/llvm/llvm-project/commit/2a4a50d85689bb2ac51258c485fceb64dfb6cd73 DIFF: https://github.com/llvm/llvm-project/commit/2a4a50d85689bb2ac51258c485fceb64dfb6cd73.dif

[llvm-branch-commits] [flang] [flang] handle fir.call in AliasAnalysis::getModRef (PR #117164)

2024-11-21 Thread via llvm-branch-commits
github-actions[bot] wrote: :warning: Python code formatter, darker found issues in your code. :warning: You can test this locally with the following command: ``bash darker --check --diff -r c12869e010d892caf93d153c187db846ba995a9e...84c95d6c816004abe6c01eb754688fb35a666ffc flang

[llvm-branch-commits] [llvm] release/19.x: [MachineLICM] Don't allow hoisting invariant loads across mem barrier. (#116987) (PR #117154)

2024-11-21 Thread via llvm-branch-commits
https://github.com/llvmbot created https://github.com/llvm/llvm-project/pull/117154 Backport a9b3ec154d7ab2d0896ac5c9f1e9a1266a37be80 ef102b4a6333a304e36dc623d5381257a7ef1ed6 Requested by: @fhahn >From fccca51f3cdf8f918643b2afa0d410590e3acf95 Mon Sep 17 00:00:00 2001 From: Florian Hahn Date:

[llvm-branch-commits] [llvm] release/19.x: [MachineLICM] Don't allow hoisting invariant loads across mem barrier. (#116987) (PR #117154)

2024-11-21 Thread via llvm-branch-commits
llvmbot wrote: @david-arm What do you think about merging this PR to the release branch? https://github.com/llvm/llvm-project/pull/117154 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/lis

[llvm-branch-commits] [llvm] release/19.x: [MachineLICM] Don't allow hoisting invariant loads across mem barrier. (#116987) (PR #117154)

2024-11-21 Thread via llvm-branch-commits
llvmbot wrote: @llvm/pr-subscribers-backend-aarch64 Author: None (llvmbot) Changes Backport a9b3ec154d7ab2d0896ac5c9f1e9a1266a37be80 ef102b4a6333a304e36dc623d5381257a7ef1ed6 Requested by: @fhahn --- Full diff: https://github.com/llvm/llvm-project/pull/117154.diff 3 Files Affected: -

[llvm-branch-commits] [llvm] release/19.x: [MachineLICM] Don't allow hoisting invariant loads across mem barrier. (#116987) (PR #117154)

2024-11-21 Thread via llvm-branch-commits
https://github.com/llvmbot milestoned https://github.com/llvm/llvm-project/pull/117154 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [llvm] [CodeGen][NewPM] Port SpillPlacement analysis to NPM (PR #116618)

2024-11-21 Thread Akshat Oke via llvm-branch-commits
https://github.com/optimisan updated https://github.com/llvm/llvm-project/pull/116618 >From 6408bcec55deafbf767a417684c2bfe3dd251068 Mon Sep 17 00:00:00 2001 From: Akshat Oke Date: Mon, 18 Nov 2024 12:42:00 + Subject: [PATCH 1/3] [CodeGen][NewPM] Port SpillPlacement analysis to NPM --- ll

[llvm-branch-commits] [flang] [flang] handle fir.call in AliasAnalysis::getModRef (PR #117164)

2024-11-21 Thread Tom Eccles via llvm-branch-commits
@@ -329,14 +341,92 @@ AliasResult AliasAnalysis::alias(Source lhsSrc, Source rhsSrc, mlir::Value lhs, // AliasAnalysis: getModRef //===--===// +static bool isSavedLocal(const fir::AliasAnalysis::Source &src)

[llvm-branch-commits] [flang] [flang] handle fir.call in AliasAnalysis::getModRef (PR #117164)

2024-11-21 Thread Tom Eccles via llvm-branch-commits
https://github.com/tblah edited https://github.com/llvm/llvm-project/pull/117164 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [flang] [flang] handle fir.call in AliasAnalysis::getModRef (PR #117164)

2024-11-21 Thread Tom Eccles via llvm-branch-commits
@@ -329,14 +341,92 @@ AliasResult AliasAnalysis::alias(Source lhsSrc, Source rhsSrc, mlir::Value lhs, // AliasAnalysis: getModRef //===--===// +static bool isSavedLocal(const fir::AliasAnalysis::Source &src)

[llvm-branch-commits] [flang] [flang] handle fir.call in AliasAnalysis::getModRef (PR #117164)

2024-11-21 Thread Tom Eccles via llvm-branch-commits
@@ -329,14 +341,92 @@ AliasResult AliasAnalysis::alias(Source lhsSrc, Source rhsSrc, mlir::Value lhs, // AliasAnalysis: getModRef //===--===// +static bool isSavedLocal(const fir::AliasAnalysis::Source &src)

[llvm-branch-commits] [flang] [flang] handle fir.call in AliasAnalysis::getModRef (PR #117164)

2024-11-21 Thread Tom Eccles via llvm-branch-commits
https://github.com/tblah approved this pull request. Looks great to me. I have reviewed that this does implement the language rules you mentioned in the description (which match my understanding). Please wait for Peter to check those before merging. https://github.com/llvm/llvm-project/pull/11

[llvm-branch-commits] [flang] [flang][OpenMP] Use new modifier code in ORDER and SCHEDULE clauses (PR #117081)

2024-11-21 Thread Krzysztof Parzyszek via llvm-branch-commits
https://github.com/kparzysz updated https://github.com/llvm/llvm-project/pull/117081 >From 43bdfcdb48328fcdfe762734bd5a4c1df3987c4b Mon Sep 17 00:00:00 2001 From: Krzysztof Parzyszek Date: Mon, 18 Nov 2024 13:01:30 -0600 Subject: [PATCH 1/2] [flang][OpenMP] Use new modifier code in ORDER and S

[llvm-branch-commits] [flang] [flang] handle fir.call in AliasAnalysis::getModRef (PR #117164)

2024-11-21 Thread Razvan Lupusoru via llvm-branch-commits
https://github.com/razvanlupusoru approved this pull request. Looks amazing! I agree with the various limitations and as far as I can tell - the non-implemented TODOs are not a correctness problem - just a limitation. Do you have plans to add support for Fortran runtime calls also? I think a s

[llvm-branch-commits] [flang] [flang] handle fir.call in AliasAnalysis::getModRef (PR #117164)

2024-11-21 Thread via llvm-branch-commits
https://github.com/jeanPerier created https://github.com/llvm/llvm-project/pull/117164 fir.call side effects are hard to describe in a useful way using `MemoryEffectOpInterface` because it is impossible to list which memory location a user procedure read/write without doing a data flow analysi

[llvm-branch-commits] [flang] [flang] handle fir.call in AliasAnalysis::getModRef (PR #117164)

2024-11-21 Thread via llvm-branch-commits
llvmbot wrote: @llvm/pr-subscribers-flang-fir-hlfir Author: None (jeanPerier) Changes fir.call side effects are hard to describe in a useful way using `MemoryEffectOpInterface` because it is impossible to list which memory location a user procedure read/write without doing a data flow an

[llvm-branch-commits] [clang] [llvm] AMDGPU: Shrink used number of registers for mfma scale based on format (PR #117047)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
arsenm wrote: ### Merge activity * **Nov 21, 11:47 AM EST**: A user started a stack merge that includes this pull request via [Graphite](https://app.graphite.dev/github/pr/llvm/llvm-project/117047). https://github.com/llvm/llvm-project/pull/117047

[llvm-branch-commits] [llvm] AMDGPU: Optimize mfma_scale intrinsics with 0 inputs (PR #116724)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
arsenm wrote: ### Merge activity * **Nov 21, 11:47 AM EST**: A user started a stack merge that includes this pull request via [Graphite](https://app.graphite.dev/github/pr/llvm/llvm-project/116724). https://github.com/llvm/llvm-project/pull/116724

[llvm-branch-commits] [flang] [flang] handle fir.call in AliasAnalysis::getModRef (PR #117164)

2024-11-21 Thread Peter Klausler via llvm-branch-commits
https://github.com/klausler approved this pull request. https://github.com/llvm/llvm-project/pull/117164 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [flang] [flang] handle fir.call in AliasAnalysis::getModRef (PR #117164)

2024-11-21 Thread Peter Klausler via llvm-branch-commits
@@ -0,0 +1,135 @@ +! RUN: bbc -emit-hlfir %s -o - | %python %S/gen_mod_ref_test.py | \ +! RUN: fir-opt -pass-pipeline='builtin.module(func.func(test-fir-alias-analysis-modref))' \ +! RUN: --mlir-disable-threading -o /dev/null 2>&1 | FileCheck %s + +! Test fir.call modref with i

[llvm-branch-commits] [flang] [MLIR][OpenMP] Add Lowering support for OpenMP Declare Mapper directive (PR #117046)

2024-11-21 Thread Akash Banerjee via llvm-branch-commits
@@ -21,7 +21,7 @@ subroutine declare_mapper_1 type (my_type2):: t real :: x, y(nvals) !$omp declare mapper (my_type :: var) map (var, var%values (1:var%num_vals)) -!CHECK: not yet implemented: OpenMPDeclareMapperConstruct +!CHECK: not yet impleme

[llvm-branch-commits] [llvm] AMDGPU: Add a baseline, non-comprehensive test for scaled mfma hazards (PR #117055)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117055 >From a0485e65e1c41a3113b68b7c4c3456f7d9337f97 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Mon, 4 Mar 2024 17:36:33 +0530 Subject: [PATCH] AMDGPU: Add a baseline, non-comprehensive test for scaled mfma h

[llvm-branch-commits] [flang] [flang] handle fir.call in AliasAnalysis::getModRef (PR #117164)

2024-11-21 Thread Peter Klausler via llvm-branch-commits
@@ -0,0 +1,68 @@ +! RUN: bbc -emit-hlfir %s -o - | %python %S/gen_mod_ref_test.py | \ +! RUN: fir-opt -pass-pipeline='builtin.module(func.func(test-fir-alias-analysis-modref))' \ +! RUN: --mlir-disable-threading -o /dev/null 2>&1 | FileCheck %s + +! Test fir.call modref for glo

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_mfma_f32_16x16x32_bf16 for gfx950 (PR #117053)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117053 >From 84c3383558d5962f78086b64244997ca7a2b8c01 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Mon, 29 Jan 2024 18:16:52 +0530 Subject: [PATCH] AMDGPU: Add v_mfma_f32_16x16x32_bf16 for gfx950 --- .../CodeGe

[llvm-branch-commits] [llvm] AMDGPU: Add a baseline, non-comprehensive test for scaled mfma hazards (PR #117055)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117055 >From a5ed11b07ab7ac28d304db851abf01c6b1230c24 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Mon, 4 Mar 2024 17:36:33 +0530 Subject: [PATCH] AMDGPU: Add a baseline, non-comprehensive test for scaled mfma h

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_fp8_fp8 for gfx950 (PR #117259)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117259 >From d36a1301eb84377617c35c125e136230327eb3e9 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Sat, 3 Feb 2024 21:43:00 +0530 Subject: [PATCH] AMDGPU: Add v_smfmac_f32_32x32x64_fp8_fp8 for gfx950 --- clang/

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_bf8_fp8 for gfx950 (PR #117257)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117257 >From 73f8fed93b6fd985cf79d384fee64fc506ceb062 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Sat, 3 Feb 2024 21:09:21 +0530 Subject: [PATCH] AMDGPU: Add v_smfmac_f32_32x32x64_bf8_fp8 for gfx950 --- clang/

[llvm-branch-commits] [flang] [flang][OpenMP] Use new modifier code in ORDER and SCHEDULE clauses (PR #117081)

2024-11-21 Thread Tom Eccles via llvm-branch-commits
@@ -153,6 +153,16 @@ std::optional maybeApply(FuncTy &&func, return std::move(func(*arg)); } +template < +typename FuncTy, // +typename ArgTy, // +typename ResultTy = std::invoke_result_t> +std::optional maybeApplyToV(FuncTy &&func, const ArgTy *arg) { + if (!a

[llvm-branch-commits] [flang] [flang][OpenMP] Use new modifier code in ORDER and SCHEDULE clauses (PR #117081)

2024-11-21 Thread Tom Eccles via llvm-branch-commits
https://github.com/tblah edited https://github.com/llvm/llvm-project/pull/117081 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_i32_32x32x64_i8 for gfx950 (PR #117214)

2024-11-21 Thread Sirish Pande via llvm-branch-commits
https://github.com/srpande approved this pull request. lgrm https://github.com/llvm/llvm-project/pull/117214 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x64_bf16 for gfx950 (PR #117211)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
https://github.com/arsenm ready_for_review https://github.com/llvm/llvm-project/pull/117211 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x128_fp8_fp8 for gfx950 (PR #117235)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
arsenm wrote: ### Merge activity * **Nov 21, 7:53 PM EST**: A user started a stack merge that includes this pull request via [Graphite](https://app.graphite.dev/github/pr/llvm/llvm-project/117235). https://github.com/llvm/llvm-project/pull/117235 _

[llvm-branch-commits] [llvm] 14b474b - Revert "[LV][VPlan] Remove any-of reduction from precomputeCost. NFC (#117109)"

2024-11-21 Thread via llvm-branch-commits
Author: Elvis Wang Date: 2024-11-22T11:32:12+08:00 New Revision: 14b474be36144527a55b5d49954379a3484c5f84 URL: https://github.com/llvm/llvm-project/commit/14b474be36144527a55b5d49954379a3484c5f84 DIFF: https://github.com/llvm/llvm-project/commit/14b474be36144527a55b5d49954379a3484c5f84.diff LO

[llvm-branch-commits] [llvm] AMDGPU: Handle v_mfma_f64_16x16x4_f64 srcc write VGPR hazard change for gfx950 (PR #117283)

2024-11-21 Thread via llvm-branch-commits
llvmbot wrote: @llvm/pr-subscribers-backend-amdgpu Author: Matt Arsenault (arsenm) Changes Read by sgemm/dgemm in srcc after v_mfma_f64_16x16x4_f64 increases from 9 to 17 wait states. --- Full diff: https://github.com/llvm/llvm-project/pull/117283.diff 2 Files Affected: - (modified) ll

[llvm-branch-commits] [llvm] AMDGPU: Handle gfx950 change in mfma_f64_16x16x4 + valu hazard (PR #117262)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117262 >From 06412577e65e05abf3edc1a884edc8640b924933 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Thu, 7 Mar 2024 15:01:08 +0530 Subject: [PATCH] AMDGPU: Handle gfx950 change in mfma_f64_16x16x4 + valu hazard

[llvm-branch-commits] [llvm] AMDGPU: Handle vcmpx+permalane gfx950 hazard (PR #117286)

2024-11-21 Thread via llvm-branch-commits
github-actions[bot] wrote: :warning: C/C++ code formatter, clang-format found issues in your code. :warning: You can test this locally with the following command: ``bash git-clang-format --diff 52f540df160ad84aef090acb35c9372c270d758b 0cbee40e03bff1514abbf1e879522a4808175c1a --e

[llvm-branch-commits] [llvm] AMDGPU: Refine gfx950 xdl-write-vgpr hazard cases (PR #117285)

2024-11-21 Thread via llvm-branch-commits
llvmbot wrote: @llvm/pr-subscribers-backend-amdgpu Author: Matt Arsenault (arsenm) Changes The 2-pass XDL write VGPR, read by non-XDL SGEMM/DGEMM case was 1 wait state overly conservative. Previously, for gfx940, the XDL/non-XDL cases happened to have the same number of cycles in all cases

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_bf8_fp8 for gfx950 (PR #117257)

2024-11-21 Thread Shilei Tian via llvm-branch-commits
https://github.com/shiltian approved this pull request. https://github.com/llvm/llvm-project/pull/117257 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_bf8_fp8 for gfx950 (PR #117257)

2024-11-21 Thread Shilei Tian via llvm-branch-commits
https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/117257 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_permlane16_swap_b32 and v_permlane32_swap_b32 for gfx950 (PR #117260)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117260 >From 426d5baaf7d373a6d35ead2af4515e108a6eb8b8 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Mon, 22 Jan 2024 12:40:54 +0700 Subject: [PATCH] AMDGPU: Add v_permlane16_swap_b32 and v_permlane32_swap_b32 for

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_bf8_fp8 for gfx950 (PR #117257)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117257 >From 698095bb278b20ff853018b997a563a2387eeca6 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Sat, 3 Feb 2024 21:09:21 +0530 Subject: [PATCH] AMDGPU: Add v_smfmac_f32_32x32x64_bf8_fp8 for gfx950 --- clang/

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x64_fp8_fp8 for gfx950 (PR #117259)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117259 >From d5b3bb6210d19c81a935790c5267c3d97125a00d Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Sat, 3 Feb 2024 21:43:00 +0530 Subject: [PATCH] AMDGPU: Add v_smfmac_f32_32x32x64_fp8_fp8 for gfx950 --- clang/

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x32x64_fp8_bf8 for gfx950 (PR #117258)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117258 >From 32ccf3950258693e8ca7be1c7ecc6670debc2bf7 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Sat, 3 Feb 2024 21:25:33 +0530 Subject: [PATCH] AMDGPU: Add v_smfmac_f32_32x32x32x64_fp8_bf8 for gfx950 --- cla

[llvm-branch-commits] [llvm] AMDGPU: Handle gfx950 XDL-write-overlapped-smfma-src-c wait state change (PR #117263)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117263 >From 087117bc3dc327237d52746813e932d4c8f0b8bc Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Wed, 6 Mar 2024 19:51:00 +0530 Subject: [PATCH] AMDGPU: Handle gfx950 XDL-write-overlapped-smfma-src-c wait stat

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x128_bf8_fp8 for gfx950 (PR #117233)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
arsenm wrote: ### Merge activity * **Nov 21, 7:53 PM EST**: A user started a stack merge that includes this pull request via [Graphite](https://app.graphite.dev/github/pr/llvm/llvm-project/117233). https://github.com/llvm/llvm-project/pull/117233 _

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x128_fp8_bf8 for gfx950 (PR #117234)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
arsenm wrote: ### Merge activity * **Nov 21, 7:53 PM EST**: A user started a stack merge that includes this pull request via [Graphite](https://app.graphite.dev/github/pr/llvm/llvm-project/117234). https://github.com/llvm/llvm-project/pull/117234 _

[llvm-branch-commits] [llvm] AMDGPU: Handle vcmpx+permalane gfx950 hazard (PR #117286)

2024-11-21 Thread via llvm-branch-commits
llvmbot wrote: @llvm/pr-subscribers-backend-amdgpu Author: Matt Arsenault (arsenm) Changes Confusingly, this is a different hazard to the one on gfx10 with a subtarget feature. --- Full diff: https://github.com/llvm/llvm-project/pull/117286.diff 3 Files Affected: - (modified) llvm/lib/

[llvm-branch-commits] [llvm] AMDGPU: Handle gfx950 XDL-write-overlapped-smfma-src-c wait state change (PR #117263)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117263 >From 736d914241979efb46b506fb45cee79e73bbd20e Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Wed, 6 Mar 2024 19:51:00 +0530 Subject: [PATCH] AMDGPU: Handle gfx950 XDL-write-overlapped-smfma-src-c wait stat

[llvm-branch-commits] [llvm] AMDGPU: Handle gfx950 change in mfma_f64_16x16x4 + valu hazard (PR #117262)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117262 >From fc9424bd9d0d54a931f4059ff9a6f657f1c5a2dd Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Thu, 7 Mar 2024 15:01:08 +0530 Subject: [PATCH] AMDGPU: Handle gfx950 change in mfma_f64_16x16x4 + valu hazard

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x32x64_fp8_bf8 for gfx950 (PR #117258)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117258 >From 24576df683abfa29c9d7f4406a318b6b67701732 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Sat, 3 Feb 2024 21:25:33 +0530 Subject: [PATCH] AMDGPU: Add v_smfmac_f32_32x32x32x64_fp8_bf8 for gfx950 --- cla

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_permlane16_swap_b32 and v_permlane32_swap_b32 for gfx950 (PR #117260)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
https://github.com/arsenm updated https://github.com/llvm/llvm-project/pull/117260 >From 549b571ea25a06301f719778786a288d85604464 Mon Sep 17 00:00:00 2001 From: Matt Arsenault Date: Mon, 22 Jan 2024 12:40:54 +0700 Subject: [PATCH] AMDGPU: Add v_permlane16_swap_b32 and v_permlane32_swap_b32 for

[llvm-branch-commits] [llvm] [LoongArch] Fix GOT usage for `non-dso_local` function calls in large code model (PR #117134)

2024-11-21 Thread Lu Weining via llvm-branch-commits
https://github.com/SixWeining approved this pull request. https://github.com/llvm/llvm-project/pull/117134 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] [RISCV] Support __builtin_cpu_is (PR #116231)

2024-11-21 Thread Pengcheng Wang via llvm-branch-commits
https://github.com/wangpc-pp updated https://github.com/llvm/llvm-project/pull/116231 >From 9686a2c5c5276289e72d9098f497a9f246a1c457 Mon Sep 17 00:00:00 2001 From: Wang Pengcheng Date: Thu, 14 Nov 2024 22:06:45 +0800 Subject: [PATCH 1/4] Remove stale CHECKs Created using spr 1.3.6-beta.1 ---

[llvm-branch-commits] [clang] [llvm] [RISCV] Support __builtin_cpu_is (PR #116231)

2024-11-21 Thread Pengcheng Wang via llvm-branch-commits
@@ -58,6 +58,19 @@ bool hasFastVectorUnalignedAccess(StringRef CPU) { return Info && Info->FastVectorUnalignedAccess; } +bool hasValidCPUModel(StringRef CPU) { + const CPUModel CPUModel = getCPUModel(CPU); + return CPUModel.MVendorID != 0 && CPUModel.MArchID != 0 && --

[llvm-branch-commits] [clang] [llvm] [RISCV] Support __builtin_cpu_is (PR #116231)

2024-11-21 Thread Pengcheng Wang via llvm-branch-commits
https://github.com/wangpc-pp updated https://github.com/llvm/llvm-project/pull/116231 >From 9686a2c5c5276289e72d9098f497a9f246a1c457 Mon Sep 17 00:00:00 2001 From: Wang Pengcheng Date: Thu, 14 Nov 2024 22:06:45 +0800 Subject: [PATCH 1/4] Remove stale CHECKs Created using spr 1.3.6-beta.1 ---

[llvm-branch-commits] [clang] [llvm] [RISCV] Support __builtin_cpu_is (PR #116231)

2024-11-21 Thread Pengcheng Wang via llvm-branch-commits
https://github.com/wangpc-pp updated https://github.com/llvm/llvm-project/pull/116231 >From 9686a2c5c5276289e72d9098f497a9f246a1c457 Mon Sep 17 00:00:00 2001 From: Wang Pengcheng Date: Thu, 14 Nov 2024 22:06:45 +0800 Subject: [PATCH 1/4] Remove stale CHECKs Created using spr 1.3.6-beta.1 ---

[llvm-branch-commits] [clang] [llvm] [RISCV] Support __builtin_cpu_is (PR #116231)

2024-11-21 Thread Pengcheng Wang via llvm-branch-commits
https://github.com/wangpc-pp updated https://github.com/llvm/llvm-project/pull/116231 >From 9686a2c5c5276289e72d9098f497a9f246a1c457 Mon Sep 17 00:00:00 2001 From: Wang Pengcheng Date: Thu, 14 Nov 2024 22:06:45 +0800 Subject: [PATCH 1/4] Remove stale CHECKs Created using spr 1.3.6-beta.1 ---

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_permlane16_swap_b32 and v_permlane32_swap_b32 for gfx950 (PR #117260)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
https://github.com/arsenm created https://github.com/llvm/llvm-project/pull/117260 This was a bit annoying because these introduce a new special case encoding usage. op_sel is repurposed as a subset of dpp controls, and is eligible for VOP3->VOP1 shrinking. For some reason fi also uses an enum v

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x64_f16 for gfx950 (PR #117202)

2024-11-21 Thread via llvm-branch-commits
llvmbot wrote: @llvm/pr-subscribers-mc Author: Matt Arsenault (arsenm) Changes --- Patch is 27.26 KiB, truncated to 20.00 KiB below, full version: https://github.com/llvm/llvm-project/pull/117202.diff 13 Files Affected: - (modified) clang/include/clang/Basic/BuiltinsAMDGPU.def (+1)

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x64_f16 for gfx950 (PR #117202)

2024-11-21 Thread via llvm-branch-commits
llvmbot wrote: @llvm/pr-subscribers-clang Author: Matt Arsenault (arsenm) Changes --- Patch is 27.26 KiB, truncated to 20.00 KiB below, full version: https://github.com/llvm/llvm-project/pull/117202.diff 13 Files Affected: - (modified) clang/include/clang/Basic/BuiltinsAMDGPU.def (+

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x64_f16 for gfx950 (PR #117202)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
arsenm wrote: > [!WARNING] > This pull request is not mergeable via GitHub because a downstack PR is > open. Once all requirements are satisfied, merge this PR as a stack href="https://app.graphite.dev/github/pr/llvm/llvm-project/117202?utm_source=stack-comment-downstack-mergeability-warning";

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x64_f16 for gfx950 (PR #117202)

2024-11-21 Thread via llvm-branch-commits
llvmbot wrote: @llvm/pr-subscribers-backend-amdgpu @llvm/pr-subscribers-llvm-analysis Author: Matt Arsenault (arsenm) Changes --- Patch is 27.26 KiB, truncated to 20.00 KiB below, full version: https://github.com/llvm/llvm-project/pull/117202.diff 13 Files Affected: - (modified) cla

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x64_f16 for gfx950 (PR #117202)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
https://github.com/arsenm ready_for_review https://github.com/llvm/llvm-project/pull/117202 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x32_f16 for gfx950 (PR #117205)

2024-11-21 Thread Matt Arsenault via llvm-branch-commits
https://github.com/arsenm ready_for_review https://github.com/llvm/llvm-project/pull/117205 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [HLSL] Add RWBuffer::Load(Index) (PR #117018)

2024-11-21 Thread Helena Kotas via llvm-branch-commits
https://github.com/hekota edited https://github.com/llvm/llvm-project/pull/117018 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x32_f16 for gfx950 (PR #117205)

2024-11-21 Thread Shilei Tian via llvm-branch-commits
https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/117205 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x64_bf16 for gfx950 (PR #117211)

2024-11-21 Thread Shilei Tian via llvm-branch-commits
https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/117211 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x64_bf16 for gfx950 (PR #117211)

2024-11-21 Thread Shilei Tian via llvm-branch-commits
https://github.com/shiltian approved this pull request. https://github.com/llvm/llvm-project/pull/117211 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_32x32x32_bf16 for gfx950 (PR #117212)

2024-11-21 Thread Shilei Tian via llvm-branch-commits
https://github.com/shiltian approved this pull request. https://github.com/llvm/llvm-project/pull/117212 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_i32_16x16x128_i8 for gfx950 (PR #117213)

2024-11-21 Thread Shilei Tian via llvm-branch-commits
https://github.com/shiltian approved this pull request. https://github.com/llvm/llvm-project/pull/117213 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_i32_32x32x64_i8 for gfx950 (PR #117214)

2024-11-21 Thread Shilei Tian via llvm-branch-commits
https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/117214 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x128_bf8_fp8 for gfx950 (PR #117233)

2024-11-21 Thread Shilei Tian via llvm-branch-commits
https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/117233 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x128_bf8_bf8 for gfx950 (PR #117232)

2024-11-21 Thread Shilei Tian via llvm-branch-commits
https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/117232 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x128_bf8_fp8 for gfx950 (PR #117233)

2024-11-21 Thread Shilei Tian via llvm-branch-commits
https://github.com/shiltian approved this pull request. https://github.com/llvm/llvm-project/pull/117233 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x128_fp8_fp8 for gfx950 (PR #117235)

2024-11-21 Thread Shilei Tian via llvm-branch-commits
https://github.com/shiltian approved this pull request. https://github.com/llvm/llvm-project/pull/117235 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [clang] [llvm] AMDGPU: Add v_smfmac_f32_16x16x128_fp8_fp8 for gfx950 (PR #117235)

2024-11-21 Thread Shilei Tian via llvm-branch-commits
https://github.com/shiltian edited https://github.com/llvm/llvm-project/pull/117235 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

  1   2   >