[llvm-branch-commits] [llvm] [AMDGPU] Use reverse iteration in CodeGenPrepare (PR #145484)

2025-06-24 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh ready_for_review https://github.com/llvm/llvm-project/pull/145484 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [llvm] [AMDGPU] Use reverse iteration in CodeGenPrepare (PR #145484)

2025-06-24 Thread Pierre van Houtryve via llvm-branch-commits
@@ -1634,29 +1634,18 @@ define float @v_recip_sqrt_f32_ulp25_contract(float %x) { ; IR-IEEE-SDAG-LABEL: v_recip_sqrt_f32_ulp25_contract: ; IR-IEEE-SDAG: ; %bb.0: ; IR-IEEE-SDAG-NEXT:s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0) -; IR-IEEE-SDAG-NEXT:s_mov_b32 s4, 0xf800

[llvm-branch-commits] [llvm] [AMDGPU] Add KnownBits simplification combines to RegBankCombiner (PR #141591)

2025-06-26 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/141591 >From 642055d06b93df98e4f57c226d170f236014077c Mon Sep 17 00:00:00 2001 From: pvanhout Date: Tue, 27 May 2025 12:29:02 +0200 Subject: [PATCH 1/2] [AMDGPU] Add KnownBits simplification combines to RegBankCombi

[llvm-branch-commits] [llvm] [AMDGPU] Move S_BFE lowering into RegBankCombiner (PR #141589)

2025-06-26 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/141589 >From 6773e3b993fa25fa1087eae3db26d0244be6309b Mon Sep 17 00:00:00 2001 From: pvanhout Date: Tue, 27 May 2025 11:16:16 +0200 Subject: [PATCH 1/2] [AMDGPU] Move S_BFE lowering into RegBankCombiner --- llvm/li

[llvm-branch-commits] [llvm] [AMDGPU] Add KnownBits simplification combines to RegBankCombiner (PR #141591)

2025-06-26 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/141591 >From 642055d06b93df98e4f57c226d170f236014077c Mon Sep 17 00:00:00 2001 From: pvanhout Date: Tue, 27 May 2025 12:29:02 +0200 Subject: [PATCH 1/2] [AMDGPU] Add KnownBits simplification combines to RegBankCombi

[llvm-branch-commits] [llvm] [AMDGPU] Move S_BFE lowering into RegBankCombiner (PR #141589)

2025-06-26 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/141589 >From 6773e3b993fa25fa1087eae3db26d0244be6309b Mon Sep 17 00:00:00 2001 From: pvanhout Date: Tue, 27 May 2025 11:16:16 +0200 Subject: [PATCH 1/2] [AMDGPU] Move S_BFE lowering into RegBankCombiner --- llvm/li

[llvm-branch-commits] [llvm] [AMDGPU] Use reverse iteration in CodeGenPrepare (PR #145484)

2025-06-26 Thread Pierre van Houtryve via llvm-branch-commits
@@ -109,6 +110,7 @@ class AMDGPUCodeGenPrepareImpl bool FlowChanged = false; mutable Function *SqrtF32 = nullptr; mutable Function *LdexpF32 = nullptr; + mutable SetVector DeadVals; Pierre-vh wrote: I didn't use a set because I thought we could visit tw

[llvm-branch-commits] [llvm] [AMDGPU] New RegBankSelect: Add rules for `G_PTRTOINT` and `G_INTTOPTR` (PR #142604)

2025-06-04 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/142604 >From 9fd34f632f194a025669b2c2c0f83d19fb48b00c Mon Sep 17 00:00:00 2001 From: pvanhout Date: Tue, 3 Jun 2025 15:08:06 +0200 Subject: [PATCH 1/3] [AMDGPU] New RegBankSelect: Add rules for `G_PTRTOINT` and `G_I

[llvm-branch-commits] [llvm] [AMDGPU] New RegBanKSelect: Add S128 types (PR #142601)

2025-06-04 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/142601 >From 96669eee5e756faed679480521faafd9f1bad9d1 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Tue, 3 Jun 2025 13:27:55 +0200 Subject: [PATCH] [AMDGPU] New RegBanKSelect: Add S128 types --- llvm/lib/Target/AMD

[llvm-branch-commits] [llvm] [AMDGPU] Improve test coverage for G_INTTOPTR and G_PTRTOINT (PR #142603)

2025-06-04 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/142603 >From 3a47927dfaaa98cb0d2a336bfa416d2eb28e294d Mon Sep 17 00:00:00 2001 From: pvanhout Date: Tue, 3 Jun 2025 15:03:01 +0200 Subject: [PATCH] [AMDGPU] Improve test coverage for G_INTTOPTR and G_PTRTOINT Test P

[llvm-branch-commits] [llvm] [AMDGPU] Improve test coverage for G_INTTOPTR and G_PTRTOINT (PR #142603)

2025-06-04 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/142603 >From 3a47927dfaaa98cb0d2a336bfa416d2eb28e294d Mon Sep 17 00:00:00 2001 From: pvanhout Date: Tue, 3 Jun 2025 15:03:01 +0200 Subject: [PATCH] [AMDGPU] Improve test coverage for G_INTTOPTR and G_PTRTOINT Test P

[llvm-branch-commits] [llvm] [AMDGPU] New RegBankSelect: Add Ptr32/Ptr64/Ptr128 (PR #142602)

2025-06-04 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/142602 >From c69258d78459b8dcc89bec38a8a795763cd3dc80 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Tue, 3 Jun 2025 14:40:38 +0200 Subject: [PATCH] [AMDGPU] New RegBankSelect: Add Ptr32/Ptr64/Ptr128 There's quite a

[llvm-branch-commits] [llvm] [AMDGPU] New RegBankSelect: Add Ptr32/Ptr64/Ptr128 (PR #142602)

2025-06-04 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/142602 >From c69258d78459b8dcc89bec38a8a795763cd3dc80 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Tue, 3 Jun 2025 14:40:38 +0200 Subject: [PATCH] [AMDGPU] New RegBankSelect: Add Ptr32/Ptr64/Ptr128 There's quite a

[llvm-branch-commits] [llvm] [AMDGPU] New RegBanKSelect: Add S128 types (PR #142601)

2025-06-04 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh edited https://github.com/llvm/llvm-project/pull/142601 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [llvm] [AMDGPU] New RegBankSelect: Add missing S/VGPR pointer types (PR #142600)

2025-06-04 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh closed https://github.com/llvm/llvm-project/pull/142600 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [llvm] [AMDGPU] New RegBanKSelect: Add S128 types (PR #142601)

2025-06-04 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/142601 >From 96669eee5e756faed679480521faafd9f1bad9d1 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Tue, 3 Jun 2025 13:27:55 +0200 Subject: [PATCH] [AMDGPU] New RegBanKSelect: Add S128 types --- llvm/lib/Target/AMD

[llvm-branch-commits] [llvm] [AMDGPU] New RegBankSelect: Add rules for `G_PTRTOINT` and `G_INTTOPTR` (PR #142604)

2025-06-04 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/142604 >From 9fd34f632f194a025669b2c2c0f83d19fb48b00c Mon Sep 17 00:00:00 2001 From: pvanhout Date: Tue, 3 Jun 2025 15:08:06 +0200 Subject: [PATCH 1/3] [AMDGPU] New RegBankSelect: Add rules for `G_PTRTOINT` and `G_I

[llvm-branch-commits] [llvm] AMDGPU/GlobalISel: Improve readanylane combines in regbanklegalize (PR #142789)

2025-06-05 Thread Pierre van Houtryve via llvm-branch-commits
@@ -137,7 +138,123 @@ class AMDGPURegBankLegalizeCombiner { return {MatchMI, MatchMI->getOperand(1).getReg()}; } + std::tuple tryMatchRALFromUnmerge(Register Src) { +auto *ReadAnyLane = MRI.getVRegDef(Src); Pierre-vh wrote: ```suggestion Machin

[llvm-branch-commits] [llvm] AMDGPU/GlobalISel: Improve readanylane combines in regbanklegalize (PR #142789)

2025-06-05 Thread Pierre van Houtryve via llvm-branch-commits
@@ -137,7 +138,123 @@ class AMDGPURegBankLegalizeCombiner { return {MatchMI, MatchMI->getOperand(1).getReg()}; } + std::tuple tryMatchRALFromUnmerge(Register Src) { Pierre-vh wrote: ```suggestion std::pair tryMatchRALFromUnmerge(Register Src) { ```

[llvm-branch-commits] [llvm] AMDGPU/GlobalISel: Improve readanylane combines in regbanklegalize (PR #142789)

2025-06-05 Thread Pierre van Houtryve via llvm-branch-commits
@@ -137,7 +138,123 @@ class AMDGPURegBankLegalizeCombiner { return {MatchMI, MatchMI->getOperand(1).getReg()}; } + std::tuple tryMatchRALFromUnmerge(Register Src) { +auto *ReadAnyLane = MRI.getVRegDef(Src); +if (ReadAnyLane->getOpcode() == AMDGPU::G_AMDGPU_READA

[llvm-branch-commits] [llvm] AMDGPU/GlobalISel: Improve readanylane combines in regbanklegalize (PR #142789)

2025-06-05 Thread Pierre van Houtryve via llvm-branch-commits
@@ -137,7 +138,123 @@ class AMDGPURegBankLegalizeCombiner { return {MatchMI, MatchMI->getOperand(1).getReg()}; } + std::tuple tryMatchRALFromUnmerge(Register Src) { +auto *ReadAnyLane = MRI.getVRegDef(Src); +if (ReadAnyLane->getOpcode() == AMDGPU::G_AMDGPU_READA

[llvm-branch-commits] [llvm] AMDGPU/GlobalISel: Add waterfall lowering in regbanklegalize (PR #142790)

2025-06-05 Thread Pierre van Houtryve via llvm-branch-commits
@@ -117,45 +117,73 @@ static LLT getReadAnyLaneSplitTy(LLT Ty) { return LLT::scalar(32); } -static Register buildReadAnyLane(MachineIRBuilder &B, Register VgprSrc, - const RegisterBankInfo &RBI); +typedef std::functionhttps://github.com/llvm/l

[llvm-branch-commits] [llvm] AMDGPU/GlobalISel: Add waterfall lowering in regbanklegalize (PR #142790)

2025-06-05 Thread Pierre van Houtryve via llvm-branch-commits
@@ -117,45 +117,73 @@ static LLT getReadAnyLaneSplitTy(LLT Ty) { return LLT::scalar(32); } -static Register buildReadAnyLane(MachineIRBuilder &B, Register VgprSrc, - const RegisterBankInfo &RBI); +typedef std::function +ReadLaneFnTy; + +st

[llvm-branch-commits] [llvm] AMDGPU/GlobalISel: Add waterfall lowering in regbanklegalize (PR #142790)

2025-06-05 Thread Pierre van Houtryve via llvm-branch-commits
@@ -57,6 +57,226 @@ void RegBankLegalizeHelper::findRuleAndApplyMapping(MachineInstr &MI) { lower(MI, Mapping, WaterfallSgprs); } +bool RegBankLegalizeHelper::executeInWaterfallLoop( +MachineIRBuilder &B, iterator_range Range, +SmallSet &SGPROperandRegs) { + // Tra

[llvm-branch-commits] [llvm] AMDGPU/GlobalISel: Add waterfall lowering in regbanklegalize (PR #142790)

2025-06-05 Thread Pierre van Houtryve via llvm-branch-commits
@@ -1,6 +1,6 @@ ; NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py -; RUN: llc -global-isel -mtriple=amdgcn-mesa-mesa3d -stop-after=regbankselect -regbankselect-fast -o - %s | FileCheck %s -; RUN: llc -global-isel -mtriple=amdgcn-mesa-mesa3d -stop-afte

[llvm-branch-commits] [llvm] AMDGPU/GlobalISel: Add waterfall lowering in regbanklegalize (PR #142790)

2025-06-05 Thread Pierre van Houtryve via llvm-branch-commits
@@ -165,6 +165,8 @@ enum RegBankLLTMappingApplyID { Sgpr32Trunc, // Src only modifiers: waterfalls, extends + Sgpr32_W, + SgprV4S32_W, Pierre-vh wrote: Can you add a trailing comment or rename this ? The `_W` suffix is not immediately clear to me http

[llvm-branch-commits] [llvm] AMDGPU/GlobalISel: Add waterfall lowering in regbanklegalize (PR #142790)

2025-06-05 Thread Pierre van Houtryve via llvm-branch-commits
@@ -57,6 +57,226 @@ void RegBankLegalizeHelper::findRuleAndApplyMapping(MachineInstr &MI) { lower(MI, Mapping, WaterfallSgprs); } +bool RegBankLegalizeHelper::executeInWaterfallLoop( +MachineIRBuilder &B, iterator_range Range, +SmallSet &SGPROperandRegs) { + // Tra

[llvm-branch-commits] [llvm] AMDGPU/GlobalISel: Add waterfall lowering in regbanklegalize (PR #142790)

2025-06-05 Thread Pierre van Houtryve via llvm-branch-commits
@@ -894,6 +1121,15 @@ void RegBankLegalizeHelper::applyMappingSrc( } break; } +// sgpr waterfall, scalars and vectors +case Sgpr32_W: +case SgprV4S32_W: { + assert(Ty == getTyFromID(MethodIDs[i])); + if (RB != SgprRB) { +SgprWaterfa

[llvm-branch-commits] [llvm] AMDGPU/GlobalISel: Add waterfall lowering in regbanklegalize (PR #142790)

2025-06-05 Thread Pierre van Houtryve via llvm-branch-commits
@@ -57,6 +57,226 @@ void RegBankLegalizeHelper::findRuleAndApplyMapping(MachineInstr &MI) { lower(MI, Mapping, WaterfallSgprs); } +bool RegBankLegalizeHelper::executeInWaterfallLoop( +MachineIRBuilder &B, iterator_range Range, +SmallSet &SGPROperandRegs) { + // Tra

[llvm-branch-commits] [llvm] AMDGPU/GlobalISel: Improve readanylane combines in regbanklegalize (PR #142789)

2025-06-05 Thread Pierre van Houtryve via llvm-branch-commits
@@ -137,7 +138,123 @@ class AMDGPURegBankLegalizeCombiner { return {MatchMI, MatchMI->getOperand(1).getReg()}; } + std::tuple tryMatchRALFromUnmerge(Register Src) { +auto *ReadAnyLane = MRI.getVRegDef(Src); +if (ReadAnyLane->getOpcode() == AMDGPU::G_AMDGPU_READA

[llvm-branch-commits] [llvm] [GISel] Combine compare of bitfield extracts or'd together. (PR #146055)

2025-07-16 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/146055 >From c7f21bb87a5ec7d8931132c0812daa9d2d70e282 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Fri, 27 Jun 2025 12:04:53 +0200 Subject: [PATCH] [GISel] Combine compare of bitfield extracts or'd together. Equiva

[llvm-branch-commits] [llvm] [AMDGPU] Add tests for workgroup/workitem intrinsic optimizations (PR #146053)

2025-07-16 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/146053 >From 3d911c7154df77a7f56240498c55a82b863a0581 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Thu, 26 Jun 2025 13:08:31 +0200 Subject: [PATCH 1/2] [AMDGPU] Add tests for workgroup/workitem intrinsic optimizati

[llvm-branch-commits] [llvm] [DAG] Fold (setcc ((x | x >> c0 | ...) & mask)) sequences (PR #146054)

2025-07-16 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/146054 >From e90cd3e7ee0c3626cb24ba31cfa19a1b01fdef55 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Thu, 26 Jun 2025 13:31:37 +0200 Subject: [PATCH 1/3] [DAG] Fold (setcc ((x | x >> c0 | ...) & mask)) sequences Fold

[llvm-branch-commits] [llvm] [AMDGPU] Add tests for workgroup/workitem intrinsic optimizations (PR #146053)

2025-07-16 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/146053 >From 3d911c7154df77a7f56240498c55a82b863a0581 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Thu, 26 Jun 2025 13:08:31 +0200 Subject: [PATCH 1/2] [AMDGPU] Add tests for workgroup/workitem intrinsic optimizati

[llvm-branch-commits] [llvm] [DAG] Fold (setcc ((x | x >> c0 | ...) & mask)) sequences (PR #146054)

2025-07-16 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/146054 >From e90cd3e7ee0c3626cb24ba31cfa19a1b01fdef55 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Thu, 26 Jun 2025 13:31:37 +0200 Subject: [PATCH 1/3] [DAG] Fold (setcc ((x | x >> c0 | ...) & mask)) sequences Fold

[llvm-branch-commits] [llvm] [GISel] Combine compare of bitfield extracts or'd together. (PR #146055)

2025-07-16 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/146055 >From c7f21bb87a5ec7d8931132c0812daa9d2d70e282 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Fri, 27 Jun 2025 12:04:53 +0200 Subject: [PATCH] [GISel] Combine compare of bitfield extracts or'd together. Equiva

[llvm-branch-commits] [llvm] AMDGPU/GlobalISel: Add waterfall lowering in regbanklegalize (PR #145912)

2025-06-27 Thread Pierre van Houtryve via llvm-branch-commits
@@ -57,6 +57,224 @@ void RegBankLegalizeHelper::findRuleAndApplyMapping(MachineInstr &MI) { lower(MI, Mapping, WaterfallSgprs); } +bool RegBankLegalizeHelper::executeInWaterfallLoop( +MachineIRBuilder &B, iterator_range Range, +SmallSet &SGPROperandRegs) { + // Tra

[llvm-branch-commits] [llvm] AMDGPU/GlobalISel: Add waterfall lowering in regbanklegalize (PR #145912)

2025-06-27 Thread Pierre van Houtryve via llvm-branch-commits
@@ -188,7 +188,11 @@ enum RegBankLLTMappingApplyID { Sgpr32Trunc, - // Src only modifiers: waterfalls, extends + // Src only modifiers: execute in waterfall loop if divergent + Sgpr32_W, Pierre-vh wrote: nit: use a suffix like `_WF` for WaterFall? I kee

[llvm-branch-commits] [llvm] [AMDGPU] always emit a soft wait even if it is trivially ~0 (PR #147257)

2025-07-07 Thread Pierre van Houtryve via llvm-branch-commits
@@ -669,6 +679,7 @@ define amdgpu_kernel void @global_volatile_store_1( ; GFX12-WGP-NEXT:s_wait_kmcnt 0x0 ; GFX12-WGP-NEXT:s_wait_storecnt 0x0 ; GFX12-WGP-NEXT:global_store_b32 v0, v1, s[0:1] scope:SCOPE_SYS +; GFX12-WGP-NEXT:s_wait_loadcnt 0x3f

[llvm-branch-commits] [llvm] [AMDGPU] Move S_BFE lowering into RegBankCombiner (PR #141589)

2025-07-07 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/141589 >From d906a978145aabae8b2d1a029477d5a08272ae8c Mon Sep 17 00:00:00 2001 From: pvanhout Date: Tue, 27 May 2025 11:16:16 +0200 Subject: [PATCH 1/4] [AMDGPU] Move S_BFE lowering into RegBankCombiner --- llvm/li

[llvm-branch-commits] [llvm] [AMDGPU] Add KnownBits simplification combines to RegBankCombiner (PR #141591)

2025-07-07 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/141591 >From b386d126b9f560bf203fd044d81575ddfad2a8c6 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Tue, 27 May 2025 12:29:02 +0200 Subject: [PATCH 1/2] [AMDGPU] Add KnownBits simplification combines to RegBankCombi

[llvm-branch-commits] [llvm] [AMDGPU] Add KnownBits simplification combines to RegBankCombiner (PR #141591)

2025-07-07 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/141591 >From b386d126b9f560bf203fd044d81575ddfad2a8c6 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Tue, 27 May 2025 12:29:02 +0200 Subject: [PATCH 1/2] [AMDGPU] Add KnownBits simplification combines to RegBankCombi

[llvm-branch-commits] [llvm] [AMDGPU] always emit a soft wait even if it is trivially ~0 (PR #147257)

2025-07-07 Thread Pierre van Houtryve via llvm-branch-commits
@@ -669,6 +679,7 @@ define amdgpu_kernel void @global_volatile_store_1( ; GFX12-WGP-NEXT:s_wait_kmcnt 0x0 ; GFX12-WGP-NEXT:s_wait_storecnt 0x0 ; GFX12-WGP-NEXT:global_store_b32 v0, v1, s[0:1] scope:SCOPE_SYS +; GFX12-WGP-NEXT:s_wait_loadcnt 0x3f

[llvm-branch-commits] [llvm] [AMDGPU] always emit a soft wait even if it is trivially ~0 (PR #147257)

2025-07-07 Thread Pierre van Houtryve via llvm-branch-commits
@@ -669,6 +679,7 @@ define amdgpu_kernel void @global_volatile_store_1( ; GFX12-WGP-NEXT:s_wait_kmcnt 0x0 ; GFX12-WGP-NEXT:s_wait_storecnt 0x0 ; GFX12-WGP-NEXT:global_store_b32 v0, v1, s[0:1] scope:SCOPE_SYS +; GFX12-WGP-NEXT:s_wait_loadcnt 0x3f

[llvm-branch-commits] [llvm] [AMDGPU][SDAG] Test ISD::PTRADD handling in various special cases (PR #145329)

2025-07-14 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh approved this pull request. https://github.com/llvm/llvm-project/pull/145329 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [llvm] [AMDGPU] Move S_BFE lowering into RegBankCombiner (PR #141589)

2025-07-14 Thread Pierre van Houtryve via llvm-branch-commits
Pierre-vh wrote: ping https://github.com/llvm/llvm-project/pull/141589 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [llvm] [AMDGPU][SDAG] Handle ISD::PTRADD in VOP3 patterns (PR #143881)

2025-07-14 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh approved this pull request. https://github.com/llvm/llvm-project/pull/143881 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [llvm] [AMDGPU][SDAG] Enable ISD::PTRADD for 64-bit AS by default (PR #146076)

2025-07-14 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh approved this pull request. Test changes LGTM so I'm already giving this +1, though once the stack is approved wait a day or two to see if any other reviewer has concerns about default enabling this https://github.com/llvm/llvm-project/pull/146076 _

[llvm-branch-commits] [llvm] [AMDGPU] Use reverse iteration in CodeGenPrepare (PR #145484)

2025-06-25 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh created https://github.com/llvm/llvm-project/pull/145484 In order to make this easier, I also removed all "removeFromParent" calls from the visitors, instead adding instructions to a set of instructions to delete once the function has been visited. This avoids crash

[llvm-branch-commits] [llvm] [AMDGPU] Add tests for workgroup/workitem intrinsic optimizations (PR #146053)

2025-06-30 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/146053 >From f137136b2f527aaf1b2f2847e821085aabfc299e Mon Sep 17 00:00:00 2001 From: pvanhout Date: Thu, 26 Jun 2025 13:08:31 +0200 Subject: [PATCH 1/2] [AMDGPU] Add tests for workgroup/workitem intrinsic optimizati

[llvm-branch-commits] [llvm] [GISel] Combine compare of bitfield extracts or'd together. (PR #146055)

2025-06-30 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/146055 >From d97992ef24abae69878fd1e49270bf0f7372ca39 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Fri, 27 Jun 2025 12:04:53 +0200 Subject: [PATCH] [GISel] Combine compare of bitfield extracts or'd together. Equiva

[llvm-branch-commits] [llvm] [AMDGPU] Add tests for workgroup/workitem intrinsic optimizations (PR #146053)

2025-06-30 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/146053 >From f137136b2f527aaf1b2f2847e821085aabfc299e Mon Sep 17 00:00:00 2001 From: pvanhout Date: Thu, 26 Jun 2025 13:08:31 +0200 Subject: [PATCH 1/2] [AMDGPU] Add tests for workgroup/workitem intrinsic optimizati

[llvm-branch-commits] [llvm] [DAG] Fold (setcc ((x | x >> c0 | ...) & mask)) sequences (PR #146054)

2025-06-30 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/146054 >From 26615132899d40b8d245fd98d093ef8c26cdc3e1 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Thu, 26 Jun 2025 13:31:37 +0200 Subject: [PATCH 1/2] [DAG] Fold (setcc ((x | x >> c0 | ...) & mask)) sequences Fold

[llvm-branch-commits] [llvm] [GISel] Combine compare of bitfield extracts or'd together. (PR #146055)

2025-06-30 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/146055 >From d97992ef24abae69878fd1e49270bf0f7372ca39 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Fri, 27 Jun 2025 12:04:53 +0200 Subject: [PATCH] [GISel] Combine compare of bitfield extracts or'd together. Equiva

[llvm-branch-commits] [llvm] [DAG] Fold (setcc ((x | x >> c0 | ...) & mask)) sequences (PR #146054)

2025-06-30 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/146054 >From 26615132899d40b8d245fd98d093ef8c26cdc3e1 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Thu, 26 Jun 2025 13:31:37 +0200 Subject: [PATCH 1/2] [DAG] Fold (setcc ((x | x >> c0 | ...) & mask)) sequences Fold

[llvm-branch-commits] [llvm] [AMDGPU] Use reverse iteration in CodeGenPrepare (PR #145484)

2025-06-30 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/145484 >From b031681978e2b356c2ae8e65d6e08515c0044ac1 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Tue, 24 Jun 2025 11:35:58 +0200 Subject: [PATCH 1/2] [AMDGPU] Use reverse iteration in CodeGenPrepare In order to m

[llvm-branch-commits] [llvm] [DAG] Fold (setcc ((x | x >> c0 | ...) & mask)) sequences (PR #146054)

2025-06-30 Thread Pierre van Houtryve via llvm-branch-commits
Pierre-vh wrote: > Why DAG and not InstCombine for this? The intrinsics we want to optimize with this aren't lowered yet at IC https://github.com/llvm/llvm-project/pull/146054 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org htt

[llvm-branch-commits] [llvm] [DAG] Fold (setcc ((x | x >> c0 | ...) & mask)) sequences (PR #146054)

2025-06-30 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/146054 >From 17ac90ad1ee167f35321e01625a207f2b94ff523 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Thu, 26 Jun 2025 13:31:37 +0200 Subject: [PATCH 1/2] [DAG] Fold (setcc ((x | x >> c0 | ...) & mask)) sequences Fold

[llvm-branch-commits] [llvm] [GISel] Combine compare of bitfield extracts or'd together. (PR #146055)

2025-07-01 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/146055 >From c8532366fcb92ff3b3e7402118224f137e2c8980 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Fri, 27 Jun 2025 12:04:53 +0200 Subject: [PATCH] [GISel] Combine compare of bitfield extracts or'd together. Equiva

[llvm-branch-commits] [llvm] [AMDGPU] Move S_BFE lowering into RegBankCombiner (PR #141589)

2025-07-01 Thread Pierre van Houtryve via llvm-branch-commits
@@ -392,6 +394,55 @@ void AMDGPURegBankCombinerImpl::applyCanonicalizeZextShiftAmt( MI.eraseFromParent(); } +bool AMDGPURegBankCombinerImpl::lowerUniformBFX(MachineInstr &MI) const { + assert(MI.getOpcode() == TargetOpcode::G_UBFX || + MI.getOpcode() == TargetOpcod

[llvm-branch-commits] [llvm] [NFC][AMDGPU] Refactor handling of `amdgpu-synchronize-as` MD on fences (PR #148630)

2025-07-14 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh created https://github.com/llvm/llvm-project/pull/148630 Directly plug it into the MMO instead, which is much cleaner. >From d8aad39519972bf64d868e9f54ac45862b92686a Mon Sep 17 00:00:00 2001 From: pvanhout Date: Mon, 14 Jul 2025 15:02:31 +0200 Subject: [PATCH] [NFC

[llvm-branch-commits] [llvm] [NFC][AMDGPU] Refactor handling of `amdgpu-synchronize-as` MD on fences (PR #148630)

2025-07-14 Thread Pierre van Houtryve via llvm-branch-commits
Pierre-vh wrote: > [!WARNING] > This pull request is not mergeable via GitHub because a downstack PR is > open. Once all requirements are satisfied, merge this PR as a stack href="https://app.graphite.dev/github/pr/llvm/llvm-project/148630?utm_source=stack-comment-downstack-mergeability-warning

[llvm-branch-commits] [llvm] [NFC][AMDGPU] Refactor handling of `amdgpu-synchronize-as` MD on fences (PR #148630)

2025-07-14 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh ready_for_review https://github.com/llvm/llvm-project/pull/148630 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [llvm] AMDGPU/GlobalISel: Improve readanylane combines in regbanklegalize (PR #145911)

2025-07-16 Thread Pierre van Houtryve via llvm-branch-commits
@@ -115,126 +117,233 @@ class AMDGPURegBankLegalizeCombiner { VgprRB(&RBI.getRegBank(AMDGPU::VGPRRegBankID)), VccRB(&RBI.getRegBank(AMDGPU::VCCRegBankID)) {}; - bool isLaneMask(Register Reg) { -const RegisterBank *RB = MRI.getRegBankOrNull(Reg); -if (R

[llvm-branch-commits] [llvm] AMDGPU/GlobalISel: Improve readanylane combines in regbanklegalize (PR #145911)

2025-07-16 Thread Pierre van Houtryve via llvm-branch-commits
@@ -115,126 +117,233 @@ class AMDGPURegBankLegalizeCombiner { VgprRB(&RBI.getRegBank(AMDGPU::VGPRRegBankID)), VccRB(&RBI.getRegBank(AMDGPU::VCCRegBankID)) {}; - bool isLaneMask(Register Reg) { -const RegisterBank *RB = MRI.getRegBankOrNull(Reg); -if (R

[llvm-branch-commits] [llvm] [AMDGPU] Add KnownBits simplification combines to RegBankCombiner (PR #141591)

2025-07-16 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/141591 >From 8e3be3121fdc7f06331f779224ce6f013cba8929 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Tue, 27 May 2025 12:29:02 +0200 Subject: [PATCH 1/2] [AMDGPU] Add KnownBits simplification combines to RegBankCombi

[llvm-branch-commits] [llvm] [AMDGPU] Add tests for workgroup/workitem intrinsic optimizations (PR #146053)

2025-06-27 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh created https://github.com/llvm/llvm-project/pull/146053 None >From 3f62ab3beb30abbf8c8c32dd79c0133f7ca122e0 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Thu, 26 Jun 2025 13:08:31 +0200 Subject: [PATCH] [AMDGPU] Add tests for workgroup/workitem intrinsic optimiza

[llvm-branch-commits] [llvm] [GISel] Combine compare of bitfield extracts or'd together. (PR #146055)

2025-06-27 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh created https://github.com/llvm/llvm-project/pull/146055 Equivalent of the previous DAG patch for GISel. The shifts are BFXs in GISel, so the canonical form of the entire expression is different than in the DAG. The mask is not at the root of the expression, it remai

[llvm-branch-commits] [llvm] [DAG] Fold (setcc ((x | x >> c0 | ...) & mask)) sequences (PR #146054)

2025-06-27 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh created https://github.com/llvm/llvm-project/pull/146054 Fold sequences where we extract a bunch of contiguous bits from a value, merge them into the low bit and then check if the low bits are zero or not. It seems like a strange sequence at first but it's an idiom

[llvm-branch-commits] [llvm] [AMDGPU] Add tests for workgroup/workitem intrinsic optimizations (PR #146053)

2025-06-27 Thread Pierre van Houtryve via llvm-branch-commits
Pierre-vh wrote: > [!WARNING] > This pull request is not mergeable via GitHub because a downstack PR is > open. Once all requirements are satisfied, merge this PR as a stack href="https://app.graphite.dev/github/pr/llvm/llvm-project/146053?utm_source=stack-comment-downstack-mergeability-warning

[llvm-branch-commits] [llvm] [DAG] Fold (setcc ((x | x >> c0 | ...) & mask)) sequences (PR #146054)

2025-06-27 Thread Pierre van Houtryve via llvm-branch-commits
Pierre-vh wrote: > [!WARNING] > This pull request is not mergeable via GitHub because a downstack PR is > open. Once all requirements are satisfied, merge this PR as a stack href="https://app.graphite.dev/github/pr/llvm/llvm-project/146054?utm_source=stack-comment-downstack-mergeability-warning

[llvm-branch-commits] [llvm] [GISel] Combine compare of bitfield extracts or'd together. (PR #146055)

2025-06-27 Thread Pierre van Houtryve via llvm-branch-commits
Pierre-vh wrote: > [!WARNING] > This pull request is not mergeable via GitHub because a downstack PR is > open. Once all requirements are satisfied, merge this PR as a stack href="https://app.graphite.dev/github/pr/llvm/llvm-project/146055?utm_source=stack-comment-downstack-mergeability-warning

[llvm-branch-commits] [llvm] [AMDGPU] Add tests for workgroup/workitem intrinsic optimizations (PR #146053)

2025-06-27 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh ready_for_review https://github.com/llvm/llvm-project/pull/146053 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [llvm] [DAG] Fold (setcc ((x | x >> c0 | ...) & mask)) sequences (PR #146054)

2025-06-27 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh ready_for_review https://github.com/llvm/llvm-project/pull/146054 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [llvm] [GISel] Combine compare of bitfield extracts or'd together. (PR #146055)

2025-06-27 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh ready_for_review https://github.com/llvm/llvm-project/pull/146055 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [llvm] [DAG] Fold (setcc ((x | x >> c0 | ...) & mask)) sequences (PR #146054)

2025-06-27 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh edited https://github.com/llvm/llvm-project/pull/146054 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [llvm] [AMDGPU] Add KnownBits simplification combines to RegBankCombiner (PR #141591)

2025-07-01 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/141591 >From 129c674e3c2ede4b1941b8dd40d55cbd4c23d722 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Tue, 27 May 2025 12:29:02 +0200 Subject: [PATCH 1/2] [AMDGPU] Add KnownBits simplification combines to RegBankCombi

[llvm-branch-commits] [llvm] [AMDGPU] Move S_BFE lowering into RegBankCombiner (PR #141589)

2025-07-01 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/141589 >From d906a978145aabae8b2d1a029477d5a08272ae8c Mon Sep 17 00:00:00 2001 From: pvanhout Date: Tue, 27 May 2025 11:16:16 +0200 Subject: [PATCH 1/3] [AMDGPU] Move S_BFE lowering into RegBankCombiner --- llvm/li

[llvm-branch-commits] [llvm] [AMDGPU] Add KnownBits simplification combines to RegBankCombiner (PR #141591)

2025-07-01 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/141591 >From 129c674e3c2ede4b1941b8dd40d55cbd4c23d722 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Tue, 27 May 2025 12:29:02 +0200 Subject: [PATCH 1/2] [AMDGPU] Add KnownBits simplification combines to RegBankCombi

[llvm-branch-commits] [llvm] [AMDGPU] Move S_BFE lowering into RegBankCombiner (PR #141589)

2025-07-01 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/141589 >From d906a978145aabae8b2d1a029477d5a08272ae8c Mon Sep 17 00:00:00 2001 From: pvanhout Date: Tue, 27 May 2025 11:16:16 +0200 Subject: [PATCH 1/3] [AMDGPU] Move S_BFE lowering into RegBankCombiner --- llvm/li

[llvm-branch-commits] [llvm] [AMDGPU] Add tests for workgroup/workitem intrinsic optimizations (PR #146053)

2025-06-30 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/146053 >From 3f62ab3beb30abbf8c8c32dd79c0133f7ca122e0 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Thu, 26 Jun 2025 13:08:31 +0200 Subject: [PATCH 1/2] [AMDGPU] Add tests for workgroup/workitem intrinsic optimizati

[llvm-branch-commits] [llvm] [GISel] Combine compare of bitfield extracts or'd together. (PR #146055)

2025-06-30 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/146055 >From da05cc2d920917f0cb6f171b0d9e2e535836ca3c Mon Sep 17 00:00:00 2001 From: pvanhout Date: Fri, 27 Jun 2025 12:04:53 +0200 Subject: [PATCH] [GISel] Combine compare of bitfield extracts or'd together. Equiva

[llvm-branch-commits] [llvm] [GISel] Combine compare of bitfield extracts or'd together. (PR #146055)

2025-06-30 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/146055 >From da05cc2d920917f0cb6f171b0d9e2e535836ca3c Mon Sep 17 00:00:00 2001 From: pvanhout Date: Fri, 27 Jun 2025 12:04:53 +0200 Subject: [PATCH] [GISel] Combine compare of bitfield extracts or'd together. Equiva

[llvm-branch-commits] [llvm] [DAG] Fold (setcc ((x | x >> c0 | ...) & mask)) sequences (PR #146054)

2025-06-30 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/146054 >From 17ac90ad1ee167f35321e01625a207f2b94ff523 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Thu, 26 Jun 2025 13:31:37 +0200 Subject: [PATCH 1/2] [DAG] Fold (setcc ((x | x >> c0 | ...) & mask)) sequences Fold

[llvm-branch-commits] [llvm] [DAG] Fold (setcc ((x | x >> c0 | ...) & mask)) sequences (PR #146054)

2025-06-30 Thread Pierre van Houtryve via llvm-branch-commits
@@ -28909,13 +28909,97 @@ SDValue DAGCombiner::SimplifySelectCC(const SDLoc &DL, SDValue N0, SDValue N1, return SDValue(); } +static SDValue matchMergedBFX(SDValue Root, SelectionDAG &DAG, + const TargetLowering &TLI) { + // Match a pattern suc

[llvm-branch-commits] [llvm] [DAG] Fold (setcc ((x | x >> c0 | ...) & mask)) sequences (PR #146054)

2025-06-30 Thread Pierre van Houtryve via llvm-branch-commits
@@ -28909,13 +28909,97 @@ SDValue DAGCombiner::SimplifySelectCC(const SDLoc &DL, SDValue N0, SDValue N1, return SDValue(); } +static SDValue matchMergedBFX(SDValue Root, SelectionDAG &DAG, + const TargetLowering &TLI) { + // Match a pattern suc

[llvm-branch-commits] [llvm] [AMDGPU] Move S_BFE lowering into RegBankCombiner (PR #141589)

2025-06-30 Thread Pierre van Houtryve via llvm-branch-commits
Pierre-vh wrote: ping https://github.com/llvm/llvm-project/pull/141589 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [llvm] [GISel] Combine compare of bitfield extracts or'd together. (PR #146055)

2025-07-01 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/146055 >From c8532366fcb92ff3b3e7402118224f137e2c8980 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Fri, 27 Jun 2025 12:04:53 +0200 Subject: [PATCH] [GISel] Combine compare of bitfield extracts or'd together. Equiva

[llvm-branch-commits] [llvm] [DAG] Fold (setcc ((x | x >> c0 | ...) & mask)) sequences (PR #146054)

2025-07-01 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh updated https://github.com/llvm/llvm-project/pull/146054 >From 26615132899d40b8d245fd98d093ef8c26cdc3e1 Mon Sep 17 00:00:00 2001 From: pvanhout Date: Thu, 26 Jun 2025 13:31:37 +0200 Subject: [PATCH 1/3] [DAG] Fold (setcc ((x | x >> c0 | ...) & mask)) sequences Fold

[llvm-branch-commits] [llvm] [DAG] Fold (setcc ((x | x >> c0 | ...) & mask)) sequences (PR #146054)

2025-07-01 Thread Pierre van Houtryve via llvm-branch-commits
Pierre-vh wrote: > Does this also handle the case where _all_ of the values ORed together are > shifted, like `(setcc ((x >> c0 | x >> c1 | ...) & mask))` ? Yes, i added a test for it https://github.com/llvm/llvm-project/pull/146054 ___ llvm-branch-c

[llvm-branch-commits] [llvm] AMDGPU/GlobalISel: Improve readanylane combines in regbanklegalize (PR #145911)

2025-07-23 Thread Pierre van Houtryve via llvm-branch-commits
https://github.com/Pierre-vh approved this pull request. https://github.com/llvm/llvm-project/pull/145911 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

<    1   2   3