[Lldb-commits] [libcxxabi] [flang] [clang-tools-extra] [libcxx] [lldb] [clang] [llvm] [compiler-rt] [lld] [AMDGPU] GFX12: select @llvm.prefetch intrinsic (PR #74576)

2023-12-07 Thread Mariusz Sikora via lldb-commits
https://github.com/mariusz-sikora-at-amd updated https://github.com/llvm/llvm-project/pull/74576 From 23759746b66c33028ad2340b1e98067ebf1f8074 Mon Sep 17 00:00:00 2001 From: Stanislav Mekhanoshin Date: Tue, 28 Jun 2022 15:24:24 -0700 Subject: [PATCH] [AMDGPU] GFX12: select @llvm.prefetch intri

[Lldb-commits] [libcxxabi] [flang] [lld] [llvm] [compiler-rt] [lldb] [clang] [clang-tools-extra] [libcxx] [AMDGPU] GFX12: select @llvm.prefetch intrinsic (PR #74576)

2023-12-07 Thread Mariusz Sikora via lldb-commits
@@ -959,6 +967,32 @@ def : GCNPat < } } // let OtherPredicates = [HasShaderCyclesRegister] +def SIMM24bitPtr : ImmLeaf (Imm);}] +>; + +multiclass SMPrefetchPat { + def : GCNPat < +(smrd_prefetch (SMRDImm i64:$sbase, i32:$offset), timm, timm, (i32 cache_type)), +(!cas

[Lldb-commits] [libcxx] [flang] [clang-tools-extra] [libcxxabi] [compiler-rt] [clang] [lldb] [lld] [llvm] [AMDGPU] GFX12: select @llvm.prefetch intrinsic (PR #74576)

2023-12-07 Thread Mariusz Sikora via lldb-commits
https://github.com/mariusz-sikora-at-amd updated https://github.com/llvm/llvm-project/pull/74576 From 23759746b66c33028ad2340b1e98067ebf1f8074 Mon Sep 17 00:00:00 2001 From: Stanislav Mekhanoshin Date: Tue, 28 Jun 2022 15:24:24 -0700 Subject: [PATCH 1/2] [AMDGPU] GFX12: select @llvm.prefetch i

[Lldb-commits] [clang-tools-extra] [mlir] [flang] [lld] [libc] [llvm] [compiler-rt] [libcxx] [lldb] [clang] [AMDGPU] GFX12: Add Split Workgroup Barrier (PR #74836)

2023-12-13 Thread Mariusz Sikora via lldb-commits
https://github.com/mariusz-sikora-at-amd closed https://github.com/llvm/llvm-project/pull/74836 ___ lldb-commits mailing list lldb-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/lldb-commits

[Lldb-commits] [llvm] [libcxx] [clang] [clang-tools-extra] [lldb] [libc] [flang] [mlir] [compiler-rt] [lld] [AMDGPU] GFX12: Add Split Workgroup Barrier (PR #74836)

2023-12-14 Thread Mariusz Sikora via lldb-commits
@@ -684,6 +684,51 @@ s_rndne_f16 s5, 0xfe0b s_rndne_f16 s5, 0x3456 // GFX12: encoding: [0xff,0x6e,0x85,0xbe,0x56,0x34,0x00,0x00] +s_barrier_signal -2 mariusz-sikora-at-amd wrote: Thanks! https://github.com/llvm/llvm-project/pull/74836

[Lldb-commits] [clang] [llvm] [clang-tools-extra] [libc] [compiler-rt] [libcxx] [lldb] [mlir] [flang] [lld] [AMDGPU] GFX12: Add Split Workgroup Barrier (PR #74836)

2023-12-15 Thread Mariusz Sikora via lldb-commits
@@ -684,6 +684,51 @@ s_rndne_f16 s5, 0xfe0b s_rndne_f16 s5, 0x3456 // GFX12: encoding: [0xff,0x6e,0x85,0xbe,0x56,0x34,0x00,0x00] +s_barrier_signal -2 mariusz-sikora-at-amd wrote: Patch: https://github.com/llvm/llvm-project/pull/75575 https://github.com/llvm/

[Lldb-commits] [flang] [clang] [lldb] [libcxxabi] [lld] [compiler-rt] [clang-tools-extra] [llvm] [libcxx] [AMDGPU] GFX12: select @llvm.prefetch intrinsic (PR #74576)

2023-12-15 Thread Mariusz Sikora via lldb-commits
https://github.com/mariusz-sikora-at-amd updated https://github.com/llvm/llvm-project/pull/74576 From 23759746b66c33028ad2340b1e98067ebf1f8074 Mon Sep 17 00:00:00 2001 From: Stanislav Mekhanoshin Date: Tue, 28 Jun 2022 15:24:24 -0700 Subject: [PATCH 1/4] [AMDGPU] GFX12: select @llvm.prefetch i

[Lldb-commits] [llvm] [lld] [clang] [compiler-rt] [libcxxabi] [lldb] [clang-tools-extra] [flang] [libcxx] [AMDGPU] GFX12: select @llvm.prefetch intrinsic (PR #74576)

2023-12-15 Thread Mariusz Sikora via lldb-commits
https://github.com/mariusz-sikora-at-amd closed https://github.com/llvm/llvm-project/pull/74576

[Lldb-commits] [llvm] [lldb] [libc] [clang] [flang] [clang-tools-extra] [libcxx] [mlir] [compiler-rt] [lld] GFX12: Add LoopDataPrefetchPass (PR #75625)

2023-12-18 Thread Mariusz Sikora via lldb-commits
https://github.com/mariusz-sikora-at-amd updated https://github.com/llvm/llvm-project/pull/75625 From de5303eb8a9e061dbd365922f85cad02bca5ec26 Mon Sep 17 00:00:00 2001 From: Stanislav Mekhanoshin Date: Tue, 5 Jul 2022 11:41:29 -0700 Subject: [PATCH 1/3] GFX12: Add LoopDataPrefetchPass It is c

[Lldb-commits] [llvm] [lldb] [libc] [clang] [flang] [clang-tools-extra] [libcxx] [mlir] [compiler-rt] [lld] GFX12: Add LoopDataPrefetchPass (PR #75625)

2023-12-18 Thread Mariusz Sikora via lldb-commits
https://github.com/mariusz-sikora-at-amd updated https://github.com/llvm/llvm-project/pull/75625 From de5303eb8a9e061dbd365922f85cad02bca5ec26 Mon Sep 17 00:00:00 2001 From: Stanislav Mekhanoshin Date: Tue, 5 Jul 2022 11:41:29 -0700 Subject: [PATCH 1/4] GFX12: Add LoopDataPrefetchPass It is c

[Lldb-commits] [lldb] [clang] [compiler-rt] [libcxx] [flang] [lld] [clang-tools-extra] [mlir] [llvm] [libc] GFX12: Add LoopDataPrefetchPass (PR #75625)

2023-12-18 Thread Mariusz Sikora via lldb-commits
https://github.com/mariusz-sikora-at-amd closed https://github.com/llvm/llvm-project/pull/75625

[Lldb-commits] [lldb] [clang] [compiler-rt] [flang] [lld] [llvm] [libcxx] [libc] [clang-tools-extra] AMDGPU/GFX12: Add new dot4 fp8/bf8 instructions (PR #77892)

2024-01-18 Thread Mariusz Sikora via lldb-commits
https://github.com/mariusz-sikora-at-amd updated https://github.com/llvm/llvm-project/pull/77892 From 628a3d2b42cdcbd903e0830ab7d631ea7dc422b9 Mon Sep 17 00:00:00 2001 From: Petar Avramovic Date: Wed, 10 Jan 2024 12:17:58 +0100 Subject: [PATCH 1/2] AMDGPU/GFX12: Add new dot4 fp8/bf8 instructio

[Lldb-commits] [lldb] [clang] [compiler-rt] [flang] [lld] [llvm] [libcxx] [libc] [clang-tools-extra] AMDGPU/GFX12: Add new dot4 fp8/bf8 instructions (PR #77892)

2024-01-18 Thread Mariusz Sikora via lldb-commits
mariusz-sikora-at-amd wrote: Rebase to run tests https://github.com/llvm/llvm-project/pull/77892

[Lldb-commits] [clang] [libc] [lld] [compiler-rt] [libcxx] [clang-tools-extra] [flang] [lldb] [llvm] AMDGPU/GFX12: Add new dot4 fp8/bf8 instructions (PR #77892)

2024-01-18 Thread Mariusz Sikora via lldb-commits
https://github.com/mariusz-sikora-at-amd closed https://github.com/llvm/llvm-project/pull/77892

[Lldb-commits] [clang-tools-extra] [compiler-rt] [lldb] [clang] [libcxx] [llvm] [flang] [lld] [libc] [AMDGPU][GFX12] VOP encoding and codegen - add support for v_cvt fp8/… (PR #78414)

2024-01-19 Thread Mariusz Sikora via lldb-commits
mariusz-sikora-at-amd wrote: > Can you add a GFX12 RUN line to > clang/test/CodeGenOpenCL/builtins-amdgcn-fp8.cl? That will probably require > adding "fp8-conversion-insts" to the GFX12 part of TargetParser.cpp. You can > do this in a separate patch if you want. Done https://github.com/llvm/

[Lldb-commits] [libc] [flang] [compiler-rt] [llvm] [clang-tools-extra] [lldb] [clang] [libcxx] [lld] [AMDGPU][GFX12] VOP encoding and codegen - add support for v_cvt fp8/… (PR #78414)

2024-01-22 Thread Mariusz Sikora via lldb-commits
mariusz-sikora-at-amd wrote: > Why is there so much special casing in the assembler/disassembler? I'm not the original author of this change, but from what I understand it is a workaround to handle VOP3 instructions which have a single source but require the use of two bits from OPSEL. `V_C

[Lldb-commits] [clang] [lldb] [flang] [lld] [libc] [libcxx] [compiler-rt] [llvm] [clang-tools-extra] [AMDGPU][GFX12] VOP encoding and codegen - add support for v_cvt fp8/… (PR #78414)

2024-01-22 Thread Mariusz Sikora via lldb-commits
@@ -626,11 +629,82 @@ class Cvt_PK_F32_F8_Pat; -foreach Index = [0, -1] in { - def : Cvt_PK_F32_F8_Pat; - def : Cvt_PK_F32_F8_Pat; +let SubtargetPredicate = isGFX9Only in { + foreach Index = [0, -1] in { +def : Cvt_PK_F32_F8_Pat; +def : Cvt_PK_F32_F8_Pat; + } +} + +

[Lldb-commits] [clang] [lldb] [flang] [lld] [libc] [libcxx] [compiler-rt] [llvm] [clang-tools-extra] [AMDGPU][GFX12] VOP encoding and codegen - add support for v_cvt fp8/… (PR #78414)

2024-01-22 Thread Mariusz Sikora via lldb-commits
https://github.com/mariusz-sikora-at-amd edited https://github.com/llvm/llvm-project/pull/78414

[Lldb-commits] [flang] [lld] [clang-tools-extra] [compiler-rt] [llvm] [libcxx] [lldb] [libc] [clang] [AMDGPU][GFX12] VOP encoding and codegen - add support for v_cvt fp8/… (PR #78414)

2024-01-22 Thread Mariusz Sikora via lldb-commits
mariusz-sikora-at-amd wrote: > > Correct, some of these instructions use opsel[1] which in LLVM is stored in > > src1_modifiers so a dummy src1 is used. > > Why can't we just use `SRCMODS.OP_SEL_1` with src0? When referring to `SRCMODS.OP_SEL_1` you are referring to `src1_modifier` (second bi
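The quoted point, that opsel[1] lives in src1_modifiers and so a dummy src1 is needed, can be illustrated with a minimal sketch. This is not code from the patch: `kOpSelBit` and `scatterOpSel` are hypothetical names, and the bit position is an assumption chosen only for illustration. It shows bit J of an op_sel mask landing in the modifier word of source J, which is why an instruction with a single real source still needs a second modifier slot if it uses opsel[1].

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for a per-source "op_sel" modifier bit. The value is
// an assumption made for this sketch, not taken from the patch.
constexpr std::uint32_t kOpSelBit = 1u << 2;

// Scatter an op_sel mask into per-source modifier words:
// bit J of opSel sets kOpSelBit in mods[J].
std::array<std::uint32_t, 3> scatterOpSel(std::uint32_t opSel) {
  std::array<std::uint32_t, 3> mods{};
  for (unsigned j = 0; j < mods.size(); ++j)
    if (opSel & (1u << j))
      mods[j] |= kOpSelBit;
  return mods;
}
```

Under these assumptions, `scatterOpSel(0b10)` leaves the src0 word untouched and sets the bit only in the src1 word, so the src1 slot has to exist even when the instruction has no real second source.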

[Lldb-commits] [lldb] [flang] [clang-tools-extra] [libcxx] [clang] [lld] [llvm] [compiler-rt] [libc] [AMDGPU][GFX12] VOP encoding and codegen - add support for v_cvt fp8/… (PR #78414)

2024-01-24 Thread Mariusz Sikora via lldb-commits
@@ -8770,6 +8781,22 @@ void AMDGPUAsmParser::cvtVOP3DPP(MCInst &Inst, const OperandVector &Operands, } } +int VdstInIdx = AMDGPU::getNamedOper

[Lldb-commits] [clang] [libc] [compiler-rt] [libcxx] [lldb] [pstl] [flang] [clang-tools-extra] [mlir] [openmp] [lld] [llvm] [AMDGPU][GFX12] VOP encoding and codegen - add support for v_cvt fp8/… (PR #

2024-01-24 Thread Mariusz Sikora via lldb-commits
https://github.com/mariusz-sikora-at-amd closed https://github.com/llvm/llvm-project/pull/78414