Re: [PATCH v4] AArch64: Add LUTI ACLE for SVE2

2025-01-17 Thread Richard Sandiford
Saurabh Jha writes:
> On 1/16/2025 8:44 AM, Richard Sandiford wrote:
>> Thanks for the update. Mostly LGTM, but some comments below:
>>
>> writes:
>>> diff --git a/gcc/config/aarch64/aarch64-sve2.md b/gcc/config/aarch64/aarch64-sve2.md
>>> index f8cfe08f4c0..0a1dc314f94 100644
>>> --- a/gc

Re: [PATCH v4] AArch64: Add LUTI ACLE for SVE2

2025-01-17 Thread Saurabh Jha
On 1/16/2025 8:44 AM, Richard Sandiford wrote:
Thanks for the update. Mostly LGTM, but some comments below:
writes:
diff --git a/gcc/config/aarch64/aarch64-sve2.md b/gcc/config/aarch64/aarch64-sve2.md
index f8cfe08f4c0..0a1dc314f94 100644
--- a/gcc/config/aarch64/aarch64-sve2.md
+++ b/gcc

Re: [PATCH v4] AArch64: Add LUTI ACLE for SVE2

2025-01-16 Thread Richard Sandiford
Thanks for the update. Mostly LGTM, but some comments below:

writes:
> diff --git a/gcc/config/aarch64/aarch64-sve2.md b/gcc/config/aarch64/aarch64-sve2.md
> index f8cfe08f4c0..0a1dc314f94 100644
> --- a/gcc/config/aarch64/aarch64-sve2.md
> +++ b/gcc/config/aarch64/aarch64-sve2.md
> @@ -133,6

[PATCH v4] AArch64: Add LUTI ACLE for SVE2

2025-01-15 Thread saurabh.jha
This patch introduces support for LUTI2/LUTI4 ACLE for SVE2. LUTI instructions are used for efficient table lookups with 2-bit or 4-bit indices. LUTI2 reads indexed 8-bit or 16-bit elements from the low 128 bits of the table vector using packed 2-bit indices, while LUTI4 can read from the low 128
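
For readers unfamiliar with the idea of a table lookup driven by packed 2-bit indices, the following is a rough scalar sketch in plain C. It is not the patch's implementation and does not reproduce the exact architectural behaviour of LUTI2: the function name lut2_lookup_u8 is hypothetical, the lowest-bits-first index ordering is an assumption made for illustration, and vector length, index-segment selection, and the 16-bit element forms are all ignored. The only fact it relies on is that a 2-bit index can select one of four table entries.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Illustrative scalar model of a packed 2-bit-index table lookup.
   table:  four 8-bit entries, since a 2-bit index selects one of four.
   packed: index bytes, each holding four 2-bit indices, lowest bits
           first (this ordering is an assumption of the sketch).
   out:    receives the n looked-up elements.  */
static void
lut2_lookup_u8 (const uint8_t table[4], const uint8_t *packed,
                uint8_t *out, size_t n)
{
  assert (table && packed && out);
  for (size_t i = 0; i < n; ++i)
    {
      uint8_t byte = packed[i / 4];          /* byte holding index i */
      unsigned shift = (unsigned) (i % 4) * 2; /* position within the byte */
      unsigned idx = (byte >> shift) & 0x3;  /* extract the 2-bit index */
      out[i] = table[idx];
    }
}
```

With a table of {10, 20, 30, 40} and a single index byte 0xE4 (binary 11 10 01 00), the four unpacked indices are 0, 1, 2, 3 under the assumed ordering, so the output is the table itself.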