[PATCH 0/1] AArch64: LUTI2/LUTI4 ACLE for SVE2

2024-07-10 Thread vladimir.miloserdov
From: Vladimir Miloserdov Hi All, This patch introduces support for LUTI2/LUTI4 ACLE for SVE2. LUTI instructions are used for efficient table lookups with 2-bit or 4-bit indices. LUTI2 reads indexed 8-bit or 16-bit elements from the low 128 bits of the table vector using packed 2-bit indices, w

[PATCH 1/1] AArch64: Add LUTI ACLE for SVE2

2024-07-10 Thread vladimir.miloserdov
This patch introduces support for LUTI2/LUTI4 ACLE for SVE2. LUTI instructions are used for efficient table lookups with 2-bit or 4-bit indices. LUTI2 reads indexed 8-bit or 16-bit elements from the low 128 bits of the table vector using packed 2-bit indices, while LUTI4 can read from the low 128
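
As a rough illustration of the packed 2-bit index lookup described above, here is a simplified scalar sketch in C. It is not the architectural definition of LUTI2 and not code from the patch. The helper name `lut2_lookup_u8`, the four-entry 8-bit table, and the low-bits-first index packing are all assumptions made for this example.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (not from the patch): scalar model of a table
 * lookup driven by packed 2-bit indices. A 2-bit index can select one
 * of four table entries. Assumption: indices are packed four per byte,
 * lowest two bits first. */
static void lut2_lookup_u8(const uint8_t table[4],
                           const uint8_t *packed_idx,
                           uint8_t *out, size_t n)
{
    for (size_t i = 0; i < n; i++) {
        /* Locate the 2-bit field for element i inside its byte. */
        unsigned shift = 2u * (unsigned)(i % 4);
        unsigned idx = (packed_idx[i / 4] >> shift) & 0x3u;
        out[i] = table[idx];
    }
}
```

With the table `{10, 20, 30, 40}` and a single packed byte `0xE4` (binary `11 10 01 00`), this model reads the indices 0, 1, 2, 3 in order. The real instruction works on vector registers and has additional operands that this sketch leaves out.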

[PATCH v2] AArch64: Add LUTI ACLE for SVE2

2024-07-23 Thread vladimir.miloserdov
Hi All, Changes since V1: add missing MD constraints, rename intrinsics, remove SME2 flag for LUT feature. Bootstrapped and regtested on aarch64-none-linux-gnu with no issues. This depends on the "Extend aarch64_feature_flags to 128 bits" work, which is soon to be submitted upstream as we ran out of 64-b

[PATCH 2/2] aarch64: Add AdvSIMD LUT extension and vluti2{q}_lane{q} intrinsics

2024-11-05 Thread vladimir.miloserdov
The AArch64 FEAT_LUT extension is optional from Armv9.2-a and mandatory from Armv9.5-a. This extension introduces instructions for lookup table reads with 2-bit indices. This patch adds AdvSIMD LUT intrinsics for LUTI2, supporting table lookup with 2-bit packed indices. The following intrinsics ar

[PATCH 0/2] aarch64: Refactor infra for advsimd intrinsics, add AdvSIMD LUTI2

2024-11-05 Thread vladimir.miloserdov
From: Vladimir Miloserdov Hi All, This is a patch series that refactors infrastructure for advsimd pragma intrinsics and adds LUTI2 AdvSIMD intrinsics. Bootstrapped and regtested on aarch64-none-linux-gnu with no issues. The patches need to be committed for me as I don't have commit rights. Ok for

[PATCH 1/2] aarch64: Refactor infrastructure for advsimd intrinsics

2024-11-06 Thread vladimir.miloserdov
This patch refactors the infrastructure for defining advsimd pragma intrinsics, adding support for more flexible type and signature handling in future SIMD extensions. A new simd_type structure is introduced, which allows for consistent mode and qualifier management across various advsimd operati