Hi,

As subject, this patch rewrites the v[q]tbl Neon intrinsics to use RTL
builtins rather than inline assembly code, allowing for better scheduling
and optimization.

Regression tested and bootstrapped on aarch64-none-linux-gnu - no
issues.

Ok for master?

Thanks,
Jonathan

---

gcc/ChangeLog:

2021-02-12  Jonathan Wright  <jonathan.wri...@arm.com>

        * config/aarch64/aarch64-simd-builtins.def: Add tbl1 builtin
        generator macros.
        * config/aarch64/arm_neon.h (vqtbl1_p8): Use RTL builtin
        instead of inline asm.
        (vqtbl1_s8): Likewise.
        (vqtbl1_u8): Likewise.
        (vqtbl1q_p8): Likewise.
        (vqtbl1q_s8): Likewise.
        (vqtbl1q_u8): Likewise.
        (vtbl1_s8): Likewise.
        (vtbl1_u8): Likewise.
        (vtbl1_p8): Likewise.
        (vtbl2_s8): Likewise.
        (vtbl2_u8): Likewise.
        (vtbl2_p8): Likewise.

Attachment: rb14154.patch
Description: rb14154.patch

Reply via email to