Re: [Patch, GCC]Backporting r269039 to gcc8

2019-10-25 Thread Delia Burduv
Hello Jeff, Yes, it is a backport to gcc-8. No, I don't have commit access. Could you please commit it for me? Thanks, Delia From: Jeff Law Sent: 04 October 2019 22:27 To: Delia Burduv ; gcc-patches@gcc.gnu.org Cc: nd ; i...@airs.com ; rguent...@su

Re: [GCC][PATCH][AArch32] ACLE intrinsics bfloat16 vmmla and vfma for AArch32 AdvSIMD

2020-01-22 Thread Delia Burduv
Ping. I have read Richard Sandiford's comments on the AArch64 patches and I will apply what is relevant to this patch as well. Particularly, I will change the tests to use the exact input and output registers and I will change the types of the rtl patterns. On 12/20/19 6:44 PM, Delia B

Re: ACLE intrinsics: BFloat16 store (vst{q}_bf16) intrinsics for AArch32

2020-01-22 Thread Delia Burduv
Ping. I will change the tests to use the exact input and output registers as Richard Sandiford suggested for the AArch64 patches. On 12/20/19 6:46 PM, Delia Burduv wrote: > This patch adds the ARMv8.6 ACLE BFloat16 store intrinsics > vst{q}_bf16 as part of the BFloat16 extension.

Re: ACLE intrinsics: BFloat16 load intrinsics for AArch32

2020-01-22 Thread Delia Burduv
Ping. I will change the tests to use the exact input and output registers as Richard Sandiford suggested for the AArch64 patches. On 12/20/19 6:48 PM, Delia Burduv wrote: > This patch adds the ARMv8.6 ACLE BFloat16 load intrinsics vld{q}_bf16 > as part of the BFloat16 extension.

Re: [GCC][PATCH][AArch32] ACLE intrinsics bfloat16 vmmla and vfma for AArch32 AdvSIMD

2020-01-28 Thread Delia Burduv
Ping. From: Delia Burduv Sent: 22 January 2020 17:26 To: gcc-patches@gcc.gnu.org Cc: ni...@redhat.com ; Richard Earnshaw ; Ramana Radhakrishnan ; Kyrylo Tkachov Subject: Re: [GCC][PATCH][AArch32] ACLE intrinsics bfloat16 vmmla and vfma for AArch32 AdvSIMD

Re: ACLE intrinsics: BFloat16 store (vst{q}_bf16) intrinsics for AArch32

2020-01-28 Thread Delia Burduv
Ping. From: Delia Burduv Sent: 22 January 2020 17:29 To: gcc-patches@gcc.gnu.org Cc: ni...@redhat.com ; Richard Earnshaw ; Kyrylo Tkachov ; Ramana Radhakrishnan Subject: Re: ACLE intrinsics: BFloat16 store (vst{q}_bf16) intrinsics for AArch32 Ping. I will

Re: ACLE intrinsics: BFloat16 load intrinsics for AArch32

2020-01-28 Thread Delia Burduv
Ping. From: Delia Burduv Sent: 22 January 2020 17:31 To: gcc-patches@gcc.gnu.org Cc: ni...@redhat.com ; Richard Earnshaw ; Kyrylo Tkachov ; Ramana Radhakrishnan Subject: Re: ACLE intrinsics: BFloat16 load intrinsics for AArch32 Ping. I will change the tests

Re: [GCC][PATCH][AArch64] ACLE intrinsics bfmmla and bfmlal for AArch64 AdvSIMD

2020-01-31 Thread Delia Burduv
Thank you, Richard! Here is the updated patch. The test that checks for errors when bf16 is disabled is in the bfcvt patch. Cheers, Delia gcc/ChangeLog: 2019-11-06 Delia Burduv * config/aarch64/aarch64-simd-builtins.def (bfcvtn): New built-in function

Re: [GCC][PATCH][AArch64] ACLE intrinsics for BFCVTN, BFCVTN2 (AArch64 AdvSIMD) and BFCVT (AArch64 FP)

2020-01-31 Thread Delia Burduv
type. gcc/testsuite/ChangeLog: 2020-01-31 Delia Burduv * gcc.target/aarch64/advsimd-intrinsics/bfcvt-compile.c: New test. * gcc.target/aarch64/advsimd-intrinsics/bfcvt-nobf16.c: New test. * gcc.target/aarch64/advsimd-intrinsics/bfcvt-nosimd.c

Re: [GCC][PATCH][AArch32] ACLE intrinsics bfloat16 vmmla and vfma for AArch32 AdvSIMD

2020-01-31 Thread Delia Burduv
Here is the updated patch. The changes are minor, so let me know if there is anything else to fix or if it can be committed. Thank you, Delia On 1/30/20 2:55 PM, Kyrill Tkachov wrote: > Hi Delia, > > > On 1/28/20 4:44 PM, Delia Burduv wr

Re: [GCC][PATCH][AArch64] ACLE intrinsics bfmmla and bfmlal for AArch64 AdvSIMD

2020-01-31 Thread Delia Burduv
Sure, here it is. I'll do that for the other patch too. Thanks, Delia On 1/31/20 3:37 PM, Richard Sandiford wrote: > Delia Burduv writes: >> Thank you, Richard! >> >> Here is the updated patch. The test that checks for errors when bf16 is >> disabled is in

[GCC][PATCH][AArch64] ACLE intrinsics bfmmla and bfmlal for AArch64 AdvSIMD

2019-12-20 Thread Delia Burduv
don't have commit rights, so if this is ok can someone please commit it for me? gcc/ChangeLog: 2019-10-29 Delia Burduv * config/aarch64/aarch64-simd-builtins.def (bfmmla): New built-in function. (bfmlalb): New built-in function. (bfmlalt): New bui

[GCC][PATCH][AArch64] ACLE intrinsics for BFCVTN, BFCVTN2 (AArch64 AdvSIMD) and BFCVT (AArch64 FP)

2019-12-20 Thread Delia Burduv
-elf. I don't have commit rights, so if this is ok can someone please commit it for me? gcc/ChangeLog: 2019-11-06 Delia Burduv * config/aarch64/aarch64-simd-builtins.def (bfcvtn): New built-in function. (bfcvtn_q): New built-in function. (bf

[GCC][PATCH][AArch32] ACLE intrinsics bfloat16 vmmla and vfma for AArch32 AdvSIMD

2019-12-20 Thread Delia Burduv
xpand. (neon_vget_high): Changed iterator to VQXBF. * config/arm/unspecs.md (UNSPEC_BFMMLA): New UNSPEC. (UNSPEC_BFMAB): New UNSPEC. (UNSPEC_BFMAT): New UNSPEC. 2019-11-12 Delia Burduv * gcc.target/arm/simd/bf16_ma_1.c: New test. * gcc.target/arm/simd/bf16_m

ACLE intrinsics: BFloat16 store (vst{q}_bf16) intrinsics for AArch32

2019-12-20 Thread Delia Burduv
depends on the Arm back-end patch (https://gcc.gnu.org/ml/gcc-patches/2019-12/msg01448.html). Tested for regression on arm-none-eabi and armeb-none-eabi. I don't have commit rights, so if this is ok can someone please commit it for me? gcc/ChangeLog: 2019-11-14 Delia Burduv * c

ACLE intrinsics: BFloat16 load intrinsics for AArch32

2019-12-20 Thread Delia Burduv
depends on the Arm back-end patch (https://gcc.gnu.org/ml/gcc-patches/2019-12/msg01448.html). Tested for regression on arm-none-eabi and armeb-none-eabi. I don't have commit rights, so if this is ok can someone please commit it for me? gcc/ChangeLog: 2019-11-14 Delia Burduv * c

Re: [GCC][PATCH][AArch32] ACLE intrinsics bfloat16 vmmla and vfma for AArch32 AdvSIMD

2020-02-19 Thread Delia Burduv
Hi, Here is the latest version of the patch. It just has some minor formatting changes that were brought up by Richard Sandiford in the AArch64 patches. Thanks, Delia On 1/31/20 3:23 PM, Delia Burduv wrote: Here is the updated patch. The changes are minor, so let me know if there is

Re: ACLE intrinsics: BFloat16 store (vst{q}_bf16) intrinsics for AArch32

2020-02-19 Thread Delia Burduv
Hi, Here is the latest version of the patch. It just has some minor formatting changes that were brought up by Richard Sandiford in the AArch64 patches. Thanks, Delia On 1/22/20 5:29 PM, Delia Burduv wrote: Ping. I will change the tests to use the exact input and output registers as

Re: ACLE intrinsics: BFloat16 load intrinsics for AArch32

2020-02-19 Thread Delia Burduv
Hi, Here is the latest version of the patch. It just has some minor formatting changes that were brought up by Richard Sandiford in the AArch64 patches. Thanks, Delia On 1/22/20 5:31 PM, Delia Burduv wrote: Ping. I will change the tests to use the exact input and output registers as

Re: ACLE intrinsics: BFloat16 store (vst{q}_bf16) intrinsics for AArch32

2020-02-21 Thread Delia Burduv
Hi Kyrill, The arm_bf16.h is only used for scalar operations. That is how the aarch64 versions are implemented too. Thanks, Delia On 2/21/20 2:06 PM, Kyrill Tkachov wrote: Hi Delia, On 2/19/20 5:25 PM, Delia Burduv wrote: Hi, Here is the latest version of the patch. It just has some

Re: ACLE intrinsics: BFloat16 store (vst{q}_bf16) intrinsics for AArch32

2020-03-03 Thread Delia Burduv
Hi, I made a mistake in the previous patch. This is the latest version. Please let me know if it is ok. Thanks, Delia On 2/21/20 3:18 PM, Delia Burduv wrote: Hi Kyrill, The arm_bf16.h is only used for scalar operations. That is how the aarch64 versions are implemented too. Thanks, Delia

Re: ACLE intrinsics: BFloat16 store (vst{q}_bf16) intrinsics for AArch32

2020-03-03 Thread Delia Burduv
Sorry, I forgot the attachment. On 3/3/20 4:20 PM, Delia Burduv wrote: Hi, I made a mistake in the previous patch. This is the latest version. Please let me know if it is ok. Thanks, Delia On 2/21/20 3:18 PM, Delia Burduv wrote: Hi Kyrill, The arm_bf16.h is only used for scalar

Re: [GCC][PATCH][AArch64] ACLE intrinsics for BFCVTN, BFCVTN2 (AArch64 AdvSIMD) and BFCVT (AArch64 FP)

2020-03-03 Thread Delia Burduv
Hi, Here is the latest version of the patch. On 2/18/20 1:51 PM, Richard Sandiford wrote: Tamar Christina writes: Hi Richard, ..ffb5305e2e5ea1aadae07e82f d8e d6f9f247c1a9 --- /dev/null +++ b/gcc/testsuite/gcc.target/aarch64/advsimd-intrinsics/bfcvt-c

Re: ACLE intrinsics: BFloat16 store (vst{q}_bf16) intrinsics for AArch32

2020-03-03 Thread Delia Burduv
Hi, I noticed that the patch doesn't apply cleanly. I fixed it and this is the latest version. Thanks, Delia On 3/3/20 4:23 PM, Delia Burduv wrote: Sorry, I forgot the attachment. On 3/3/20 4:20 PM, Delia Burduv wrote: Hi, I made a mistake in the previous patch. This is the l

Re: ACLE intrinsics: BFloat16 load intrinsics for AArch32

2020-03-04 Thread Delia Burduv
and the updated ChangeLog. gcc/ChangeLog: 2019-03-04 Delia Burduv * config/arm/arm_neon.h (bfloat16_t): New typedef. (vld2_bf16): New. (vld2q_bf16): New. (vld3_bf16): New. (vld3q_bf16): New. (vld4_bf16): New. (vld4q_bf16): New

Re: [GCC][PATCH][AArch32] ACLE intrinsics bfloat16 vmmla and vfma for AArch32 AdvSIMD

2020-03-04 Thread Delia Burduv
Hi, This is the latest version of the patch. Thanks, Delia On 2/21/20 11:41 AM, Kyrill Tkachov wrote: Hi Delia, On 2/19/20 5:23 PM, Delia Burduv wrote: Hi, Here is the latest version of the patch. It just has some minor formatting changes that were brought up by Richard Sandiford in the

Re: ACLE intrinsics: BFloat16 store (vst{q}_bf16) intrinsics for AArch32

2020-03-05 Thread Delia Burduv
Hi, This is the latest version of the patch. I am forcing -mfloat-abi=hard because the register allocator behaves differently depending on which float-abi is used. Thanks, Delia On 3/4/20 5:20 PM, Kyrill Tkachov wrote: Hi Delia, On 3/3/20 5:23 PM, Delia Burduv wrote: Hi, I noticed that

Re: ACLE intrinsics: BFloat16 store (vst{q}_bf16) intrinsics for AArch32

2020-03-05 Thread Delia Burduv
Hi, This is the latest version of the patch. I am forcing -mfloat-abi=hard because the register allocator behaves differently depending on the float-abi used. Thanks, Delia On 3/4/20 5:20 PM, Kyrill Tkachov wrote: Hi Delia, On 3/3/20 5:23 PM, Delia Burduv wrote: Hi, I noticed that the

Re: ACLE intrinsics: BFloat16 load intrinsics for AArch32

2020-03-05 Thread Delia Burduv
Hi, This is the latest version of the patch. I am forcing -mfloat-abi=hard because the code generated is slightly different depending on the float-abi used. Thanks, Delia On 3/4/20 5:20 PM, Kyrill Tkachov wrote: Hi Delia, On 3/4/20 2:05 PM, Delia Burduv wrote: Hi, The previous version

Re: [GCC][PATCH][AArch64] ACLE intrinsics for BFCVTN, BFCVTN2 (AArch64 AdvSIMD) and BFCVT (AArch64 FP)

2020-03-05 Thread Delia Burduv
: gcc.target/aarch64/advsimd-intrinsics/bfcvt-nosimd.c -Og -g (test for excess errors) FAIL: gcc.target/aarch64/advsimd-intrinsics/bfcvt-nosimd.c -Os (test for excess errors) I think that's because: Delia Burduv writes: diff --git a/gcc/config/aarch64/arm_bf16.h b/gcc/config/aarch64/arm_b

[GCC][Patch]Bug fix: cannot convert 'const short int*' to 'const __bf16*'

2020-03-11 Thread Delia Burduv
-gnueabihf. No regression from before the original patch. Tests that failed or became unsupported because of the original tests now work as they did before it. gcc/ChangeLog: 2020-03-11 Delia Burduv * config/arm/arm-builtins.c (arm_init_simd_builtin_scalar_types): New

Re: [Patch, GCC]Backporting r269039 to gcc8

2019-10-04 Thread Delia Burduv
Ping. Has anyone had a look at the patch? Please let me know if it is fine. Thanks, Delia From: Delia Burduv Sent: 18 September 2019 15:54 To: gcc-patches@gcc.gnu.org Cc: nd ; l...@redhat.com ; i...@airs.com ; rguent...@suse.de Subject: [Patch, GCC

[Patch, GCC]Backporting r269039 to gcc8

2019-09-18 Thread Delia Burduv
ommit rights, so if it is ok can someone please commit it for me? Thanks, Delia gcc/ChangeLog: 2019-09-13 Delia Burduv Backport from trunk 2019-02-20 Andre Vieira PR target/86487 * lra-constraints.c (uses_hard_regs_p): Fix handling of paradoxical SU