RE: [PATCH] x86: improve fast bfloat->float conversion

2023-07-11 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Jan Beulich > Sent: Tuesday, July 11, 2023 3:50 PM > To: Liu, Hongtao > Cc: Kirill Yukhin ; gcc-patches@gcc.gnu.org > Subject: Re: [PATCH] x86: improve fast bfloat->float conversion > > On 11.07.2023 08:45, Liu, Hongtao wrote:

Re: [PATCH] x86: improve fast bfloat->float conversion

2023-07-11 Thread Jakub Jelinek via Gcc-patches
On Tue, Jul 11, 2023 at 09:50:23AM +0200, Jan Beulich via Gcc-patches wrote: > > Quote from what Jakub said in [1]. > > --- > > This is not correct. > > While using such code for _mm_cvtsbh_ss is fine if it is documented not to > > raise exceptions and turn a sNaN into a qNaN, it is not fine fo

Re: [PATCH] x86: improve fast bfloat->float conversion

2023-07-11 Thread Jan Beulich via Gcc-patches
On 11.07.2023 08:45, Liu, Hongtao wrote: >> -Original Message- >> From: Jan Beulich >> Sent: Tuesday, July 11, 2023 2:08 PM >> >> There's nothing AVX512BW-ish in here, so no reason to use Yw as the >> constraints for the AVX alternative. Furthermore by using the 512-bit form of >> VPSSLD (

RE: [PATCH] x86: improve fast bfloat->float conversion

2023-07-10 Thread Liu, Hongtao via Gcc-patches
> -Original Message- > From: Jan Beulich > Sent: Tuesday, July 11, 2023 2:08 PM > To: gcc-patches@gcc.gnu.org > Cc: Liu, Hongtao ; Kirill Yukhin > > Subject: [PATCH] x86: improve fast bfloat->float conversion > > There's nothing AVX512BW-ish in

[PATCH] x86: improve fast bfloat->float conversion

2023-07-10 Thread Jan Beulich via Gcc-patches
There's nothing AVX512BW-ish in here, so no reason to use Yw as the constraints for the AVX alternative. Furthermore by using the 512-bit form of VPSSLD (in a new alternative) all 32 registers can be used directly by the insn without AVX512VL needing to be enabled. Also adjust the originally last