Re: [PATCH] Fix _mm512_cvt_roundps_ph to generate sae instruction.

2022-09-04 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 5, 2022 at 10:44 AM liuhongt wrote:
>
> zmm-version vcvtps2ph is special, it encodes {sae} in evex, but put
> round control in the imm. For intrinsic _mm512_cvt_roundps_ph (a,
> imm), imm contains both {sae} and round control, we need to separate
> it in the assembly output since vcvtp

[PATCH] Fix _mm512_cvt_roundps_ph to generate sae instruction.

2022-09-04 Thread liuhongt via Gcc-patches
The zmm version of vcvtps2ph is special: it encodes {sae} in the EVEX prefix but puts the rounding control in the imm. For the intrinsic _mm512_cvt_roundps_ph (a, imm), imm contains both {sae} and the rounding control, so we need to separate them in the assembly output, since vcvtps2ph ignores imm[3:7]. Corresponding llvm patch. Intri
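
To illustrate the separation described above, here is a minimal sketch in C of how an intrinsic-level imm could be split into an {sae} flag and the rounding-control bits that stay in the instruction's immediate. This is not the code from the patch. The struct and function names (split_imm, split_round_imm) are hypothetical, and the FROUND_* constants are local copies assumed to match the values of the _MM_FROUND_* macros in the x86 intrinsic headers.

```c
#include <assert.h>
#include <stdbool.h>

/* Local copies of the _MM_FROUND_* values (assumed to match the
   x86 intrinsic headers); renamed to avoid clashing with them. */
#define FROUND_TO_NEAREST_INT 0x00
#define FROUND_TO_NEG_INF     0x01
#define FROUND_TO_POS_INF     0x02
#define FROUND_TO_ZERO        0x03
#define FROUND_CUR_DIRECTION  0x04
#define FROUND_NO_EXC         0x08  /* requests {sae} */

/* Hypothetical result type: what goes into EVEX vs. the immediate. */
struct split_imm {
    bool sae;  /* true if {sae} should be emitted on the instruction */
    int  rc;   /* low three bits kept as the instruction's immediate */
};

/* Split an intrinsic-level imm8 into the {sae} flag and the
   rounding-control bits. Bits 3..7 are ignored by vcvtps2ph, so
   only the low three bits are kept for the immediate. */
static struct split_imm split_round_imm(int imm)
{
    struct split_imm out;

    assert(imm >= 0 && imm <= 0xFF);  /* imm is an 8-bit immediate */
    out.sae = (imm & FROUND_NO_EXC) != 0;
    out.rc  = imm & 0x07;
    return out;
}
```

For example, split_round_imm(FROUND_TO_ZERO | FROUND_NO_EXC) yields sae set and rc equal to 3, while split_round_imm(FROUND_CUR_DIRECTION) yields sae clear and rc equal to 4.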