Re: Enable MOVDIRI, MOVDIR64B, CLDEMOTE and WAITPKG for march=tremont

2020-11-10 Thread Hongtao Liu via Gcc-patches
On Tue, Nov 10, 2020 at 4:17 PM Hongtao Liu wrote: > > On Tue, Nov 10, 2020 at 3:22 AM Jason Merrill via Gcc-patches > wrote: > > > > This patch was also applied to the GCC 9 and 10 branches and breaks those > > builds, because PTA_CLDEMOTE is not defined. > > > Mine, let me fix it, sorry for tha

Re: [PATCH] [PR target/97194] [AVX2] Support variable index vec_set.

2020-11-11 Thread Hongtao Liu via Gcc-patches
ping ^3 Rebase patch on latest trunk. On Tue, Oct 27, 2020 at 3:51 PM Hongtao Liu wrote: > > ping^1 > > On Tue, Oct 20, 2020 at 3:36 PM Richard Biener > wrote: > > > > On Tue, Oct 20, 2020 at 4:35 AM Hongtao Liu wrote: > > > > > > On Mon, Oct 19, 2020 at 5:55 PM Richard Biener > > > wrote: >

[PATCH,wwwdocs] gcc-11/changes: Mention Intel AVX-VNNI

2020-11-11 Thread Hongtao Liu via Gcc-patches
[GCC-11] Mention Intel AVX-VNNI and add it to ALDERLAKE and SAPPIRERAPIDS, also add HRESET to ALDERLAKE. diff --git a/htdocs/gcc-11/changes.html b/htdocs/gcc-11/changes.html index d7a3a1f9..fc4c74f4 100644 --- a/htdocs/gcc-11/changes.html +++ b/htdocs/gcc-11/changes.html @@ -267,15 +267,20 @@ a wo

Re: [PATCH] [PR target/97194] [AVX2] Support variable index vec_set.

2020-11-11 Thread Hongtao Liu via Gcc-patches
On Wed, Nov 11, 2020 at 4:45 PM Uros Bizjak wrote: > > > gcc/ChangeLog: > > > > PR target/97194 > > * config/i386/i386-expand.c (ix86_expand_vector_set_var): New function. > > * config/i386/i386-protos.h (ix86_expand_vector_set_var): New Decl. > > * config/i386/predicates.md (vec_setm_operand): Ne

Re: [PATCH] [PR target/97194] [AVX2] Support variable index vec_set.

2020-11-12 Thread Hongtao Liu via Gcc-patches
On Thu, Nov 12, 2020 at 4:21 PM Uros Bizjak wrote: > > On Thu, Nov 12, 2020 at 3:04 AM Hongtao Liu wrote: > > > > > gcc/ChangeLog: > > > > > > > > PR target/97194 > > > > * config/i386/i386-expand.c (ix86_expand_vector_set_var): New function. > > > > * config/i386/i386-protos.h (ix86_expand_vecto

Re: [PATCH] [PR target/97194] [AVX2] Support variable index vec_set.

2020-11-12 Thread Hongtao Liu via Gcc-patches
On Thu, Nov 12, 2020 at 5:12 PM Hongtao Liu wrote: > > On Thu, Nov 12, 2020 at 4:21 PM Uros Bizjak wrote: > > > > On Thu, Nov 12, 2020 at 3:04 AM Hongtao Liu wrote: > > > > > > > gcc/ChangeLog: > > > > > > > > > > PR target/97194 > > > > > * config/i386/i386-expand.c (ix86_expand_vector_set_var)

Re: [PATCH] [PR target/97194] [AVX2] Support variable index vec_set.

2020-11-12 Thread Hongtao Liu via Gcc-patches
On Thu, Nov 12, 2020 at 5:15 PM Hongtao Liu wrote: > > On Thu, Nov 12, 2020 at 5:12 PM Hongtao Liu wrote: > > > > On Thu, Nov 12, 2020 at 4:21 PM Uros Bizjak wrote: > > > > > > On Thu, Nov 12, 2020 at 3:04 AM Hongtao Liu wrote: > > > > > > > > > gcc/ChangeLog: > > > > > > > > > > > > PR target/

Re: [PATCH,wwwdocs] gcc-11/changes: Mention Intel AVX-VNNI

2020-11-12 Thread Hongtao Liu via Gcc-patches
Got it. On Fri, Nov 13, 2020 at 3:26 AM Gerald Pfeifer wrote: > > On Wed, 11 Nov 2020, Hongtao Liu via Gcc-patches wrote: > > + New ISA extension support for Intel AVX-VNNI was added to GCC. > > More for the future (i.e., no need to change that now): I suggest > to skip &

Re: [committed] wwwdocs: Editorial changes around x86-64 ISA extensions

2020-11-12 Thread Hongtao Liu via Gcc-patches
On Fri, Nov 13, 2020 at 3:32 AM Gerald Pfeifer wrote: > > Per our discussion on the list (plus a grammer improvement in a > section above). > > One question: why are the ISA extension lists not alphabetically > sorted? Wouldn't that be beneficial for users? Easier to find > something and also ea

Re: [PATCH][AVX512]Lower AVX512 vector compare to AVX version when dest is vector

2020-11-16 Thread Hongtao Liu via Gcc-patches
On Tue, Nov 17, 2020 at 8:05 AM Jeff Law wrote: > > > On 9/2/20 3:34 AM, Hongtao Liu via Gcc-patches wrote: > > Hi: > > Add define_peephole2 to eliminate potential redundant conversion > > from mask to vector. > > Bootstrap is ok, regression test is ok for

Re: [PATCH] Check the type of mask while generating cond_op in gimple simplication.

2021-08-31 Thread Hongtao Liu via Gcc-patches
On Tue, Aug 31, 2021 at 7:56 PM Richard Biener wrote: > > On Tue, Aug 31, 2021 at 12:18 PM Hongtao Liu wrote: > > > > On Mon, Aug 30, 2021 at 8:25 PM Richard Biener via Gcc-patches > > wrote: > > > > > > On Fri, Aug 27, 2021 at 8:53 AM liuhongt wrote: > > > > > > > > When gimple simplifcation

Re: [PATCH V3 0/6] Initial support for AVX512FP16

2021-09-01 Thread Hongtao Liu via Gcc-patches
I'm going to check in the first 3 patches which are already approved. Update hf soft-fp from glibc. [i386] Enable _Float16 type for TARGET_SSE2 and above. [i386] libgcc: Enable hfmode soft-sf/df/xf/tf extensions and truncations. On Mon, Aug 2, 2021 at 2:31 PM liuhongt wrote: > > Update

Re: [PATCH 4/6] Support -fexcess-precision=16 which will enable FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when backend supports _Float16.

2021-09-01 Thread Hongtao Liu via Gcc-patches
On Tue, Aug 24, 2021 at 5:39 PM Hongtao Liu wrote: > > On Tue, Aug 17, 2021 at 9:53 AM Hongtao Liu wrote: > > > > On Fri, Aug 6, 2021 at 2:06 PM Hongtao Liu wrote: > > > > > > On Tue, Aug 3, 2021 at 10:44 AM Hongtao Liu wrote: > > > > > > > > On Tue, Aug 3, 2021 at 3:34 AM Joseph Myers > > >

Re: [PATCH] Check the type of mask while generating cond_op in gimple simplication.

2021-09-01 Thread Hongtao Liu via Gcc-patches
On Wed, Sep 1, 2021 at 8:52 PM Richard Sandiford wrote: > > Richard Biener writes: > > On Wed, Sep 1, 2021 at 8:28 AM Hongtao Liu wrote: > >> > >> On Tue, Aug 31, 2021 at 7:56 PM Richard Biener > >> wrote: > >> > > >> > On Tue, Aug 31, 2021 at 12:18 PM Hongtao Liu wrote: > >> > > > >> > > On M

[PATCH V3 0/6] Initial support for AVX512FP16

2021-09-02 Thread Hongtao Liu via Gcc-patches
On Thursday, September 2, 2021, Iain Sandoe wrote: > Hi Hongtao. > > > On 2 Sep 2021, at 07:06, Hongtao Liu via Gcc-patches < > gcc-patches@gcc.gnu.org> wrote: > > > > I'm going to check in the first 3 patches which are already approved. > > > >

Re: [PATCH V3 0/6] Initial support for AVX512FP16

2021-09-02 Thread Hongtao Liu via Gcc-patches
On Thursday, September 2, 2021, H.J. Lu wrote: > On Wed, Sep 1, 2021 at 11:00 PM Hongtao Liu wrote: > > > > I'm going to check in the first 3 patches which are already approved. > > > > Update hf soft-fp from glibc. > > [i386] Enable _Float16 type for TARGET_SSE2 and above. > > [i386] libg

Re: [PATCH] Explicitly add -msse2 to compile HF related libgcc source file.

2021-09-05 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 3, 2021 at 5:01 PM Jakub Jelinek via Gcc-patches wrote: > > On Fri, Sep 03, 2021 at 03:41:13PM +0800, liuhongt via Gcc-patches wrote: > > --- a/libgcc/config/i386/64/t-softfp > > +++ b/libgcc/config/i386/64/t-softfp > > @@ -1 +1,6 @@ > > softfp_extras := fixhfti fixunshfti floattihf f

Re: [PATCH] Explicitly add -msse2 to compile HF related libgcc source file.

2021-09-05 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 3, 2021 at 5:01 PM Jakub Jelinek via Gcc-patches wrote: > > On Fri, Sep 03, 2021 at 03:41:13PM +0800, liuhongt via Gcc-patches wrote: > > --- a/libgcc/config/i386/64/t-softfp > > +++ b/libgcc/config/i386/64/t-softfp > > @@ -1 +1,6 @@ > > softfp_extras := fixhfti fixunshfti floattihf f

Re: [PATCH 2/6] [i386] Enable _Float16 type for TARGET_SSE2 and above.

2021-09-05 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 3, 2021 at 8:42 PM Jakub Jelinek via Gcc-patches wrote: > > On Mon, Aug 02, 2021 at 02:31:12PM +0800, liuhongt via Gcc-patches wrote: > > * doc/extend.texi (Half-Precision Floating Point): Documemt > > _Float16 for x86. > > > --- a/gcc/doc/extend.texi > > +++ b/gcc/doc/exte

Re: [PATCH] x86: Add non-destructive source to @xorsign3_1

2021-09-05 Thread Hongtao Liu via Gcc-patches
On Sun, Sep 5, 2021 at 5:54 AM H.J. Lu via Gcc-patches wrote: > > Add non-destructive source alternative to @xorsign3_1 for AVX. LGTM. > > gcc/ > > PR target/89984 > * config/i386/i386-expand.c (ix86_split_xorsign): Use operands[2]. > * config/i386/i386.md (@xorsign3_1): Ad

Re: [PATCH] x86: Enable FMA in unsigned SI to SF expanders

2021-09-05 Thread Hongtao Liu via Gcc-patches
On Sun, Sep 5, 2021 at 5:56 AM H.J. Lu via Gcc-patches wrote: > > Enable FMA in scalar/vector unsigned SI to SF expanders. > > gcc/ > > PR target/85819 > * config/i386/i386-expand.c (ix86_expand_convert_uns_sisf_sse): > Enable FMA. > (ix86_expand_vector_convert_uns_

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-06 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 6, 2021 at 4:46 PM liuhongt via Gcc-patches wrote: > > Hi: > As discussed in [1], most of (currently unopposed) targets want > auto-vectorization at O2, and IMHO now would be a good time to enable O2 > vectorization for GCC trunk, so it would leave enough time to expose > related iss

Re: [PATCH] Check the type of mask while generating cond_op in gimple simplication.

2021-09-06 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 6, 2021 at 2:54 PM Richard Biener wrote: > > On Thu, Sep 2, 2021 at 7:54 PM Richard Sandiford > wrote: > > > > Hongtao Liu via Gcc-patches writes: > > > On Wed, Sep 1, 2021 at 8:52 PM Richard Sandiford > > > wrote: > > >> > >

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-06 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 6, 2021 at 5:19 PM Richard Biener via Gcc-patches wrote: > > On Mon, Sep 6, 2021 at 10:47 AM liuhongt via Gcc-patches > wrote: > > > > Hi: > > As discussed in [1], most of (currently unopposed) targets want > > auto-vectorization at O2, and IMHO now would be a good time to enable O2

Re: [PATCH] Avoid FROM being overwritten in expand_fix.

2021-09-06 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 6, 2021 at 6:03 PM Richard Biener via Gcc-patches wrote: > > On Mon, 6 Sep 2021, liuhongt wrote: > > > Hi: > > For the conversion from _Float16 to int, if the corresponding optab > > does not exist, the compiler will try the wider mode (SFmode here), > > but when floatsfsi exists but

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-06 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 6, 2021 at 5:42 PM Jakub Jelinek via Gcc-patches wrote: > > On Mon, Sep 06, 2021 at 11:18:47AM +0200, Richard Biener wrote: > > On Mon, Sep 6, 2021 at 10:47 AM liuhongt via Gcc-patches > > wrote: > > > > > > Hi: > > > As discussed in [1], most of (currently unopposed) targets want >

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-06 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 6, 2021 at 7:01 PM Jakub Jelinek wrote: > > On Mon, Sep 06, 2021 at 06:58:37PM +0800, Hongtao Liu wrote: > > > process_options would mean it affects only the command line and not > > > __attribute__((optimize ("O2", "ftree-vectorize"))) > > > etc. > > > So, shouldn't it be instead done

Re: [PATCH v2] x86: Enable FMA in unsigned SI to SF expanders

2021-09-06 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 6, 2021 at 8:35 PM H.J. Lu wrote: > > On Sun, Sep 5, 2021 at 10:34 PM Hongtao Liu wrote: > > > > On Sun, Sep 5, 2021 at 5:56 AM H.J. Lu via Gcc-patches > > wrote: > > > > > > Enable FMA in scalar/vector unsigned SI to SF expanders. > > > > > > gcc/ > > > > > > PR target/85819

Re: [PATCH 2/6] [i386] Enable _Float16 type for TARGET_SSE2 and above.

2021-09-06 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 6, 2021 at 8:13 PM Jakub Jelinek wrote: > > On Mon, Sep 06, 2021 at 10:05:00AM +0800, Hongtao Liu wrote: > > > @@ -1076,9 +1076,11 @@ systems where @code{__float128} is supported. > > The @code{_Float32} > > type is supported on all systems supporting IEEE binary32; the > > @code{_Fl

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-06 Thread Hongtao Liu via Gcc-patches
On Tue, Sep 7, 2021 at 12:37 AM Joseph Myers wrote: > > On Mon, 6 Sep 2021, liuhongt via Gcc-patches wrote: > > > Hi: > > As discussed in [1], most of (currently unopposed) targets want > > auto-vectorization at O2, and IMHO now would be a good time to enable O2 > > vectorization for GCC trunk,

Re: [PATCH 2/6] [i386] Enable _Float16 type for TARGET_SSE2 and above.

2021-09-07 Thread Hongtao Liu via Gcc-patches
On Tue, Sep 7, 2021 at 3:18 PM Jakub Jelinek wrote: > > On Tue, Sep 07, 2021 at 09:52:57AM +0800, Hongtao Liu wrote: > > Adjust the wording for x86 _Float16 type. > > > > gcc/ChangeLog: > > > > * doc/extend.texi: (@node Floating Types): Adjust the wording. > > (@node Half-Precision): Ditto. > > >

Re: [PATCH] libgcc, i386: Export *hf* and *hc* from libgcc_s.so.1

2021-09-07 Thread Hongtao Liu via Gcc-patches
On Tue, Sep 7, 2021 at 8:29 PM Jakub Jelinek via Gcc-patches wrote: > > On Mon, Sep 06, 2021 at 10:58:53AM +0200, Jakub Jelinek via Gcc-patches wrote: > > On Mon, Sep 06, 2021 at 08:49:27AM +0100, Iain Sandoe wrote: > > > > Ok. The *.ver changes are still needed (see above), but that can be > >

Re: [PATCH 2/2] Get rid of all float-int special cases in validate_subreg.

2021-09-07 Thread Hongtao Liu via Gcc-patches
On Wed, Sep 8, 2021 at 7:20 AM Segher Boessenkool wrote: > > On Fri, Sep 03, 2021 at 05:05:47PM +0200, Andreas Schwab wrote: > > On Sep 02 2021, Segher Boessenkool wrote: > > > On Tue, Aug 31, 2021 at 07:17:49PM +0800, liuhongt via Gcc-patches wrote: > > >>* emit-rtl.c (validate_subreg): Get r

Re: [PATCH] libgcc, i386: Export *hf* and *hc* from libgcc_s.so.1

2021-09-07 Thread Hongtao Liu via Gcc-patches
On Wed, Sep 8, 2021 at 8:54 AM Hongtao Liu wrote: > > On Tue, Sep 7, 2021 at 8:29 PM Jakub Jelinek via Gcc-patches > wrote: > > > > On Mon, Sep 06, 2021 at 10:58:53AM +0200, Jakub Jelinek via Gcc-patches > > wrote: > > > On Mon, Sep 06, 2021 at 08:49:27AM +0100, Iain Sandoe wrote: > > > > > Ok.

Re: [PATCH V2 00/10] Initial support for AVX512FP16

2021-09-07 Thread Hongtao Liu via Gcc-patches
On Wed, Jul 21, 2021 at 3:43 PM liuhongt wrote: > > Hi: > As discussed in [1], this patch support _Float16 under target sse2 > and above, w/o avx512fp16, _Float16 type is storage only, all operations > are emulated by soft-fp and float instructions. Soft-fp keeps the intermediate > result of the

Re: [PATCH V2 00/10] Initial support for AVX512FP16

2021-09-07 Thread Hongtao Liu via Gcc-patches
On Wed, Sep 8, 2021 at 10:54 AM Hongtao Liu wrote: > > On Wed, Jul 21, 2021 at 3:43 PM liuhongt wrote: > > > > Hi: > > As discussed in [1], this patch support _Float16 under target sse2 > > and above, w/o avx512fp16, _Float16 type is storage only, all operations > > are emulated by soft-fp and

Re: [PATCH] i386: Fix up @xorsign3_1 [PR102224]

2021-09-08 Thread Hongtao Liu via Gcc-patches
On Wed, Sep 8, 2021 at 3:43 PM Jakub Jelinek via Gcc-patches wrote: > > Hi! > > As the testcase shows, we miscompile @xorsign3_1 if both input > operands are in the same register, because the splitter overwrites op1 > before with op1 & mask before using op0. > > For dest = xorsign op0, op0 we can

Re: [PATCH] libgcc, i386: Export *hf* and *hc* from libgcc_s.so.1

2021-09-08 Thread Hongtao Liu via Gcc-patches
On Wed, Sep 8, 2021 at 5:09 PM Jakub Jelinek wrote: > > On Wed, Sep 08, 2021 at 10:37:17AM +0800, Hongtao Liu wrote: > > Bootstrapped and regtested on x86_64-linux-gnu{-m32,}. > > Ok for trunk? > > > > libgcc/ChangeLog: > > > > * config/i386/t-softfp: Compile __{mul,div}hc3 into > >

Re: [PATCH] i386: Fix up @xorsign3_1 [PR102224]

2021-09-08 Thread Hongtao Liu via Gcc-patches
On Wed, Sep 8, 2021 at 5:33 PM Jakub Jelinek wrote: > > On Wed, Sep 08, 2021 at 05:23:40PM +0800, Hongtao Liu wrote: > > > Bootstrapped/regtested on x86_64-linux and i686-linux, ok for trunk? > > > > > Patch LGTM. > > Thanks, committed. > > > PS: > > I'm curious why we need the post_reload spli

Re: [PATCH] i386: Fix up @xorsign3_1 [PR102224]

2021-09-08 Thread Hongtao Liu via Gcc-patches
On Wed, Sep 8, 2021 at 6:02 PM Jakub Jelinek wrote: > > On Wed, Sep 08, 2021 at 06:00:50PM +0800, Hongtao Liu wrote: > > Yes, I think so. > > And I find paradoxical subreg like (subreg:V4SF (reg:SF)) are not > > allowed by validate_subreg until r11-621. > > That's why post_reload splitter is neede

Re: [PATCH] Fix SFmode subreg of DImode and TImode

2021-09-08 Thread Hongtao Liu via Gcc-patches
On Thu, Sep 9, 2021 at 3:17 AM Segher Boessenkool wrote: > > On Wed, Sep 08, 2021 at 08:39:31PM +0200, Richard Biener wrote: > > On September 8, 2021 7:08:09 PM GMT+02:00, Segher Boessenkool > > wrote: > > >It is not a good idea to do allow all those things. Most backends can > > >only support

Re: [PATCH] Fix SFmode subreg of DImode and TImode

2021-09-08 Thread Hongtao Liu via Gcc-patches
On Thu, Sep 9, 2021 at 3:17 AM Segher Boessenkool wrote: > > On Wed, Sep 08, 2021 at 08:39:31PM +0200, Richard Biener wrote: > > On September 8, 2021 7:08:09 PM GMT+02:00, Segher Boessenkool > > wrote: > > >It is not a good idea to do allow all those things. Most backends can > > >only support

Re: [PATCH] x86: Add TARGET_AVX256_[MOVE|STORE]_BY_PIECES

2021-09-08 Thread Hongtao Liu via Gcc-patches
On Thu, Sep 9, 2021 at 11:21 AM H.J. Lu via Gcc-patches wrote: > > 1. Add TARGET_AVX256_MOVE_BY_PIECES to perform move by-pieces operation > with 256-bit AVX instructions. > 2. Add TARGET_AVX256_STORE_BY_PIECES to perform move and store by-pieces > operations with 256-bit AVX instructions. > > The

Re: [PATCH 07/62] AVX512FP16: Add vaddph/vsubph/vdivph/vmulph.

2021-09-09 Thread Hongtao Liu via Gcc-patches
On Thu, Jul 1, 2021 at 2:17 PM liuhongt wrote: > > gcc/ChangeLog: > > * config.gcc: Add avx512fp16vlintrin.h. > * config/i386/avx512fp16intrin.h: (_mm512_add_ph): New intrinsic. > (_mm512_mask_add_ph): Likewise. > (_mm512_maskz_add_ph): Likewise. > (_mm512_s

Re: [PATCH] [i386] Remove copysign post_reload splitter for scalar modes.

2021-09-09 Thread Hongtao Liu via Gcc-patches
On Thu, Sep 9, 2021 at 3:54 PM liuhongt wrote: > > Hi: > As a follow up of [1], the patch removes all scalar mode copysign related > post_reload splitter/define_insn and expand copysign directly into below using > paradoxical subregs. [1] https://gcc.gnu.org/pipermail/gcc-patches/2021-September

Re: [PATCH] x86-64: Update AVX512FP16 ABI tests for x32

2021-09-09 Thread Hongtao Liu via Gcc-patches
On Thu, Sep 9, 2021 at 11:31 PM H.J. Lu wrote: > > On Wed, Jul 21, 2021 at 12:44 AM liuhongt wrote: > > > > From: "H.J. Lu" > > > > Copied from regular XMM ABI tests. Only run AVX512FP16 ABI tests for ELF > > targets. > > > > gcc/testsuite/ChangeLog: > > > > * gcc.target/x86_64/abi/avx51

Re: [PATCH] Fix SFmode subreg of DImode and TImode

2021-09-09 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 10, 2021 at 7:49 AM Segher Boessenkool wrote: > > On Thu, Sep 09, 2021 at 08:16:16AM +0200, Richard Biener wrote: > > > I think we should (longer term) get rid of the overloaded meanings and > > > uses of subregs. One fairly simple thing is to make a new rtx code > > > "bit_cast" (or

Re: [PATCH] [i386] Remove copysign post_reload splitter for scalar modes.

2021-09-09 Thread Hongtao Liu via Gcc-patches
On Thu, Sep 9, 2021 at 4:00 PM Hongtao Liu wrote: > > On Thu, Sep 9, 2021 at 3:54 PM liuhongt wrote: > > > > Hi: > > As a follow up of [1], the patch removes all scalar mode copysign related > > post_reload splitter/define_insn and expand copysign directly into below > > using > > paradoxical

Re: [PATCH 09/62] AVX512FP16: Enable _Float16 autovectorization

2021-09-09 Thread Hongtao Liu via Gcc-patches
On Thu, Jul 1, 2021 at 2:17 PM liuhongt wrote: > > From: "H.J. Lu" > > gcc/ChangeLog: > > * config/i386/i386-expand.c > (ix86_avx256_split_vector_move_misalign): Handle V16HF mode. > * config/i386/i386.c > (ix86_preferred_simd_mode): Handle HF mode. > * con

Re: [PATCH] Fix SFmode subreg of DImode and TImode

2021-09-10 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 10, 2021 at 6:54 PM Richard Biener wrote: > > On Fri, Sep 10, 2021 at 5:03 AM Hongtao Liu wrote: > > > > On Fri, Sep 10, 2021 at 7:49 AM Segher Boessenkool > > wrote: > > > > > > On Thu, Sep 09, 2021 at 08:16:16AM +0200, Richard Biener wrote: > > > > > I think we should (longer term)

Re: [PATCH] Fix SFmode subreg of DImode and TImode

2021-09-10 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 10, 2021 at 7:25 PM Hongtao Liu wrote: > > On Fri, Sep 10, 2021 at 6:54 PM Richard Biener > wrote: > > > > On Fri, Sep 10, 2021 at 5:03 AM Hongtao Liu wrote: > > > > > > On Fri, Sep 10, 2021 at 7:49 AM Segher Boessenkool > > > wrote: > > > > > > > > On Thu, Sep 09, 2021 at 08:16:16A

Re: [PATCH 2/2] validate_subreg before call gen_lowpart to avoid ICE.

2021-09-10 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 10, 2021 at 9:16 PM Richard Biener via Gcc-patches wrote: > > On Fri, Sep 10, 2021 at 2:58 PM liuhongt wrote: > > > > gcc/ChangeLog: > > > > * expmed.c (extract_bit_field_using_extv): validate_subreg > > before call gen_lowpart. > > --- > > gcc/expmed.c | 6 +- > >

Re: [PATCH 2/2] validate_subreg before call gen_lowpart to avoid ICE.

2021-09-10 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 10, 2021 at 9:27 PM Hongtao Liu wrote: > > On Fri, Sep 10, 2021 at 9:16 PM Richard Biener via Gcc-patches > wrote: > > > > On Fri, Sep 10, 2021 at 2:58 PM liuhongt wrote: > > > > > > gcc/ChangeLog: > > > > > > * expmed.c (extract_bit_field_using_extv): validate_subreg > > >

Re: [PATCH 2/2] validate_subreg before call gen_lowpart to avoid ICE.

2021-09-10 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 10, 2021 at 9:32 PM Richard Biener wrote: > > On September 10, 2021 3:27:09 PM GMT+02:00, Hongtao Liu > wrote: > >On Fri, Sep 10, 2021 at 9:16 PM Richard Biener via Gcc-patches > > wrote: > >> > >> On Fri, Sep 10, 2021 at 2:58 PM liuhongt wrote: > >> > > >> > gcc/ChangeLog: > >> > >

Re: [PATCH 2/2] validate_subreg before call gen_lowpart to avoid ICE.

2021-09-10 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 10, 2021 at 9:32 PM Richard Biener wrote: > > On September 10, 2021 3:27:09 PM GMT+02:00, Hongtao Liu > wrote: > >On Fri, Sep 10, 2021 at 9:16 PM Richard Biener via Gcc-patches > > wrote: > >> > >> On Fri, Sep 10, 2021 at 2:58 PM liuhongt wrote: > >> > > >> > gcc/ChangeLog: > >> > >

Re: [PATCH] Fix SFmode subreg of DImode and TImode

2021-09-10 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 10, 2021 at 10:08 PM David Edelsohn wrote: > > On Thu, Sep 9, 2021 at 11:03 PM Hongtao Liu wrote: > > > > On Fri, Sep 10, 2021 at 7:49 AM Segher Boessenkool > > wrote: > > > > > > On Thu, Sep 09, 2021 at 08:16:16AM +0200, Richard Biener wrote: > > > > > I think we should (longer term

Re: [PATCH 2/2] validate_subreg before call gen_lowpart to avoid ICE.

2021-09-10 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 10, 2021 at 9:44 PM Hongtao Liu wrote: > > On Fri, Sep 10, 2021 at 9:32 PM Richard Biener > wrote: > > > > On September 10, 2021 3:27:09 PM GMT+02:00, Hongtao Liu > > wrote: > > >On Fri, Sep 10, 2021 at 9:16 PM Richard Biener via Gcc-patches > > > wrote: > > >> > > >> On Fri, Sep 10

Re: [PATCH 2/2] validate_subreg before call gen_lowpart to avoid ICE.

2021-09-10 Thread Hongtao Liu via Gcc-patches
On Sat, Sep 11, 2021 at 5:21 AM Segher Boessenkool wrote: > > On Fri, Sep 10, 2021 at 10:25:45PM +0800, Hongtao Liu wrote: > > Updated patch. > > > > Bootstrapped and regtested on x86_64-linux-gnu{-m32,}, do I need to > > run this patch on other targets machine, or the patch is supposed to > >

Re: [PATCH 2/2] validate_subreg before call gen_lowpart to avoid ICE.

2021-09-10 Thread Hongtao Liu via Gcc-patches
On Sat, Sep 11, 2021 at 8:29 AM Hongtao Liu wrote: > > On Sat, Sep 11, 2021 at 5:21 AM Segher Boessenkool > wrote: > > > > On Fri, Sep 10, 2021 at 10:25:45PM +0800, Hongtao Liu wrote: > > > Updated patch. > > > > > > Bootstrapped and regtested on x86_64-linux-gnu{-m32,}, do I need to > > > run

Re: [PATCH 2/2] validate_subreg before call gen_lowpart to avoid ICE.

2021-09-11 Thread Hongtao Liu via Gcc-patches
On Sat, Sep 11, 2021 at 4:25 PM Richard Biener via Gcc-patches wrote: > > On September 10, 2021 11:27:16 PM GMT+02:00, Segher Boessenkool > wrote: > >On Fri, Sep 10, 2021 at 08:36:12PM +0200, Richard Biener wrote: > >> On September 10, 2021 6:24:50 PM GMT+02:00, Segher Boessenkool > >> wrote:

Re: [PATCH 2/2] validate_subreg before call gen_lowpart to avoid ICE.

2021-09-11 Thread Hongtao Liu via Gcc-patches
On Sat, Sep 11, 2021 at 5:51 PM Hongtao Liu wrote: > > On Sat, Sep 11, 2021 at 4:25 PM Richard Biener via Gcc-patches > wrote: > > > > On September 10, 2021 11:27:16 PM GMT+02:00, Segher Boessenkool > > wrote: > > >On Fri, Sep 10, 2021 at 08:36:12PM +0200, Richard Biener wrote: > > >> On Septem

Re: [PATCH 2/2] validate_subreg before call gen_lowpart to avoid ICE.

2021-09-12 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 13, 2021 at 2:11 PM Richard Biener via Gcc-patches wrote: > > On Fri, Sep 10, 2021 at 2:58 PM liuhongt wrote: > > > > gcc/ChangeLog: > > > > * expmed.c (extract_bit_field_using_extv): validate_subreg > > before call gen_lowpart. > > --- > > gcc/expmed.c | 6 +- > >

Re: [PATCH 2/2] validate_subreg before call gen_lowpart to avoid ICE.

2021-09-13 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 13, 2021 at 5:15 PM Richard Biener wrote: > > On Mon, Sep 13, 2021 at 8:26 AM Hongtao Liu wrote: > > > > On Mon, Sep 13, 2021 at 2:11 PM Richard Biener via Gcc-patches > > wrote: > > > > > > On Fri, Sep 10, 2021 at 2:58 PM liuhongt wrote: > > > > > > > > gcc/ChangeLog: > > > > > > >

Re: [PATCH] Relax condition of (vec_concat:M(vec_select op0 idx0)(vec_select op0 idx1)) to allow different modes between op0 and M, but have same inner mode.

2021-09-13 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 13, 2021 at 10:10 PM Jeff Law via Gcc-patches wrote: > > > > On 9/9/2021 10:36 PM, liuhongt via Gcc-patches wrote: > >Currently for (vec_concat:M (vec_select op0 idx1)(vec_select op0 idx2)), > > optimizer wouldn't simplify if op0 has different mode with M, but that's too > > restri

Re: [PATCH] Relax condition of (vec_concat:M(vec_select op0 idx0)(vec_select op0 idx1)) to allow different modes between op0 and M, but have same inner mode.

2021-09-13 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 13, 2021 at 10:10 PM Jeff Law via Gcc-patches wrote: > > > > On 9/9/2021 10:36 PM, liuhongt via Gcc-patches wrote: > >Currently for (vec_concat:M (vec_select op0 idx1)(vec_select op0 idx2)), > > optimizer wouldn't simplify if op0 has different mode with M, but that's too > > restri

Re: [PATCH] i386: Fix up @xorsign3_1 [PR102224]

2021-09-13 Thread Hongtao Liu via Gcc-patches
On Tue, Sep 14, 2021 at 8:58 AM Andrew Pinski wrote: > > On Wed, Sep 8, 2021 at 2:55 AM Hongtao Liu via Gcc-patches > wrote: > > > > On Wed, Sep 8, 2021 at 5:33 PM Jakub Jelinek wrote: > > > > > > On Wed, Sep 08, 2021 at 05:23:40PM +0800, Hongtao Liu wro

Re: [PATCH] i386: Fix up @xorsign3_1 [PR102224]

2021-09-13 Thread Hongtao Liu via Gcc-patches
On Tue, Sep 14, 2021 at 10:06 AM Hongtao Liu wrote: > > On Tue, Sep 14, 2021 at 8:58 AM Andrew Pinski wrote: > > > > On Wed, Sep 8, 2021 at 2:55 AM Hongtao Liu via Gcc-patches > > wrote: > > > > > > On Wed, Sep 8, 2021 at 5:33 PM Jakub Jelinek wrote:

Re: [PATCH 16/62] AVX512FP16: Add vsqrtph/vrsqrtph/vsqrtsh/vrsqrtsh.

2021-09-13 Thread Hongtao Liu via Gcc-patches
i'm going to commit 8 patches: [PATCH 16/62] AVX512FP16: Add vsqrtph/vrsqrtph/vsqrtsh/vrsqrtsh. [PATCH 17/62] AVX512FP16: Add testcase for vsqrtph/vsqrtsh/vrsqrtph/vrsqrtsh. [PATCH 18/62] AVX512FP16: Add vrcpph/vrcpsh/vscalefph/vscalefsh. [PATCH 19/62] AVX512FP16: Add testcase for vrcpph/vrcpsh/vs

Re: [PATCH 24/62] AVX512FP16: Add vmovw/vmovsh.

2021-09-15 Thread Hongtao Liu via Gcc-patches
I'm going to check in 6 patches [PATCH 24/62] AVX512FP16: Add vmovw/vmovsh. [PATCH 25/62] AVX512FP16: Add testcase for vmovsh/vmovw. [PATCH 26/62] AVX512FP16: Add vcvtph2dq/vcvtph2qq/vcvtph2w/vcvtph2uw/vcvtph2uqq/vcvtph2udq [PATCH 27/62] AVX512FP16: Add testcase for vcvtph2w/vcvtph2uw/vcvtph2dq/vc

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-16 Thread Hongtao Liu via Gcc-patches
On Thu, Sep 16, 2021 at 4:23 PM Richard Biener via Gcc-patches wrote: > > On Thu, 16 Sep 2021, liuhongt wrote: > > > Ping > > rebased on latest trunk. > > > > gcc/ChangeLog: > > > > * common.opt (ftree-vectorize): Add Var(flag_tree_vectorize). > > * doc/invoke.texi (Options That Contro

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-16 Thread Hongtao Liu via Gcc-patches
On Thu, Sep 16, 2021 at 8:31 PM Richard Biener wrote: > > On Thu, 16 Sep 2021, Hongtao Liu wrote: > > > On Thu, Sep 16, 2021 at 4:23 PM Richard Biener via Gcc-patches > > wrote: > > > > > > On Thu, 16 Sep 2021, liuhongt wrote: > > > > > > > Ping > > > > rebased on latest trunk. > > > > > > > > gc

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-17 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 17, 2021 at 3:47 PM Richard Biener wrote: > > On Fri, 17 Sep 2021, Hongtao Liu wrote: > > > On Thu, Sep 16, 2021 at 8:31 PM Richard Biener wrote: > > > > > > On Thu, 16 Sep 2021, Hongtao Liu wrote: > > > > > > > On Thu, Sep 16, 2021 at 4:23 PM Richard Biener via Gcc-patches > > > > w

Re: [PATCH 30/62] AVX512FP16: Add vcvtsh2si/vcvtsh2usi/vcvtsi2sh/vcvtusi2sh.

2021-09-17 Thread Hongtao Liu via Gcc-patches
I'm going to check in 10 patches. [PATCH 30/62] AVX512FP16: Add vcvtsh2si/vcvtsh2usi/vcvtsi2sh/vcvtusi2sh. [PATCH 31/62] AVX512FP16: Add testcase for vcvtsh2si/vcvtsh2usi/vcvtsi2sh/vcvtusi2sh. [PATCH 32/62] AVX512FP16: Add vcvttph2w/vcvttph2uw/vcvttph2dq/vcvttph2qq/vcvttph2udq/vcvttph2uqq [PATCH 3

Re: [PATCH 3/4] [PATCH 3/4] x86: Properly handle USE_VECTOR_FP_CONVERTS/USE_VECTOR_CONVERTS

2021-09-17 Thread Hongtao Liu via Gcc-patches
On Sat, Sep 18, 2021 at 7:50 AM Jakub Jelinek via Gcc-patches wrote: > > On Fri, Sep 17, 2021 at 08:35:57AM +0200, Uros Bizjak via Gcc-patches wrote: > > > > On Wed, Sep 15, 2021 at 10:10 AM wrote: > > > > > > > > > > From: "H.J. Lu" > > > > > > > > > > Check TARGET_USE_VECTOR_FP_CONVERTS or > >

Re: [PATCH 40/62] AVX512FP16: Add vfmaddsub[132, 213, 231]ph/vfmsubadd[132, 213, 231]ph.

2021-09-17 Thread Hongtao Liu via Gcc-patches
Hi: I'm going to check the 9 patches below. Bootstrapped and regtest on x86_64-pc-linux-gnu{-m32,}. Newly added testcase passed on sde{-m32,}. [PATCH 40/62] AVX512FP16: Add vfmaddsub[132, 213, 231]ph/vfmsubadd[132, 213, 231]ph. [PATCH 41/62] AVX512FP16: Add testcase for vfmaddsub[132, 213, 2

Re: [PATCH] [i386] Fix ICE in pass_rpad.

2021-09-18 Thread Hongtao Liu via Gcc-patches
On Sat, Sep 18, 2021 at 3:31 PM Jakub Jelinek wrote: > > On Sat, Sep 18, 2021 at 11:09:32AM +0800, liuhongt wrote: > > Besides conversion instructions, pass_rpad also handles scalar > > sqrt/rsqrt/rcp/round instructions, while r12-3614 should only want to > > handle conversion instructions, so fix

Re: [PATCH] libgcc, i386: Export *hf* and *hc* from libgcc_s.so.1

2021-09-18 Thread Hongtao Liu via Gcc-patches
On Sunday, September 19, 2021, Iain Sandoe wrote: > Hi Hongtao, > > > On 8 Sep 2021, at 10:31, Hongtao Liu wrote: > > > > On Wed, Sep 8, 2021 at 5:09 PM Jakub Jelinek wrote: > >> > >> On Wed, Sep 08, 2021 at 10:37:17AM +0800, Hongtao Liu wrote: > >>> Bootstrapped and regtested on x86_64-linux-

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-21 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 20, 2021 at 4:13 AM Martin Sebor wrote: > > On 9/16/21 3:03 AM, Hongtao Liu via Gcc-patches wrote: > > On Thu, Sep 16, 2021 at 4:23 PM Richard Biener via Gcc-patches > > wrote: > >> > >> On Thu, 16 Sep 2021, liuhongt wrote: > >

Re: [PATCH] Allow different vector types for stmt groups

2021-09-21 Thread Hongtao Liu via Gcc-patches
On Tue, Sep 21, 2021 at 10:55 PM H.J. Lu wrote: > > On Mon, Sep 20, 2021 at 5:15 AM Richard Biener via Gcc-patches > wrote: > > > > This allows vectorization (in practice non-loop vectorization) to > > have a stmt participate in different vector type vectorizations. > > It allows us to remove vec

Re: [PATCH 49/62] AVX512FP16: Add vfcmaddcph/vfmaddcph/vfcmulcph/vfmulcph

2021-09-21 Thread Hongtao Liu via Gcc-patches
I'm going to check in 7 patches. [PATCH 49/62] AVX512FP16: Add vfcmaddcph/vfmaddcph/vfcmulcph/vfmulcph [PATCH 50/62] AVX512FP16: Add testcases for vfcmaddcph/vfmaddcph/vfcmulcph/vfmulcph. [PATCH 51/62] AVX512FP16: Add vfcmaddcsh/vfmaddcsh/vfcmulcsh/vfmulcsh. [PATCH 52/62] AVX512FP16: Add testcases

Re: [PATCH 59/62] AVX512FP16: Support load/store/abs intrinsics.

2021-09-22 Thread Hongtao Liu via Gcc-patches
I'm going to check in 4 patches. [PATCH 59/62] AVX512FP16: Support load/store/abs intrinsics. [PATCH 60/62] AVX512FP16: Add reduce operators(add/mul/min/max). [PATCH 61/62] AVX512FP16: Add complex conjugation intrinsic instructions. [PATCH 62/62] AVX512FP16: Add permutation and mask blend intrinsi

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-22 Thread Hongtao Liu via Gcc-patches
On Wed, Sep 22, 2021 at 10:21 PM Martin Sebor wrote: > > On 9/21/21 7:38 PM, Hongtao Liu wrote: > > On Mon, Sep 20, 2021 at 4:13 AM Martin Sebor wrote: > ... > > diff --git a/gcc/testsuite/c-c++-common/Wstringop-overflow-2.c > > b/gcc/testsuite/c-c++-common/Wstringop-overflow-2.c > >

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-22 Thread Hongtao Liu via Gcc-patches
On Thu, Sep 23, 2021 at 9:48 AM Hongtao Liu wrote: > > On Wed, Sep 22, 2021 at 10:21 PM Martin Sebor wrote: > > > > On 9/21/21 7:38 PM, Hongtao Liu wrote: > > > On Mon, Sep 20, 2021 at 4:13 AM Martin Sebor wrote: > > ... > > > diff --git a/gcc/testsuite/c-c++-common/Wstringop-overflow-2.c >

Re: [PATCH] [GCC12] Mention Intel AVX512-FP16 and _Float16 support.

2021-09-23 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 24, 2021 at 9:20 AM H.J. Lu wrote: > > On Thu, Sep 23, 2021 at 6:17 PM liuhongt wrote: > > > > Updated, mention _Float16 support. > > > > --- > > htdocs/gcc-12/changes.html | 13 - > > 1 file changed, 12 insertions(+), 1 deletion(-) > > > > diff --git a/htdocs/gcc-12/chan

Re: [PATCH] [GCC12] Mention Intel AVX512-FP16 and _Float16 support.

2021-09-23 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 24, 2021 at 9:42 AM H.J. Lu wrote: > > On Thu, Sep 23, 2021 at 6:38 PM Hongtao Liu wrote: > > > > On Fri, Sep 24, 2021 at 9:20 AM H.J. Lu wrote: > > > > > > On Thu, Sep 23, 2021 at 6:17 PM liuhongt wrote: > > > > > > > > Updated, mention _Float16 support. > > > > > > > > --- > > > >

Re: [PATCH] AVX512FP16: Support cond_op for HFmode

2021-09-23 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 24, 2021 at 10:16 AM Hongyu Wang via Gcc-patches wrote: > > > >This patch extend the expanders for cond_op to support vector HF modes. > > >bootstraped and regtested on x86_64-pc-linux-gnu{-m32,}. > > Do runtime tests passe on sde{-m32,}? > > Yes, forgot to mention this. > > Liu, Hongt

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-23 Thread Hongtao Liu via Gcc-patches
On Thu, Sep 23, 2021 at 11:18 PM Martin Sebor wrote: > > On 9/23/21 12:30 AM, Richard Biener wrote: > > On Thu, 23 Sep 2021, Hongtao Liu wrote: > > > >> On Thu, Sep 23, 2021 at 9:48 AM Hongtao Liu wrote: > >>> > >>> On Wed, Sep 22, 2021 at 10:21 PM Martin Sebor wrote: > > On 9/21/21 7:

Re: [PATCH] Relax condition of (vec_concat:M(vec_select op0 idx0)(vec_select op0 idx1)) to allow different modes between op0 and M, but have same inner mode.

2021-09-24 Thread Hongtao Liu via Gcc-patches
ping On Mon, Sep 13, 2021 at 11:19 PM Hongtao Liu wrote: > > On Mon, Sep 13, 2021 at 10:10 PM Jeff Law via Gcc-patches > wrote: > > > > > > > > On 9/9/2021 10:36 PM, liuhongt via Gcc-patches wrote: > > >Currently for (vec_concat:M (vec_select op0 idx1)(vec_select op0 > > > idx2)), > > > opt

Re: [PATCH] Relax condition of (vec_concat:M(vec_select op0 idx0)(vec_select op0 idx1)) to allow different modes between op0 and M, but have same inner mode.

2021-09-27 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 24, 2021 at 9:08 PM Segher Boessenkool wrote: > > On Mon, Sep 13, 2021 at 04:24:13PM +0200, Richard Biener wrote: > > On Mon, Sep 13, 2021 at 4:10 PM Jeff Law via Gcc-patches > > wrote: > > > I'm not convinced that we need the inner mode to match anything. As > > > long as the vec_co

Re: [PATCH] [GIMPLE] Simplify (_Float16) ceil ((double) x) to .CEIL (x) when available.

2021-09-27 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 27, 2021 at 8:53 PM Richard Biener wrote: > > On Fri, Sep 24, 2021 at 1:26 PM liuhongt wrote: > > > > Hi: > > Related discussion in [1] and PR. > > > > Bootstrapped and regtest on x86_64-linux-gnu{-m32,}. > > Ok for trunk? > > > > [1] https://gcc.gnu.org/pipermail/gcc-patches/20

Re: [r12-3893 Regression] FAIL: gcc.target/i386/vect-pr97352.c scan-assembler-times vmov.pd 4 on Linux/x86_64

2021-09-28 Thread Hongtao Liu via Gcc-patches
On Tue, Sep 28, 2021 at 2:59 PM Richard Biener via Gcc-patches wrote: > > On Mon, 27 Sep 2021, sunil.k.pandey wrote: > > > On Linux/x86_64, > > > > 6390c5047adb75960f86d56582e6322aaa4d9281 is the first bad commit > > commit 6390c5047adb75960f86d56582e6322aaa4d9281 > > Author: Richard Biener > > D

Re: [PATCH] Adjust testcase for O2 vect.

2021-10-28 Thread Hongtao Liu via Gcc-patches
On Fri, Oct 29, 2021 at 12:20 AM Martin Sebor via Gcc-patches wrote: > > On 10/28/21 1:23 AM, liuhongt via Gcc-patches wrote: > > Adjust code in check_vect_slp_aligned_store_usage to make it an exact > > pattern match of the corresponding testcases. > > These new target/xfail selectors are added a

Re: [PATCH] Enable vectorization for _Float16 floor/ceil/trunc/nearbyint/rint operations.

2021-10-28 Thread Hongtao Liu via Gcc-patches
On Thu, Oct 28, 2021 at 10:26 AM Hongtao Liu wrote: > > On Mon, Oct 25, 2021 at 4:24 PM liuhongt wrote: > > > > Bootstrapped and regtested on x86_64-pc-linux-gnu{-m32,}. > > Ok for trunk? > > > I'm going to check in this patch if there's no objection. Committed. > > gcc/ChangeLog: > > > >

Re: [PATCH] Adjust testcase for O2 vect.

2021-10-28 Thread Hongtao Liu via Gcc-patches
On Fri, Oct 29, 2021 at 10:34 AM Martin Sebor wrote: > > On 10/28/21 7:47 PM, Hongtao Liu wrote: > > On Fri, Oct 29, 2021 at 12:20 AM Martin Sebor via Gcc-patches > > wrote: > >> > >> On 10/28/21 1:23 AM, liuhongt via Gcc-patches wrote: > >>> Adjust code in check_vect_slp_aligned_store_usage to m

Re: [PATCH] i386: Fix wrong result for AMX-TILE intrinsic when parsing expression.

2021-11-03 Thread Hongtao Liu via Gcc-patches
On Thu, Nov 4, 2021 at 9:19 AM Hongyu Wang via Gcc-patches wrote: > > Hi, > > _tile_loadd, _tile_stored, _tile_streamloadd intrinsics are defined by > macro, so the parameters should be wrapped by parentheses to accept > expressions. > > Bootstraped/regtested on x86_64-pc-linux-gnu{-m32,} and sde.

Re: [PATCH] i386: Fix wrong result for AMX-TILE intrinsic when parsing expression.

2021-11-03 Thread Hongtao Liu via Gcc-patches
On Thu, Nov 4, 2021 at 11:25 AM Hongyu Wang wrote: > > > Could you add a testcase for that? > > Yes, updated patch. > > Hongtao Liu via Gcc-patches 于2021年11月4日周四 上午10:25写道: > > > > On Thu, Nov 4, 2021 at 9:19 AM Hongyu Wang via Gcc-patches > > wrote:

Re: [PATCH] i386: Auto vectorize sdot_prod, usdot_prod with VNNI instruction.

2021-11-03 Thread Hongtao Liu via Gcc-patches
On Thu, Nov 4, 2021 at 9:50 AM Hongyu Wang via Gcc-patches wrote: > > Hi, > > AVX512VNNI/AVXVNNI has vpdpwssd for HImode, vpdpbusd for QImode, so > Adjust HImode sdot_prod expander and add QImode usdot_prod expander > to enhance vectorization for dotprod. > > Bootstraped/regtested on x86_64-pc-lin

Re: [PATCH 1/2] [Gimple] Simplify (trunc)fmax/fmin((extend)a, (extend)b) to MAX/MIN(a,b)

2021-11-07 Thread Hongtao Liu via Gcc-patches
On Fri, Nov 5, 2021 at 5:52 PM Richard Biener wrote: > > On Fri, Nov 5, 2021 at 6:38 AM liuhongt wrote: > > > > a and b are same type as trunc type and has less precision than > > extend type, the transformation is guarded by flag_finite_math_only. > > > > Bootstrapped and regtested under x86_64-

Re: [PATCH] i386: Support complex fma/conj_fma for _Float16.

2021-11-07 Thread Hongtao Liu via Gcc-patches
On Fri, Nov 5, 2021 at 3:09 PM Kong, Lingling via Gcc-patches wrote: > > Hi, > > This patch is to support cmla_optab, cmul_optab, cmla_conj_optab, > cmul_conj_optab for vector _Float16. > Ok for master? LGTM. > gcc/ChangeLog: > > * config/i386/sse.md (cmul3): add new define_expand. >

Re: [PATCH] i386: Optimization for mm512_set1_pch.

2021-11-07 Thread Hongtao Liu via Gcc-patches
On Fri, Nov 5, 2021 at 3:20 PM Kong, Lingling via Gcc-patches wrote: > > Hi, > > This patch is to support fold _mm512_fmadd_pch (a, _mm512_set1_pch(*(b)), c) > to 1 instruction vfmaddcph (%rsp){1to16}, %zmm1, %zmm2. > OK for master? > LGTM. > gcc/ChangeLog: > > * config/i386/sse.md (fma__

<    1   2   3   4   5   6   7   8   9   10   >