Re: [PATCH] Optimize vec_setv8{hi,hf}_0 + pmovzxbq to pmovzxbq.

2022-05-08 Thread Hongtao Liu via Gcc-patches
On Mon, May 9, 2022 at 2:43 PM liuhongt via Gcc-patches wrote: > > Clean up of 16-bit uppers is not needed for pmovzxbq/pmovsxbq. > > Bootstrapped and regtested on x86_64-pc-linux-gnu{-m32,}. > Ok for trunk? > > gcc/ChangeLog: > > PR target/105072 > * config/i386/sse.md (*sse4_1_v2

Re: [PATCH] [i386] Implement permutation with pslldq + psrldq + por when pshufb is not available.

2022-05-09 Thread Hongtao Liu via Gcc-patches
On Mon, May 9, 2022 at 4:19 PM Uros Bizjak wrote: > > On Mon, May 9, 2022 at 7:24 AM Hongtao Liu wrote: > > > > On Mon, May 9, 2022 at 1:22 PM liuhongt via Gcc-patches > > wrote: > > > > > > pand/pandn may be used to clear upper/lower bits of the oper

Re: [PATCH] [i386] Optimize movzwl + vmovd/vmovq to vmovw.

2022-05-10 Thread Hongtao Liu via Gcc-patches
On Mon, May 9, 2022 at 4:28 PM Uros Bizjak wrote: > > On Mon, May 9, 2022 at 4:03 AM liuhongt wrote: > > > > Similarly optimize movl + vmovq to vmovd. > > > > Bootstrapped and regtested on x86_64-pc-linux-gnu{-m32,}. > > Ok for trunk? > > > > gcc/ChangeLog: > > > > PR target/104915 > >

Re: [PATCH v2] Strip of a vector load which is only used partially.

2022-05-11 Thread Hongtao Liu via Gcc-patches
On Tue, May 10, 2022 at 2:54 PM Richard Biener via Gcc-patches wrote: > > On Mon, May 9, 2022 at 7:11 AM liuhongt via Gcc-patches > wrote: > > > > Here's adjusted patch. > > Ok for trunk? > > > > Optimize > > > > _4 = VEC_PERM_EXPR <_1, _1, { 4, 5, 6, 7, 4, 5, 6, 7 }>; > > _5 = BIT_FIELD_REF <

Re: [PATCH] [Middle-end] Enhance final_value_replacement_loop to handle bitwise induction.

2022-05-12 Thread Hongtao Liu via Gcc-patches
On Wed, May 11, 2022 at 4:45 PM Richard Biener via Gcc-patches wrote: > > On Mon, May 9, 2022 at 7:19 AM liuhongt wrote: > > > > This patch will enable below optimization: > > > > { > > - int bit; > > - long long unsigned int _1; > > - long long unsigned int _2; > > - > > [local count: 46

Re: [PATCH] Expand __builtin_memcmp_eq with ptest for OImode.

2022-05-15 Thread Hongtao Liu via Gcc-patches
ping. On Sat, May 7, 2022 at 1:05 PM liuhongt via Gcc-patches wrote: > > This is adjusted patch only for OImode. > > Bootstrapped and regtested on x86_64-pc-linux-gnu{-m32,}. > Ok for trunk? > > gcc/ChangeLog: > > PR target/104610 > * config/i386/i386-expand.cc (ix86_expand_branch

Re: [PATCH] Expand __builtin_memcmp_eq with ptest for OImode.

2022-05-16 Thread Hongtao Liu via Gcc-patches
On Mon, May 16, 2022 at 5:21 PM Uros Bizjak via Gcc-patches wrote: > > On Sat, May 7, 2022 at 7:05 AM liuhongt wrote: > > > > This is adjusted patch only for OImode. > > > > Bootstrapped and regtested on x86_64-pc-linux-gnu{-m32,}. > > Ok for trunk? > > > > gcc/ChangeLog: > > > > PR targe

Re: [PATCH v2] Optimize vpermtiw/b to vpunpcklqdq for certain cases.

2022-05-16 Thread Hongtao Liu via Gcc-patches
I've committed the patch. On Fri, May 13, 2022 at 5:22 PM liuhongt via Gcc-patches wrote: > > Here's updated patch which adds ix86_pre_reload_split () to those 2 > define_insn_and_splits. > > Assembly Optimization like: > - vmovq %xmm0, %xmm2 > - vmovdqa .LC0(%rip), %xmm0 >

Re: [PATCH] [i386] recognize bzhi pattern when there's zero_extendsidi.

2022-05-16 Thread Hongtao Liu via Gcc-patches
On Tue, May 17, 2022 at 11:06 AM liuhongt via Gcc-patches wrote: > > backend has > > 16550(define_insn "*bmi2_bzhi_3_2" > 16551 [(set (match_operand:SWI48 0 "register_operand" "=r") > 16552(and:SWI48 > 16553 (plus:SWI48 > 16554(ashift:SWI48 (const_int 1) > 16555
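For readers unfamiliar with the instruction: BZHI copies its source operand and clears every bit from a given index upward. The and/plus/ashift shape visible in the quoted pattern corresponds to a familiar source-level idiom, sketched below in C. The helper name `low_bits` is hypothetical and not taken from the patch, and whether GCC actually selects bzhi for it depends on target flags (for example -mbmi2) and compiler version.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper, for illustration only: keep the low n bits of x
   and clear the rest.  Valid for n < 64, because shifting a 64-bit
   value by 64 or more is undefined behaviour in C. */
static uint64_t low_bits(uint64_t x, unsigned n)
{
  assert(n < 64);
  return x & ((UINT64_C(1) << n) - 1);
}
```

The mask `(1 << n) - 1` has exactly the n lowest bits set, so the AND keeps those bits of x and zeroes everything above them.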

Re: [committed] forwprop: Fix a typo and comment formatting

2022-05-17 Thread Hongtao Liu via Gcc-patches
thanks. On Tue, May 17, 2022 at 3:09 PM Jakub Jelinek via Gcc-patches wrote: > > Hi! > > When looking around the spot of the PR105591 fix, I've noticed a typo > and incorrectly formatted comment. > > Bootstrapped/regtested on x86_64-linux and i686-linux, committed to > trunk as obvious. > > 2022-

Re: [PATCH] [Middle-end] Enhance final_value_replacement_loop to handle bitwise induction.

2022-05-17 Thread Hongtao Liu via Gcc-patches
On Fri, May 13, 2022 at 7:16 PM Richard Biener wrote: > > On Fri, May 13, 2022 at 5:37 AM Hongtao Liu wrote: > > > > On Wed, May 11, 2022 at 4:45 PM Richard Biener via Gcc-patches > > wrote: > > > > > > On Mon, May 9, 2022 at 7:19 AM liuhongt wrote:

Re: [PATCH] Expand __builtin_memcmp_eq with ptest for OImode.

2022-05-17 Thread Hongtao Liu via Gcc-patches
On Tue, May 17, 2022 at 6:03 PM Uros Bizjak wrote: > > On Tue, May 17, 2022 at 3:33 AM Hongtao Liu wrote: > > > > On Mon, May 16, 2022 at 5:21 PM Uros Bizjak via Gcc-patches > > wrote: > > > > > > On Sat, May 7, 2022 at 7:05 AM liuhongt wrote: >

Re: [PATCH] [i386] recognize bzhi pattern when there's zero_extendsidi.

2022-05-17 Thread Hongtao Liu via Gcc-patches
On Tue, May 17, 2022 at 6:07 PM Uros Bizjak via Gcc-patches wrote: > > On Tue, May 17, 2022 at 5:06 AM liuhongt wrote: > > > > backend has > > > > 16550(define_insn "*bmi2_bzhi_3_2" > > 16551 [(set (match_operand:SWI48 0 "register_operand" "=r") > > 16552(and:SWI48 > > 16553 (pl

Re: [PATCH] Add a bit dislike for separate mem alternative when op is REG_P.

2022-05-24 Thread Hongtao Liu via Gcc-patches
On Wed, May 25, 2022 at 11:39 AM liuhongt via Gcc-patches wrote: > > Right now, mem_cost for separate mem alternative is 1 * frequency which > is pretty small and caused the unnecessary SSE spill in the PR, I've tried > to rework backend cost model, but RA is still not happy with that (regress > somewh

Re: [PATCH][PR target/97540] Don't extract memory from operand for normal memory constraint.

2020-11-01 Thread Hongtao Liu via Gcc-patches
On Fri, Oct 30, 2020 at 1:00 AM Richard Sandiford wrote: > > I guess my main objection is that we have a special memory constraint > that isn't in fact matching a MEM (at least not directly). That seems > odd and feels like it's going to come back to bite us. > > From an RTL perspective, the MEM

Re: PING [Patch] x86: Enable GCC support for Intel AVX-VNNI extension

2020-11-03 Thread Hongtao Liu via Gcc-patches
rnal/us/en/documents/architecture-instruction-set-extensions-programming-reference.pdf > > > > Bootstrap ok, regression test on i386/x86 backend is ok. > > > > OK for master? > > > > 2020-10-13 Hongtao Liu > > Hongyu Wang > > > > gcc/ >

Re: [PATCH][PR target/97540] Don't extract memory from operand for normal memory constraint.

2020-11-03 Thread Hongtao Liu via Gcc-patches
On Tue, Nov 3, 2020 at 9:51 PM Richard Sandiford wrote: > > Vladimir Makarov via Gcc-patches writes: > > On 2020-10-27 2:53 a.m., Hongtao Liu wrote: > >> Hi: > >>For inline asm, there could be an operand like (not (mem:)), it's > >> not

[PATCH][PR target/97642] Fix incorrect replacement of vmovdqu32 with vpblendd.

2020-11-04 Thread Hongtao Liu via Gcc-patches
Hi: When programmers explicitly use masked load intrinsics, don't transform the instruction to vpblend{b,w,d,q}: if mem_addr points to a memory region with less than a whole vector of accessible memory, the mask prevents reading the inaccessible bytes, which avoids a fault. Boo

Re: [Patch] x86: Enable GCC support for Intel AVX-VNNI extension

2020-11-09 Thread Hongtao Liu via Gcc-patches
it |= OPTION_MASK_ISA2_AVX2_UNSET; > } >return true; > Yes. > No review for the sse.md and for testcases. > > Uros. Update the patch based on latest trunk. -- BR, Hongtao From 881868b8c9f5925c63a953454f45f5e0a3c8ea4f Mon Sep 17 00:00:00 2001 From: liuhongt Date: T

Re: [Patch] x86: Enable GCC support for Intel AVX-VNNI extension

2020-11-10 Thread Hongtao Liu via Gcc-patches
On Mon, Nov 9, 2020 at 8:26 PM Uros Bizjak wrote: > > On Mon, Nov 9, 2020 at 11:31 AM Hongtao Liu wrote: > > > > > > > > + /* Support unified builtin. */ > > > + || (mask2 == OPTION_MASK_ISA2_AVXVNNI) > > > > > > I don

Re: Enable MOVDIRI, MOVDIR64B, CLDEMOTE and WAITPKG for march=tremont

2020-11-10 Thread Hongtao Liu via Gcc-patches
On Tue, Nov 10, 2020 at 3:22 AM Jason Merrill via Gcc-patches wrote: > > This patch was also applied to the GCC 9 and 10 branches and breaks those > builds, because PTA_CLDEMOTE is not defined. > Mine, let me fix it, sorry for that. -- BR, Hongtao

Re: Enable MOVDIRI, MOVDIR64B, CLDEMOTE and WAITPKG for march=tremont

2020-11-10 Thread Hongtao Liu via Gcc-patches
On Tue, Nov 10, 2020 at 4:17 PM Hongtao Liu wrote: > > On Tue, Nov 10, 2020 at 3:22 AM Jason Merrill via Gcc-patches > wrote: > > > > This patch was also applied to the GCC 9 and 10 branches and breaks those > > builds, because PTA_CLDEMOTE is not defined. > >

Re: [PATCH] [PR target/97194] [AVX2] Support variable index vec_set.

2020-11-11 Thread Hongtao Liu via Gcc-patches
ping ^3 Rebase patch on latest trunk. On Tue, Oct 27, 2020 at 3:51 PM Hongtao Liu wrote: > > ping^1 > > On Tue, Oct 20, 2020 at 3:36 PM Richard Biener > wrote: > > > > On Tue, Oct 20, 2020 at 4:35 AM Hongtao Liu wrote: > > > > > > On Mon, Oct 19

[PATCH,wwwdocs] gcc-11/changes: Mention Intel AVX-VNNI

2020-11-11 Thread Hongtao Liu via Gcc-patches
[GCC-11] Mention Intel AVX-VNNI and add it to ALDERLAKE and SAPPHIRERAPIDS, also add HRESET to ALDERLAKE. diff --git a/htdocs/gcc-11/changes.html b/htdocs/gcc-11/changes.html index d7a3a1f9..fc4c74f4 100644 --- a/htdocs/gcc-11/changes.html +++ b/htdocs/gcc-11/changes.html @@ -267,15 +267,20 @@ a wo

Re: [PATCH] [PR target/97194] [AVX2] Support variable index vec_set.

2020-11-11 Thread Hongtao Liu via Gcc-patches
On Wed, Nov 11, 2020 at 4:45 PM Uros Bizjak wrote: > > > gcc/ChangeLog: > > > > PR target/97194 > > * config/i386/i386-expand.c (ix86_expand_vector_set_var): New function. > > * config/i386/i386-protos.h (ix86_expand_vector_set_var): New Decl. > > * config/i386/predicates.md (vec_setm_operand): Ne

Re: [PATCH] [PR target/97194] [AVX2] Support variable index vec_set.

2020-11-12 Thread Hongtao Liu via Gcc-patches
On Thu, Nov 12, 2020 at 4:21 PM Uros Bizjak wrote: > > On Thu, Nov 12, 2020 at 3:04 AM Hongtao Liu wrote: > > > > > gcc/ChangeLog: > > > > > > > > PR target/97194 > > > > * config/i386/i386-expand.c (ix86_expand_vector_se

Re: [PATCH] [PR target/97194] [AVX2] Support variable index vec_set.

2020-11-12 Thread Hongtao Liu via Gcc-patches
On Thu, Nov 12, 2020 at 5:12 PM Hongtao Liu wrote: > > On Thu, Nov 12, 2020 at 4:21 PM Uros Bizjak wrote: > > > > On Thu, Nov 12, 2020 at 3:04 AM Hongtao Liu wrote: > > > > > > > gcc/ChangeLog: > > > > > > > > > > PR target/97

Re: [PATCH] [PR target/97194] [AVX2] Support variable index vec_set.

2020-11-12 Thread Hongtao Liu via Gcc-patches
On Thu, Nov 12, 2020 at 5:15 PM Hongtao Liu wrote: > > On Thu, Nov 12, 2020 at 5:12 PM Hongtao Liu wrote: > > > > On Thu, Nov 12, 2020 at 4:21 PM Uros Bizjak wrote: > > > > > > On Thu, Nov 12, 2020 at 3:04 AM Hongtao Liu wrote: > > > > > >

Re: [PATCH,wwwdocs] gcc-11/changes: Mention Intel AVX-VNNI

2020-11-12 Thread Hongtao Liu via Gcc-patches
Got it. On Fri, Nov 13, 2020 at 3:26 AM Gerald Pfeifer wrote: > > On Wed, 11 Nov 2020, Hongtao Liu via Gcc-patches wrote: > > + New ISA extension support for Intel AVX-VNNI was added to GCC. > > More for the future (i.e., no need to change that now): I suggest > to skip

Re: [committed] wwwdocs: Editorial changes around x86-64 ISA extensions

2020-11-12 Thread Hongtao Liu via Gcc-patches
On Fri, Nov 13, 2020 at 3:32 AM Gerald Pfeifer wrote: > > Per our discussion on the list (plus a grammar improvement in a > section above). > > One question: why are the ISA extension lists not alphabetically > sorted? Wouldn't that be beneficial for users? Easier to find > something and also ea

Re: [PATCH][AVX512]Lower AVX512 vector compare to AVX version when dest is vector

2020-11-16 Thread Hongtao Liu via Gcc-patches
On Tue, Nov 17, 2020 at 8:05 AM Jeff Law wrote: > > > On 9/2/20 3:34 AM, Hongtao Liu via Gcc-patches wrote: > > Hi: > > Add define_peephole2 to eliminate potential redundant conversion > > from mask to vector. > > Bootstrap is ok, regression test is ok for

Re: [PATCH] Check the type of mask while generating cond_op in gimple simplication.

2021-08-31 Thread Hongtao Liu via Gcc-patches
On Tue, Aug 31, 2021 at 7:56 PM Richard Biener wrote: > > On Tue, Aug 31, 2021 at 12:18 PM Hongtao Liu wrote: > > > > On Mon, Aug 30, 2021 at 8:25 PM Richard Biener via Gcc-patches > > wrote: > > > > > > On Fri, Aug 27, 2021 at 8:53 AM liuh

Re: [PATCH V3 0/6] Initial support for AVX512FP16

2021-09-01 Thread Hongtao Liu via Gcc-patches
I'm going to check in the first 3 patches which are already approved. Update hf soft-fp from glibc. [i386] Enable _Float16 type for TARGET_SSE2 and above. [i386] libgcc: Enable hfmode soft-sf/df/xf/tf extensions and truncations. On Mon, Aug 2, 2021 at 2:31 PM liuhongt wrote: > > Update

Re: [PATCH 4/6] Support -fexcess-precision=16 which will enable FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when backend supports _Float16.

2021-09-01 Thread Hongtao Liu via Gcc-patches
On Tue, Aug 24, 2021 at 5:39 PM Hongtao Liu wrote: > > On Tue, Aug 17, 2021 at 9:53 AM Hongtao Liu wrote: > > > > On Fri, Aug 6, 2021 at 2:06 PM Hongtao Liu wrote: > > > > > > On Tue, Aug 3, 2021 at 10:44 AM Hongtao Liu wrote: > > > > >

Re: [PATCH] Check the type of mask while generating cond_op in gimple simplication.

2021-09-01 Thread Hongtao Liu via Gcc-patches
On Wed, Sep 1, 2021 at 8:52 PM Richard Sandiford wrote: > > Richard Biener writes: > > On Wed, Sep 1, 2021 at 8:28 AM Hongtao Liu wrote: > >> > >> On Tue, Aug 31, 2021 at 7:56 PM Richard Biener > >> wrote: > >> > > >> > On Tue, Au

[PATCH V3 0/6] Initial support for AVX512FP16

2021-09-02 Thread Hongtao Liu via Gcc-patches
On Thursday, September 2, 2021, Iain Sandoe wrote: > Hi Hongtao. > > > On 2 Sep 2021, at 07:06, Hongtao Liu via Gcc-patches < > gcc-patches@gcc.gnu.org> wrote: > > > > I'm going to check in the first 3 patches which are already approved. > > > >

Re: [PATCH V3 0/6] Initial support for AVX512FP16

2021-09-02 Thread Hongtao Liu via Gcc-patches
On Thursday, September 2, 2021, H.J. Lu wrote: > On Wed, Sep 1, 2021 at 11:00 PM Hongtao Liu wrote: > > > > I'm going to check in the first 3 patches which are already approved. > > > > Update hf soft-fp from glibc. > > [i386] Enable _Float16 type

Re: [PATCH] Explicitly add -msse2 to compile HF related libgcc source file.

2021-09-05 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 3, 2021 at 5:01 PM Jakub Jelinek via Gcc-patches wrote: > > On Fri, Sep 03, 2021 at 03:41:13PM +0800, liuhongt via Gcc-patches wrote: > > --- a/libgcc/config/i386/64/t-softfp > > +++ b/libgcc/config/i386/64/t-softfp > > @@ -1 +1,6 @@ > > softfp_extras := fixhfti fixunshfti floattihf f

Re: [PATCH] Explicitly add -msse2 to compile HF related libgcc source file.

2021-09-05 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 3, 2021 at 5:01 PM Jakub Jelinek via Gcc-patches wrote: > > On Fri, Sep 03, 2021 at 03:41:13PM +0800, liuhongt via Gcc-patches wrote: > > --- a/libgcc/config/i386/64/t-softfp > > +++ b/libgcc/config/i386/64/t-softfp > > @@ -1 +1,6 @@ > > softfp_extras := fixhfti fixunshfti floattihf f

Re: [PATCH 2/6] [i386] Enable _Float16 type for TARGET_SSE2 and above.

2021-09-05 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 3, 2021 at 8:42 PM Jakub Jelinek via Gcc-patches wrote: > > On Mon, Aug 02, 2021 at 02:31:12PM +0800, liuhongt via Gcc-patches wrote: > > * doc/extend.texi (Half-Precision Floating Point): Document > > _Float16 for x86. > > > --- a/gcc/doc/extend.texi > > +++ b/gcc/doc/exte

Re: [PATCH] x86: Add non-destructive source to @xorsign3_1

2021-09-05 Thread Hongtao Liu via Gcc-patches
On Sun, Sep 5, 2021 at 5:54 AM H.J. Lu via Gcc-patches wrote: > > Add non-destructive source alternative to @xorsign3_1 for AVX. LGTM. > > gcc/ > > PR target/89984 > * config/i386/i386-expand.c (ix86_split_xorsign): Use operands[2]. > * config/i386/i386.md (@xorsign3_1): Ad

Re: [PATCH] x86: Enable FMA in unsigned SI to SF expanders

2021-09-05 Thread Hongtao Liu via Gcc-patches
On Sun, Sep 5, 2021 at 5:56 AM H.J. Lu via Gcc-patches wrote: > > Enable FMA in scalar/vector unsigned SI to SF expanders. > > gcc/ > > PR target/85819 > * config/i386/i386-expand.c (ix86_expand_convert_uns_sisf_sse): > Enable FMA. > (ix86_expand_vector_convert_uns_

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-06 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 6, 2021 at 4:46 PM liuhongt via Gcc-patches wrote: > > Hi: > As discussed in [1], most of (currently unopposed) targets want > auto-vectorization at O2, and IMHO now would be a good time to enable O2 > vectorization for GCC trunk, so it would leave enough time to expose > related iss

Re: [PATCH] Check the type of mask while generating cond_op in gimple simplication.

2021-09-06 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 6, 2021 at 2:54 PM Richard Biener wrote: > > On Thu, Sep 2, 2021 at 7:54 PM Richard Sandiford > wrote: > > > > Hongtao Liu via Gcc-patches writes: > > > On Wed, Sep 1, 2021 at 8:52 PM Richard Sandiford > > > wrote: > > >> > >

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-06 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 6, 2021 at 5:19 PM Richard Biener via Gcc-patches wrote: > > On Mon, Sep 6, 2021 at 10:47 AM liuhongt via Gcc-patches > wrote: > > > > Hi: > > As discussed in [1], most of (currently unopposed) targets want > > auto-vectorization at O2, and IMHO now would be a good time to enable O2

Re: [PATCH] Avoid FROM being overwritten in expand_fix.

2021-09-06 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 6, 2021 at 6:03 PM Richard Biener via Gcc-patches wrote: > > On Mon, 6 Sep 2021, liuhongt wrote: > > > Hi: > > For the conversion from _Float16 to int, if the corresponding optab > > does not exist, the compiler will try the wider mode (SFmode here), > > but when floatsfsi exists but

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-06 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 6, 2021 at 5:42 PM Jakub Jelinek via Gcc-patches wrote: > > On Mon, Sep 06, 2021 at 11:18:47AM +0200, Richard Biener wrote: > > On Mon, Sep 6, 2021 at 10:47 AM liuhongt via Gcc-patches > > wrote: > > > > > > Hi: > > > As discussed in [1], most of (currently unopposed) targets want >

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-06 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 6, 2021 at 7:01 PM Jakub Jelinek wrote: > > On Mon, Sep 06, 2021 at 06:58:37PM +0800, Hongtao Liu wrote: > > > process_options would mean it affects only the command line and not > > > __attribute__((optimize ("O2", "ftree-vectorize")))

Re: [PATCH v2] x86: Enable FMA in unsigned SI to SF expanders

2021-09-06 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 6, 2021 at 8:35 PM H.J. Lu wrote: > > On Sun, Sep 5, 2021 at 10:34 PM Hongtao Liu wrote: > > > > On Sun, Sep 5, 2021 at 5:56 AM H.J. Lu via Gcc-patches > > wrote: > > > > > > Enable FMA in scalar/vector unsigned SI to SF expanders. >

Re: [PATCH 2/6] [i386] Enable _Float16 type for TARGET_SSE2 and above.

2021-09-06 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 6, 2021 at 8:13 PM Jakub Jelinek wrote: > > On Mon, Sep 06, 2021 at 10:05:00AM +0800, Hongtao Liu wrote: > > > @@ -1076,9 +1076,11 @@ systems where @code{__float128} is supported. > > The @code{_Float32} > > type is supported on all systems supporting IE

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-06 Thread Hongtao Liu via Gcc-patches
On Tue, Sep 7, 2021 at 12:37 AM Joseph Myers wrote: > > On Mon, 6 Sep 2021, liuhongt via Gcc-patches wrote: > > > Hi: > > As discussed in [1], most of (currently unopposed) targets want > > auto-vectorization at O2, and IMHO now would be a good time to enable O2 > > vectorization for GCC trunk,

Re: [PATCH 2/6] [i386] Enable _Float16 type for TARGET_SSE2 and above.

2021-09-07 Thread Hongtao Liu via Gcc-patches
On Tue, Sep 7, 2021 at 3:18 PM Jakub Jelinek wrote: > > On Tue, Sep 07, 2021 at 09:52:57AM +0800, Hongtao Liu wrote: > > Adjust the wording for x86 _Float16 type. > > > > gcc/ChangeLog: > > > > * doc/extend.texi: (@node Floating Types): Adjust the wordin

Re: [PATCH] libgcc, i386: Export *hf* and *hc* from libgcc_s.so.1

2021-09-07 Thread Hongtao Liu via Gcc-patches
On Tue, Sep 7, 2021 at 8:29 PM Jakub Jelinek via Gcc-patches wrote: > > On Mon, Sep 06, 2021 at 10:58:53AM +0200, Jakub Jelinek via Gcc-patches wrote: > > On Mon, Sep 06, 2021 at 08:49:27AM +0100, Iain Sandoe wrote: > > > > Ok. The *.ver changes are still needed (see above), but that can be > >

Re: [PATCH 2/2] Get rid of all float-int special cases in validate_subreg.

2021-09-07 Thread Hongtao Liu via Gcc-patches
On Wed, Sep 8, 2021 at 7:20 AM Segher Boessenkool wrote: > > On Fri, Sep 03, 2021 at 05:05:47PM +0200, Andreas Schwab wrote: > > On Sep 02 2021, Segher Boessenkool wrote: > > > On Tue, Aug 31, 2021 at 07:17:49PM +0800, liuhongt via Gcc-patches wrote: > > >>* emit-rtl.c (validate_subreg): Get r

Re: [PATCH] libgcc, i386: Export *hf* and *hc* from libgcc_s.so.1

2021-09-07 Thread Hongtao Liu via Gcc-patches
On Wed, Sep 8, 2021 at 8:54 AM Hongtao Liu wrote: > > On Tue, Sep 7, 2021 at 8:29 PM Jakub Jelinek via Gcc-patches > wrote: > > > > On Mon, Sep 06, 2021 at 10:58:53AM +0200, Jakub Jelinek via Gcc-patches > > wrote: > > > On Mon, Sep 06, 2021 at 08:49:27AM +0

Re: [PATCH V2 00/10] Initial support for AVX512FP16

2021-09-07 Thread Hongtao Liu via Gcc-patches
On Wed, Jul 21, 2021 at 3:43 PM liuhongt wrote: > > Hi: > As discussed in [1], this patch supports _Float16 under target sse2 > and above, w/o avx512fp16, _Float16 type is storage only, all operations > are emulated by soft-fp and float instructions. Soft-fp keeps the intermediate > result of the

Re: [PATCH V2 00/10] Initial support for AVX512FP16

2021-09-07 Thread Hongtao Liu via Gcc-patches
On Wed, Sep 8, 2021 at 10:54 AM Hongtao Liu wrote: > > On Wed, Jul 21, 2021 at 3:43 PM liuhongt wrote: > > > > Hi: > > As discussed in [1], this patch supports _Float16 under target sse2 > > and above, w/o avx512fp16, _Float16 type is storage only, all operations

Re: [PATCH] i386: Fix up @xorsign3_1 [PR102224]

2021-09-08 Thread Hongtao Liu via Gcc-patches
On Wed, Sep 8, 2021 at 3:43 PM Jakub Jelinek via Gcc-patches wrote: > > Hi! > > As the testcase shows, we miscompile @xorsign3_1 if both input > operands are in the same register, because the splitter overwrites op1 > with op1 & mask before using op0. > > For dest = xorsign op0, op0 we can
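The register-aliasing hazard described in this message can be modelled in plain C. The sketch below is not the actual splitter from i386.md; the function names are hypothetical, and "registers" are modelled as `uint32_t` objects accessed through pointers so that two operands can refer to the same storage.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

#define SIGN_MASK 0x80000000u

/* Bit-exact conversions between float and its 32-bit representation. */
static uint32_t f2u(float f) { uint32_t u; memcpy(&u, &f, sizeof u); return u; }
static float u2f(uint32_t u) { float f; memcpy(&f, &u, sizeof f); return f; }

/* Reference semantics: flip the sign of x when y is negative. */
static float xorsign_ref(float x, float y)
{
  return u2f(f2u(x) ^ (f2u(y) & SIGN_MASK));
}

/* Hypothetical model of the problematic order of operations:
   first op1 &= mask, then dest = op0 ^ op1.  If op0 and op1 name the
   same register, op0 has already been clobbered when it is read. */
static uint32_t xorsign_split_naive(uint32_t *op0, uint32_t *op1)
{
  *op1 &= SIGN_MASK;
  return *op0 ^ *op1;
}

/* One register holds both inputs, so the naive order loses the
   magnitude bits and produces +0.0 instead of the reference result. */
static void alias_demo(void)
{
  uint32_t reg = f2u(-2.0f);
  uint32_t bad = xorsign_split_naive(&reg, &reg);
  assert(bad == 0);
  assert(bad != f2u(xorsign_ref(-2.0f, -2.0f)));
}
```

With distinct registers the naive sequence agrees with the reference; only the aliased case goes wrong, which is why such a bug can stay hidden until a testcase happens to pass the same value for both operands.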

Re: [PATCH] libgcc, i386: Export *hf* and *hc* from libgcc_s.so.1

2021-09-08 Thread Hongtao Liu via Gcc-patches
On Wed, Sep 8, 2021 at 5:09 PM Jakub Jelinek wrote: > > On Wed, Sep 08, 2021 at 10:37:17AM +0800, Hongtao Liu wrote: > > Bootstrapped and regtested on x86_64-linux-gnu{-m32,}. > > Ok for trunk? > > > > libgcc/ChangeLog: > > > > * config/

Re: [PATCH] i386: Fix up @xorsign3_1 [PR102224]

2021-09-08 Thread Hongtao Liu via Gcc-patches
On Wed, Sep 8, 2021 at 5:33 PM Jakub Jelinek wrote: > > On Wed, Sep 08, 2021 at 05:23:40PM +0800, Hongtao Liu wrote: > > > Bootstrapped/regtested on x86_64-linux and i686-linux, ok for trunk? > > > > > Patch LGTM. > > Thanks, committed. > > > PS:

Re: [PATCH] i386: Fix up @xorsign3_1 [PR102224]

2021-09-08 Thread Hongtao Liu via Gcc-patches
On Wed, Sep 8, 2021 at 6:02 PM Jakub Jelinek wrote: > > On Wed, Sep 08, 2021 at 06:00:50PM +0800, Hongtao Liu wrote: > > Yes, I think so. > > And I find paradoxical subreg like (subreg:V4SF (reg:SF)) are not > > allowed by validate_subreg until r11-621. > > That

Re: [PATCH] Fix SFmode subreg of DImode and TImode

2021-09-08 Thread Hongtao Liu via Gcc-patches
On Thu, Sep 9, 2021 at 3:17 AM Segher Boessenkool wrote: > > On Wed, Sep 08, 2021 at 08:39:31PM +0200, Richard Biener wrote: > > On September 8, 2021 7:08:09 PM GMT+02:00, Segher Boessenkool > > wrote: > > >It is not a good idea to allow all those things. Most backends can > > >only support

Re: [PATCH] Fix SFmode subreg of DImode and TImode

2021-09-08 Thread Hongtao Liu via Gcc-patches
On Thu, Sep 9, 2021 at 3:17 AM Segher Boessenkool wrote: > > On Wed, Sep 08, 2021 at 08:39:31PM +0200, Richard Biener wrote: > > On September 8, 2021 7:08:09 PM GMT+02:00, Segher Boessenkool > > wrote: > > >It is not a good idea to allow all those things. Most backends can > > >only support

Re: [PATCH] x86: Add TARGET_AVX256_[MOVE|STORE]_BY_PIECES

2021-09-08 Thread Hongtao Liu via Gcc-patches
On Thu, Sep 9, 2021 at 11:21 AM H.J. Lu via Gcc-patches wrote: > > 1. Add TARGET_AVX256_MOVE_BY_PIECES to perform move by-pieces operation > with 256-bit AVX instructions. > 2. Add TARGET_AVX256_STORE_BY_PIECES to perform move and store by-pieces > operations with 256-bit AVX instructions. > > The

Re: [PATCH 07/62] AVX512FP16: Add vaddph/vsubph/vdivph/vmulph.

2021-09-09 Thread Hongtao Liu via Gcc-patches
On Thu, Jul 1, 2021 at 2:17 PM liuhongt wrote: > > gcc/ChangeLog: > > * config.gcc: Add avx512fp16vlintrin.h. > * config/i386/avx512fp16intrin.h: (_mm512_add_ph): New intrinsic. > (_mm512_mask_add_ph): Likewise. > (_mm512_maskz_add_ph): Likewise. > (_mm512_s

Re: [PATCH] [i386] Remove copysign post_reload splitter for scalar modes.

2021-09-09 Thread Hongtao Liu via Gcc-patches
On Thu, Sep 9, 2021 at 3:54 PM liuhongt wrote: > > Hi: > As a follow up of [1], the patch removes all scalar mode copysign related > post_reload splitter/define_insn and expand copysign directly into below using > paradoxical subregs. [1] https://gcc.gnu.org/pipermail/gcc-patches/2021-September

Re: [PATCH] x86-64: Update AVX512FP16 ABI tests for x32

2021-09-09 Thread Hongtao Liu via Gcc-patches
On Thu, Sep 9, 2021 at 11:31 PM H.J. Lu wrote: > > On Wed, Jul 21, 2021 at 12:44 AM liuhongt wrote: > > > > From: "H.J. Lu" > > > > Copied from regular XMM ABI tests. Only run AVX512FP16 ABI tests for ELF > > targets. > > > > gcc/testsuite/ChangeLog: > > > > * gcc.target/x86_64/abi/avx51

Re: [PATCH] Fix SFmode subreg of DImode and TImode

2021-09-09 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 10, 2021 at 7:49 AM Segher Boessenkool wrote: > > On Thu, Sep 09, 2021 at 08:16:16AM +0200, Richard Biener wrote: > > > I think we should (longer term) get rid of the overloaded meanings and > > > uses of subregs. One fairly simple thing is to make a new rtx code > > > "bit_cast" (or

Re: [PATCH] [i386] Remove copysign post_reload splitter for scalar modes.

2021-09-09 Thread Hongtao Liu via Gcc-patches
On Thu, Sep 9, 2021 at 4:00 PM Hongtao Liu wrote: > > On Thu, Sep 9, 2021 at 3:54 PM liuhongt wrote: > > > > Hi: > > As a follow up of [1], the patch removes all scalar mode copysign related > > post_reload splitter/define_insn and expand copysign directly into be

Re: [PATCH 09/62] AVX512FP16: Enable _Float16 autovectorization

2021-09-09 Thread Hongtao Liu via Gcc-patches
On Thu, Jul 1, 2021 at 2:17 PM liuhongt wrote: > > From: "H.J. Lu" > > gcc/ChangeLog: > > * config/i386/i386-expand.c > (ix86_avx256_split_vector_move_misalign): Handle V16HF mode. > * config/i386/i386.c > (ix86_preferred_simd_mode): Handle HF mode. > * con

Re: [PATCH] Fix SFmode subreg of DImode and TImode

2021-09-10 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 10, 2021 at 6:54 PM Richard Biener wrote: > > On Fri, Sep 10, 2021 at 5:03 AM Hongtao Liu wrote: > > > > On Fri, Sep 10, 2021 at 7:49 AM Segher Boessenkool > > wrote: > > > > > > On Thu, Sep 09, 2021 at 08:16:16AM +0200, Richard Biener wrote

Re: [PATCH] Fix SFmode subreg of DImode and TImode

2021-09-10 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 10, 2021 at 7:25 PM Hongtao Liu wrote: > > On Fri, Sep 10, 2021 at 6:54 PM Richard Biener > wrote: > > > > On Fri, Sep 10, 2021 at 5:03 AM Hongtao Liu wrote: > > > > > > On Fri, Sep 10, 2021 at 7:49 AM Segher Boessenkool > > > wrot

Re: [PATCH 2/2] validate_subreg before call gen_lowpart to avoid ICE.

2021-09-10 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 10, 2021 at 9:16 PM Richard Biener via Gcc-patches wrote: > > On Fri, Sep 10, 2021 at 2:58 PM liuhongt wrote: > > > > gcc/ChangeLog: > > > > * expmed.c (extract_bit_field_using_extv): validate_subreg > > before call gen_lowpart. > > --- > > gcc/expmed.c | 6 +- > >

Re: [PATCH 2/2] validate_subreg before call gen_lowpart to avoid ICE.

2021-09-10 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 10, 2021 at 9:27 PM Hongtao Liu wrote: > > On Fri, Sep 10, 2021 at 9:16 PM Richard Biener via Gcc-patches > wrote: > > > > On Fri, Sep 10, 2021 at 2:58 PM liuhongt wrote: > > > > > > gcc/ChangeLog: > > > > > > *

Re: [PATCH 2/2] validate_subreg before call gen_lowpart to avoid ICE.

2021-09-10 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 10, 2021 at 9:32 PM Richard Biener wrote: > > On September 10, 2021 3:27:09 PM GMT+02:00, Hongtao Liu > wrote: > >On Fri, Sep 10, 2021 at 9:16 PM Richard Biener via Gcc-patches > > wrote: > >> > >> On Fri, Sep 10, 2021 at 2:58 PM liuho

Re: [PATCH 2/2] validate_subreg before call gen_lowpart to avoid ICE.

2021-09-10 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 10, 2021 at 9:32 PM Richard Biener wrote: > > On September 10, 2021 3:27:09 PM GMT+02:00, Hongtao Liu > wrote: > >On Fri, Sep 10, 2021 at 9:16 PM Richard Biener via Gcc-patches > > wrote: > >> > >> On Fri, Sep 10, 2021 at 2:58 PM liuho

Re: [PATCH] Fix SFmode subreg of DImode and TImode

2021-09-10 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 10, 2021 at 10:08 PM David Edelsohn wrote: > > On Thu, Sep 9, 2021 at 11:03 PM Hongtao Liu wrote: > > > > On Fri, Sep 10, 2021 at 7:49 AM Segher Boessenkool > > wrote: > > > > > > On Thu, Sep 09, 2021 at 08:16:16AM +0200, Richard Biener

Re: [PATCH 2/2] validate_subreg before call gen_lowpart to avoid ICE.

2021-09-10 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 10, 2021 at 9:44 PM Hongtao Liu wrote: > > On Fri, Sep 10, 2021 at 9:32 PM Richard Biener > wrote: > > > > On September 10, 2021 3:27:09 PM GMT+02:00, Hongtao Liu > > wrote: > > >On Fri, Sep 10, 2021 at 9:16 PM Richard Biener via Gcc-patches

Re: [PATCH 2/2] validate_subreg before call gen_lowpart to avoid ICE.

2021-09-10 Thread Hongtao Liu via Gcc-patches
On Sat, Sep 11, 2021 at 5:21 AM Segher Boessenkool wrote: > > On Fri, Sep 10, 2021 at 10:25:45PM +0800, Hongtao Liu wrote: > > Updated patch. > > > > Bootstrapped and regtested on x86_64-linux-gnu{-m32,}, do I need to > > run this patch on other targets machin

Re: [PATCH 2/2] validate_subreg before call gen_lowpart to avoid ICE.

2021-09-10 Thread Hongtao Liu via Gcc-patches
On Sat, Sep 11, 2021 at 8:29 AM Hongtao Liu wrote: > > On Sat, Sep 11, 2021 at 5:21 AM Segher Boessenkool > wrote: > > > > On Fri, Sep 10, 2021 at 10:25:45PM +0800, Hongtao Liu wrote: > > > Updated patch. > > > > > > Bootstrapped and reg

Re: [PATCH 2/2] validate_subreg before call gen_lowpart to avoid ICE.

2021-09-11 Thread Hongtao Liu via Gcc-patches
On Sat, Sep 11, 2021 at 4:25 PM Richard Biener via Gcc-patches wrote: > > On September 10, 2021 11:27:16 PM GMT+02:00, Segher Boessenkool > wrote: > >On Fri, Sep 10, 2021 at 08:36:12PM +0200, Richard Biener wrote: > >> On September 10, 2021 6:24:50 PM GMT+02:00, Segher Boessenkool > >> wrote:

Re: [PATCH 2/2] validate_subreg before call gen_lowpart to avoid ICE.

2021-09-11 Thread Hongtao Liu via Gcc-patches
On Sat, Sep 11, 2021 at 5:51 PM Hongtao Liu wrote: > > On Sat, Sep 11, 2021 at 4:25 PM Richard Biener via Gcc-patches > wrote: > > > > On September 10, 2021 11:27:16 PM GMT+02:00, Segher Boessenkool > > wrote: > > >On Fri, Sep 10, 2021 at 08:36:12PM +0

Re: [PATCH 2/2] validate_subreg before call gen_lowpart to avoid ICE.

2021-09-12 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 13, 2021 at 2:11 PM Richard Biener via Gcc-patches wrote: > > On Fri, Sep 10, 2021 at 2:58 PM liuhongt wrote: > > > > gcc/ChangeLog: > > > > * expmed.c (extract_bit_field_using_extv): validate_subreg > > before call gen_lowpart. > > --- > > gcc/expmed.c | 6 +- > >

Re: [PATCH 2/2] validate_subreg before call gen_lowpart to avoid ICE.

2021-09-13 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 13, 2021 at 5:15 PM Richard Biener wrote: > > On Mon, Sep 13, 2021 at 8:26 AM Hongtao Liu wrote: > > > > On Mon, Sep 13, 2021 at 2:11 PM Richard Biener via Gcc-patches > > wrote: > > > > > > On Fri, Sep 10, 2021 at 2:58 PM liuh

Re: [PATCH] Relax condition of (vec_concat:M(vec_select op0 idx0)(vec_select op0 idx1)) to allow different modes between op0 and M, but have same inner mode.

2021-09-13 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 13, 2021 at 10:10 PM Jeff Law via Gcc-patches wrote: > > > > On 9/9/2021 10:36 PM, liuhongt via Gcc-patches wrote: > >Currently for (vec_concat:M (vec_select op0 idx1)(vec_select op0 idx2)), > > optimizer wouldn't simplify if op0 has different mode with M, but that's too > > restri

Re: [PATCH] Relax condition of (vec_concat:M(vec_select op0 idx0)(vec_select op0 idx1)) to allow different modes between op0 and M, but have same inner mode.

2021-09-13 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 13, 2021 at 10:10 PM Jeff Law via Gcc-patches wrote: > > > > On 9/9/2021 10:36 PM, liuhongt via Gcc-patches wrote: > >Currently for (vec_concat:M (vec_select op0 idx1)(vec_select op0 idx2)), > > optimizer wouldn't simplify if op0 has different mode with M, but that's too > > restri

Re: [PATCH] i386: Fix up @xorsign3_1 [PR102224]

2021-09-13 Thread Hongtao Liu via Gcc-patches
On Tue, Sep 14, 2021 at 8:58 AM Andrew Pinski wrote: > > On Wed, Sep 8, 2021 at 2:55 AM Hongtao Liu via Gcc-patches > wrote: > > > > On Wed, Sep 8, 2021 at 5:33 PM Jakub Jelinek wrote: > > > > > > On Wed, Sep 08, 2021 at 05:23:40PM +0800, Hongtao Liu wro

Re: [PATCH] i386: Fix up @xorsign3_1 [PR102224]

2021-09-13 Thread Hongtao Liu via Gcc-patches
On Tue, Sep 14, 2021 at 10:06 AM Hongtao Liu wrote: > > On Tue, Sep 14, 2021 at 8:58 AM Andrew Pinski wrote: > > > > On Wed, Sep 8, 2021 at 2:55 AM Hongtao Liu via Gcc-patches > > wrote: > > > > > > On Wed, Sep 8, 2021 at 5:33 PM Jakub Jelinek wrote:

Re: [PATCH 16/62] AVX512FP16: Add vsqrtph/vrsqrtph/vsqrtsh/vrsqrtsh.

2021-09-13 Thread Hongtao Liu via Gcc-patches
I'm going to commit 8 patches: [PATCH 16/62] AVX512FP16: Add vsqrtph/vrsqrtph/vsqrtsh/vrsqrtsh. [PATCH 17/62] AVX512FP16: Add testcase for vsqrtph/vsqrtsh/vrsqrtph/vrsqrtsh. [PATCH 18/62] AVX512FP16: Add vrcpph/vrcpsh/vscalefph/vscalefsh. [PATCH 19/62] AVX512FP16: Add testcase for vrcpph/vrcpsh/vs

Re: [PATCH 24/62] AVX512FP16: Add vmovw/vmovsh.

2021-09-15 Thread Hongtao Liu via Gcc-patches
I'm going to check in 6 patches [PATCH 24/62] AVX512FP16: Add vmovw/vmovsh. [PATCH 25/62] AVX512FP16: Add testcase for vmovsh/vmovw. [PATCH 26/62] AVX512FP16: Add vcvtph2dq/vcvtph2qq/vcvtph2w/vcvtph2uw/vcvtph2uqq/vcvtph2udq [PATCH 27/62] AVX512FP16: Add testcase for vcvtph2w/vcvtph2uw/vcvtph2dq/vc

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-16 Thread Hongtao Liu via Gcc-patches
On Thu, Sep 16, 2021 at 4:23 PM Richard Biener via Gcc-patches wrote: > > On Thu, 16 Sep 2021, liuhongt wrote: > > > Ping > > rebased on latest trunk. > > > > gcc/ChangeLog: > > > > * common.opt (ftree-vectorize): Add Var(flag_tree_vectorize). > > * doc/invoke.texi (Options That Contro

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-16 Thread Hongtao Liu via Gcc-patches
On Thu, Sep 16, 2021 at 8:31 PM Richard Biener wrote: > > On Thu, 16 Sep 2021, Hongtao Liu wrote: > > > On Thu, Sep 16, 2021 at 4:23 PM Richard Biener via Gcc-patches > > wrote: > > > > > > On Thu, 16 Sep 2021, liuhongt wrote: > >

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-17 Thread Hongtao Liu via Gcc-patches
On Fri, Sep 17, 2021 at 3:47 PM Richard Biener wrote: > > On Fri, 17 Sep 2021, Hongtao Liu wrote: > > > On Thu, Sep 16, 2021 at 8:31 PM Richard Biener wrote: > > > > > > On Thu, 16 Sep 2021, Hongtao Liu wrote: > > > > > > > On Thu,

Re: [PATCH 30/62] AVX512FP16: Add vcvtsh2si/vcvtsh2usi/vcvtsi2sh/vcvtusi2sh.

2021-09-17 Thread Hongtao Liu via Gcc-patches
I'm going to check in 10 patches. [PATCH 30/62] AVX512FP16: Add vcvtsh2si/vcvtsh2usi/vcvtsi2sh/vcvtusi2sh. [PATCH 31/62] AVX512FP16: Add testcase for vcvtsh2si/vcvtsh2usi/vcvtsi2sh/vcvtusi2sh. [PATCH 32/62] AVX512FP16: Add vcvttph2w/vcvttph2uw/vcvttph2dq/vcvttph2qq/vcvttph2udq/vcvttph2uqq [PATCH 3

Re: [PATCH 3/4] [PATCH 3/4] x86: Properly handle USE_VECTOR_FP_CONVERTS/USE_VECTOR_CONVERTS

2021-09-17 Thread Hongtao Liu via Gcc-patches
On Sat, Sep 18, 2021 at 7:50 AM Jakub Jelinek via Gcc-patches wrote: > > On Fri, Sep 17, 2021 at 08:35:57AM +0200, Uros Bizjak via Gcc-patches wrote: > > > > On Wed, Sep 15, 2021 at 10:10 AM wrote: > > > > > > > > > > From: "H.J. Lu" > > > > > > > > > > Check TARGET_USE_VECTOR_FP_CONVERTS or > >

Re: [PATCH 40/62] AVX512FP16: Add vfmaddsub[132, 213, 231]ph/vfmsubadd[132, 213, 231]ph.

2021-09-17 Thread Hongtao Liu via Gcc-patches
Hi: I'm going to check in the 9 patches below. Bootstrapped and regtested on x86_64-pc-linux-gnu{-m32,}. Newly added testcases passed on sde{-m32,}. [PATCH 40/62] AVX512FP16: Add vfmaddsub[132, 213, 231]ph/vfmsubadd[132, 213, 231]ph. [PATCH 41/62] AVX512FP16: Add testcase for vfmaddsub[132, 213, 2

Re: [PATCH] [i386] Fix ICE in pass_rpad.

2021-09-18 Thread Hongtao Liu via Gcc-patches
On Sat, Sep 18, 2021 at 3:31 PM Jakub Jelinek wrote: > > On Sat, Sep 18, 2021 at 11:09:32AM +0800, liuhongt wrote: > > Besides conversion instructions, pass_rpad also handles scalar > > sqrt/rsqrt/rcp/round instructions, while r12-3614 should only want to > > handle conversion instructions, so fix

Re: [PATCH] libgcc, i386: Export *hf* and *hc* from libgcc_s.so.1

2021-09-18 Thread Hongtao Liu via Gcc-patches
On Sunday, September 19, 2021, Iain Sandoe wrote: > Hi Hongtao, > > > On 8 Sep 2021, at 10:31, Hongtao Liu wrote: > > > > On Wed, Sep 8, 2021 at 5:09 PM Jakub Jelinek wrote: > >> > >> On Wed, Sep 08, 2021 at 10:37:17AM +0800, Hongtao Liu wrote: >

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-21 Thread Hongtao Liu via Gcc-patches
On Mon, Sep 20, 2021 at 4:13 AM Martin Sebor wrote: > > On 9/16/21 3:03 AM, Hongtao Liu via Gcc-patches wrote: > > On Thu, Sep 16, 2021 at 4:23 PM Richard Biener via Gcc-patches > > wrote: > >> > >> On Thu, 16 Sep 2021, liuhongt wrote: > >

Re: [PATCH] Allow different vector types for stmt groups

2021-09-21 Thread Hongtao Liu via Gcc-patches
On Tue, Sep 21, 2021 at 10:55 PM H.J. Lu wrote: > > On Mon, Sep 20, 2021 at 5:15 AM Richard Biener via Gcc-patches > wrote: > > > > This allows vectorization (in practice non-loop vectorization) to > > have a stmt participate in different vector type vectorizations. > > It allows us to remove vec
