from:"Hongtao Liu"

Re: [PATCH 24/62] AVX512FP16: Add vmovw/vmovsh.

2021-09-15 Thread Hongtao Liu via Gcc-patches

I'm going to check in 6 patches [PATCH 24/62] AVX512FP16: Add vmovw/vmovsh. [PATCH 25/62] AVX512FP16: Add testcase for vmovsh/vmovw. [PATCH 26/62] AVX512FP16: Add vcvtph2dq/vcvtph2qq/vcvtph2w/vcvtph2uw/vcvtph2uqq/vcvtph2udq [PATCH 27/62] AVX512FP16: Add testcase for vcvtph2w/vcvtph2uw/vcvtph2dq/vc

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-16 Thread Hongtao Liu via Gcc-patches

On Thu, Sep 16, 2021 at 4:23 PM Richard Biener via Gcc-patches wrote: > > On Thu, 16 Sep 2021, liuhongt wrote: > > > Ping > > rebased on latest trunk. > > > > gcc/ChangeLog: > > > > * common.opt (ftree-vectorize): Add Var(flag_tree_vectorize). > > * doc/invoke.texi (Options That Contro

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-16 Thread Hongtao Liu via Gcc-patches

On Thu, Sep 16, 2021 at 8:31 PM Richard Biener wrote: > > On Thu, 16 Sep 2021, Hongtao Liu wrote: > > > On Thu, Sep 16, 2021 at 4:23 PM Richard Biener via Gcc-patches > > wrote: > > > > > > On Thu, 16 Sep 2021, liuhongt wrote: > >

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-17 Thread Hongtao Liu via Gcc-patches

On Fri, Sep 17, 2021 at 3:47 PM Richard Biener wrote: > > On Fri, 17 Sep 2021, Hongtao Liu wrote: > > > On Thu, Sep 16, 2021 at 8:31 PM Richard Biener wrote: > > > > > > On Thu, 16 Sep 2021, Hongtao Liu wrote: > > > > > > > On Thu,

Re: [PATCH 30/62] AVX512FP16: Add vcvtsh2si/vcvtsh2usi/vcvtsi2sh/vcvtusi2sh.

2021-09-17 Thread Hongtao Liu via Gcc-patches

I'm going to check in 10 patches. [PATCH 30/62] AVX512FP16: Add vcvtsh2si/vcvtsh2usi/vcvtsi2sh/vcvtusi2sh. [PATCH 31/62] AVX512FP16: Add testcase for vcvtsh2si/vcvtsh2usi/vcvtsi2sh/vcvtusi2sh. [PATCH 32/62] AVX512FP16: Add vcvttph2w/vcvttph2uw/vcvttph2dq/vcvttph2qq/vcvttph2udq/vcvttph2uqq [PATCH 3

Re: [PATCH 3/4] [PATCH 3/4] x86: Properly handle USE_VECTOR_FP_CONVERTS/USE_VECTOR_CONVERTS

2021-09-17 Thread Hongtao Liu via Gcc-patches

On Sat, Sep 18, 2021 at 7:50 AM Jakub Jelinek via Gcc-patches wrote: > > On Fri, Sep 17, 2021 at 08:35:57AM +0200, Uros Bizjak via Gcc-patches wrote: > > > > On Wed, Sep 15, 2021 at 10:10 AM wrote: > > > > > > > > > > From: "H.J. Lu" > > > > > > > > > > Check TARGET_USE_VECTOR_FP_CONVERTS or > >

Re: [PATCH 40/62] AVX512FP16: Add vfmaddsub[132, 213, 231]ph/vfmsubadd[132, 213, 231]ph.

2021-09-17 Thread Hongtao Liu via Gcc-patches

Hi: I'm going to check the 9 patches below. Bootstrapped and regtest on x86_64-pc-linux-gnu{-m32,}. Newly added testcase passed on sde{-m32,}. [PATCH 40/62] AVX512FP16: Add vfmaddsub[132, 213, 231]ph/vfmsubadd[132, 213, 231]ph. [PATCH 41/62] AVX512FP16: Add testcase for vfmaddsub[132, 213, 2

Re: [PATCH] [i386] Fix ICE in pass_rpad.

2021-09-18 Thread Hongtao Liu via Gcc-patches

On Sat, Sep 18, 2021 at 3:31 PM Jakub Jelinek wrote: > > On Sat, Sep 18, 2021 at 11:09:32AM +0800, liuhongt wrote: > > Besides conversion instructions, pass_rpad also handles scalar > > sqrt/rsqrt/rcp/round instructions, while r12-3614 should only want to > > handle conversion instructions, so fix

Re: [PATCH] libgcc, i386: Export hf and hc from libgcc_s.so.1

2021-09-18 Thread Hongtao Liu via Gcc-patches

On Sunday, September 19, 2021, Iain Sandoe wrote: > Hi Hongtao, > > > On 8 Sep 2021, at 10:31, Hongtao Liu wrote: > > > > On Wed, Sep 8, 2021 at 5:09 PM Jakub Jelinek wrote: > >> > >> On Wed, Sep 08, 2021 at 10:37:17AM +0800, Hongtao Liu wrote: >

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-21 Thread Hongtao Liu via Gcc-patches

On Mon, Sep 20, 2021 at 4:13 AM Martin Sebor wrote: > > On 9/16/21 3:03 AM, Hongtao Liu via Gcc-patches wrote: > > On Thu, Sep 16, 2021 at 4:23 PM Richard Biener via Gcc-patches > > wrote: > >> > >> On Thu, 16 Sep 2021, liuhongt wrote: > >

Re: [PATCH] Allow different vector types for stmt groups

2021-09-21 Thread Hongtao Liu via Gcc-patches

On Tue, Sep 21, 2021 at 10:55 PM H.J. Lu wrote: > > On Mon, Sep 20, 2021 at 5:15 AM Richard Biener via Gcc-patches > wrote: > > > > This allows vectorization (in practice non-loop vectorization) to > > have a stmt participate in different vector type vectorizations. > > It allows us to remove vec

Re: [PATCH 49/62] AVX512FP16: Add vfcmaddcph/vfmaddcph/vfcmulcph/vfmulcph

2021-09-21 Thread Hongtao Liu via Gcc-patches

I'm going to check in 7 patches. [PATCH 49/62] AVX512FP16: Add vfcmaddcph/vfmaddcph/vfcmulcph/vfmulcph [PATCH 50/62] AVX512FP16: Add testcases for vfcmaddcph/vfmaddcph/vfcmulcph/vfmulcph. [PATCH 51/62] AVX512FP16: Add vfcmaddcsh/vfmaddcsh/vfcmulcsh/vfmulcsh. [PATCH 52/62] AVX512FP16: Add testcases

Re: [PATCH 59/62] AVX512FP16: Support load/store/abs intrinsics.

2021-09-22 Thread Hongtao Liu via Gcc-patches

I'm going to check in 4 patches. [PATCH 59/62] AVX512FP16: Support load/store/abs intrinsics. [PATCH 60/62] AVX512FP16: Add reduce operators(add/mul/min/max). [PATCH 61/62] AVX512FP16: Add complex conjugation intrinsic instructions. [PATCH 62/62] AVX512FP16: Add permutation and mask blend intrinsi

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-22 Thread Hongtao Liu via Gcc-patches

On Wed, Sep 22, 2021 at 10:21 PM Martin Sebor wrote: > > On 9/21/21 7:38 PM, Hongtao Liu wrote: > > On Mon, Sep 20, 2021 at 4:13 AM Martin Sebor wrote: > ... > >>>>> diff --git a/gcc/testsuite/c-c++-common/Wstringop-overflow-2.c > >>>>>

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-22 Thread Hongtao Liu via Gcc-patches

On Thu, Sep 23, 2021 at 9:48 AM Hongtao Liu wrote: > > On Wed, Sep 22, 2021 at 10:21 PM Martin Sebor wrote: > > > > On 9/21/21 7:38 PM, Hongtao Liu wrote: > > > On Mon, Sep 20, 2021 at 4:13 AM Martin Sebor wrote: > > ... > > >>>>> diff --g

Re: [PATCH] [GCC12] Mention Intel AVX512-FP16 and _Float16 support.

2021-09-23 Thread Hongtao Liu via Gcc-patches

On Fri, Sep 24, 2021 at 9:20 AM H.J. Lu wrote: > > On Thu, Sep 23, 2021 at 6:17 PM liuhongt wrote: > > > > Updated, mention _Float16 support. > > > > --- > > htdocs/gcc-12/changes.html | 13 - > > 1 file changed, 12 insertions(+), 1 deletion(-) > > > > diff --git a/htdocs/gcc-12/chan

Re: [PATCH] [GCC12] Mention Intel AVX512-FP16 and _Float16 support.

2021-09-23 Thread Hongtao Liu via Gcc-patches

On Fri, Sep 24, 2021 at 9:42 AM H.J. Lu wrote: > > On Thu, Sep 23, 2021 at 6:38 PM Hongtao Liu wrote: > > > > On Fri, Sep 24, 2021 at 9:20 AM H.J. Lu wrote: > > > > > > On Thu, Sep 23, 2021 at 6:17 PM liuhongt wrote: > > >

Re: [PATCH] AVX512FP16: Support cond_op for HFmode

2021-09-23 Thread Hongtao Liu via Gcc-patches

On Fri, Sep 24, 2021 at 10:16 AM Hongyu Wang via Gcc-patches wrote: > > > >This patch extends the expanders for cond_op to support vector HF modes. > > >Bootstrapped and regtested on x86_64-pc-linux-gnu{-m32,}. > > Do runtime tests pass on sde{-m32,}? > > Yes, forgot to mention this. > > Liu, Hongt

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-23 Thread Hongtao Liu via Gcc-patches

On Thu, Sep 23, 2021 at 11:18 PM Martin Sebor wrote: > > On 9/23/21 12:30 AM, Richard Biener wrote: > > On Thu, 23 Sep 2021, Hongtao Liu wrote: > > > >> On Thu, Sep 23, 2021 at 9:48 AM Hongtao Liu wrote: > >>> > >>> On Wed, Sep 22, 2021 at 10:21

Re: [PATCH] Relax condition of (vec_concat:M(vec_select op0 idx0)(vec_select op0 idx1)) to allow different modes between op0 and M, but have same inner mode.

2021-09-24 Thread Hongtao Liu via Gcc-patches

ping On Mon, Sep 13, 2021 at 11:19 PM Hongtao Liu wrote: > > On Mon, Sep 13, 2021 at 10:10 PM Jeff Law via Gcc-patches > wrote: > > > > > > > > On 9/9/2021 10:36 PM, liuhongt via Gcc-patches wrote: > > >Currently for (vec_concat:M (vec_s

Re: [PATCH] Relax condition of (vec_concat:M(vec_select op0 idx0)(vec_select op0 idx1)) to allow different modes between op0 and M, but have same inner mode.

2021-09-27 Thread Hongtao Liu via Gcc-patches

On Fri, Sep 24, 2021 at 9:08 PM Segher Boessenkool wrote: > > On Mon, Sep 13, 2021 at 04:24:13PM +0200, Richard Biener wrote: > > On Mon, Sep 13, 2021 at 4:10 PM Jeff Law via Gcc-patches > > wrote: > > > I'm not convinced that we need the inner mode to match anything. As > > > long as the vec_co

Re: [PATCH] [GIMPLE] Simplify (_Float16) ceil ((double) x) to .CEIL (x) when available.

2021-09-27 Thread Hongtao Liu via Gcc-patches

On Mon, Sep 27, 2021 at 8:53 PM Richard Biener wrote: > > On Fri, Sep 24, 2021 at 1:26 PM liuhongt wrote: > > > > Hi: > > Related discussion in [1] and PR. > > > > Bootstrapped and regtest on x86_64-linux-gnu{-m32,}. > > Ok for trunk? > > > > [1] https://gcc.gnu.org/pipermail/gcc-patches/20

Re: [r12-3893 Regression] FAIL: gcc.target/i386/vect-pr97352.c scan-assembler-times vmov.pd 4 on Linux/x86_64

2021-09-28 Thread Hongtao Liu via Gcc-patches

On Tue, Sep 28, 2021 at 2:59 PM Richard Biener via Gcc-patches wrote: > > On Mon, 27 Sep 2021, sunil.k.pandey wrote: > > > On Linux/x86_64, > > > > 6390c5047adb75960f86d56582e6322aaa4d9281 is the first bad commit > > commit 6390c5047adb75960f86d56582e6322aaa4d9281 > > Author: Richard Biener > > D

Re: [PATCH] Adjust testcase for O2 vect.

2021-10-28 Thread Hongtao Liu via Gcc-patches

On Fri, Oct 29, 2021 at 12:20 AM Martin Sebor via Gcc-patches wrote: > > On 10/28/21 1:23 AM, liuhongt via Gcc-patches wrote: > > Adjust code in check_vect_slp_aligned_store_usage to make it an exact > > pattern match of the corresponding testcases. > > These new target/xfail selectors are added a

Re: [PATCH] Enable vectorization for _Float16 floor/ceil/trunc/nearbyint/rint operations.

2021-10-28 Thread Hongtao Liu via Gcc-patches

On Thu, Oct 28, 2021 at 10:26 AM Hongtao Liu wrote: > > On Mon, Oct 25, 2021 at 4:24 PM liuhongt wrote: > > > > Bootstrapped and regtested on x86_64-pc-linux-gnu{-m32,}. > > Ok for trunk? > > > I'm going to check in this patch if there's

Re: [PATCH] Adjust testcase for O2 vect.

2021-10-28 Thread Hongtao Liu via Gcc-patches

On Fri, Oct 29, 2021 at 10:34 AM Martin Sebor wrote: > > On 10/28/21 7:47 PM, Hongtao Liu wrote: > > On Fri, Oct 29, 2021 at 12:20 AM Martin Sebor via Gcc-patches > > wrote: > >> > >> On 10/28/21 1:23 AM, liuhongt via Gcc-patches wrote: > >>> Adju

Re: [PATCH] i386: Fix wrong result for AMX-TILE intrinsic when parsing expression.

2021-11-03 Thread Hongtao Liu via Gcc-patches

On Thu, Nov 4, 2021 at 9:19 AM Hongyu Wang via Gcc-patches wrote: > > Hi, > > _tile_loadd, _tile_stored, _tile_streamloadd intrinsics are defined by > macro, so the parameters should be wrapped by parentheses to accept > expressions. > > Bootstraped/regtested on x86_64-pc-linux-gnu{-m32,} and sde.
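The parenthesization problem described in this message is a general property of function-like macros, not something specific to AMX. A minimal sketch in plain C, assuming two hypothetical macros `DOUBLE_BAD` and `DOUBLE_GOOD` that are not the actual intrinsic definitions from the GCC headers:

```c
#include <assert.h>

/* Hypothetical macros for illustration; NOT the AMX header definitions. */
#define DOUBLE_BAD(x)  (x * 2)    /* parameter used bare */
#define DOUBLE_GOOD(x) ((x) * 2)  /* parameter wrapped in parentheses */

/* Expands to (n + 1 * 2), so only the 1 is doubled. */
int double_bad(int n) { return DOUBLE_BAD(n + 1); }

/* Expands to ((n + 1) * 2), which is what the caller meant. */
int double_good(int n) { return DOUBLE_GOOD(n + 1); }

/* Small self-check showing that the two expansions diverge for n = 3. */
void check_macro_expansion(void)
{
    assert(double_bad(3) == 5);   /* 3 + 1 * 2 */
    assert(double_good(3) == 8);  /* (3 + 1) * 2 */
}
```

Passing an expression such as `n + 1` is exactly the case where an unparenthesized parameter silently changes the result, which is presumably why the patch wraps the macro parameters.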

Re: [PATCH] i386: Fix wrong result for AMX-TILE intrinsic when parsing expression.

2021-11-03 Thread Hongtao Liu via Gcc-patches

On Thu, Nov 4, 2021 at 11:25 AM Hongyu Wang wrote: > > > Could you add a testcase for that? > > Yes, updated patch. > > Hongtao Liu via Gcc-patches wrote on Thu, Nov 4, 2021 at 10:25 AM: > > > > On Thu, Nov 4, 2021 at 9:19 AM Hongyu Wang via Gcc-patches > > wrote:

Re: [PATCH] i386: Auto vectorize sdot_prod, usdot_prod with VNNI instruction.

2021-11-03 Thread Hongtao Liu via Gcc-patches

On Thu, Nov 4, 2021 at 9:50 AM Hongyu Wang via Gcc-patches wrote: > > Hi, > > AVX512VNNI/AVXVNNI has vpdpwssd for HImode, vpdpbusd for QImode, so > Adjust HImode sdot_prod expander and add QImode usdot_prod expander > to enhance vectorization for dotprod. > > Bootstraped/regtested on x86_64-pc-lin

Re: [PATCH 1/2] [Gimple] Simplify (trunc)fmax/fmin((extend)a, (extend)b) to MAX/MIN(a,b)

2021-11-07 Thread Hongtao Liu via Gcc-patches

On Fri, Nov 5, 2021 at 5:52 PM Richard Biener wrote: > > On Fri, Nov 5, 2021 at 6:38 AM liuhongt wrote: > > > > a and b are same type as trunc type and has less precision than > > extend type, the transformation is guarded by flag_finite_math_only. > > > > Bootstrapped and regtested under x86_64-

Re: [PATCH] i386: Support complex fma/conj_fma for _Float16.

2021-11-07 Thread Hongtao Liu via Gcc-patches

On Fri, Nov 5, 2021 at 3:09 PM Kong, Lingling via Gcc-patches wrote: > > Hi, > > This patch is to support cmla_optab, cmul_optab, cmla_conj_optab, > cmul_conj_optab for vector _Float16. > Ok for master? LGTM. > gcc/ChangeLog: > > * config/i386/sse.md (cmul3): add new define_expand. >

Re: [PATCH] i386: Optimization for mm512_set1_pch.

2021-11-07 Thread Hongtao Liu via Gcc-patches

On Fri, Nov 5, 2021 at 3:20 PM Kong, Lingling via Gcc-patches wrote: > > Hi, > > This patch is to support fold _mm512_fmadd_pch (a, _mm512_set1_pch(*(b)), c) > to 1 instruction vfmaddcph (%rsp){1to16}, %zmm1, %zmm2. > OK for master? > LGTM. > gcc/ChangeLog: > > * config/i386/sse.md (fma__

Re: [PATCH 1/2] [Gimple] Simplify (trunc)fmax/fmin((extend)a, (extend)b) to MAX/MIN(a,b)

2021-11-08 Thread Hongtao Liu via Gcc-patches

On Mon, Nov 8, 2021 at 4:59 PM Richard Biener wrote: > > On Mon, Nov 8, 2021 at 2:30 AM Hongtao Liu wrote: > > > > On Fri, Nov 5, 2021 at 5:52 PM Richard Biener > > wrote: > > > > > > On Fri, Nov 5, 2021 at 6:38 AM liuhongt wrote: > > >

Re: [PATCH 1/2] [Gimple] Simplify (trunc)fmax/fmin((extend)a, (extend)b) to MAX/MIN(a,b)

2021-11-09 Thread Hongtao Liu via Gcc-patches

On Tue, Nov 9, 2021 at 6:21 PM Richard Biener wrote: > > On Tue, Nov 9, 2021 at 3:37 AM Hongtao Liu wrote: > > > > On Mon, Nov 8, 2021 at 4:59 PM Richard Biener > > wrote: > > > > > > On Mon, Nov 8, 2021 at 2:30 AM Hongtao Liu wrote: > >

Re: [PATCH] [pass_if_conversion] Extend is_cond_scalar_reduction to handle bit_and/bit_xor/bit_ior.

2021-11-09 Thread Hongtao Liu via Gcc-patches

On Tue, Nov 9, 2021 at 6:22 PM Richard Biener via Gcc-patches wrote: > > On Tue, Nov 9, 2021 at 3:09 AM liuhongt wrote: > > > > This will enable transformation like > > > > - # sum1_50 = PHI > > - # sum2_52 = PHI > > + # sum1_50 = PHI <_87(13), 0(4)> > > + # sum2_52 = PHI <_89(13), 0(4)> >

Re: [PATCH v2] Check optab before transforming atomic bit test and operations

2021-11-14 Thread Hongtao Liu via Gcc-patches

On Sat, Nov 13, 2021 at 2:21 AM H.J. Lu via Gcc-patches wrote: > > On Fri, Nov 12, 2021 at 8:13 AM Jakub Jelinek wrote: > > > > On Fri, Nov 12, 2021 at 07:55:26AM -0800, H.J. Lu wrote: > > > > I have following patch queued for testing for this... > > > > > > > > 2021-11-12 Jakub Jelinek > > >

Re: [PATCH v2] Check optab before transforming atomic bit test and operations

2021-11-14 Thread Hongtao Liu via Gcc-patches

On Mon, Nov 15, 2021 at 9:37 AM Hongtao Liu wrote: > > On Sat, Nov 13, 2021 at 2:21 AM H.J. Lu via Gcc-patches > wrote: > > > > On Fri, Nov 12, 2021 at 8:13 AM Jakub Jelinek wrote: > > > > > > On Fri, Nov 12, 2021 at 07:55:26AM -0800, H.J. Lu wrote: >

Re: [PATCH] i386: add alias for fmul_ch intrinsics

2021-11-16 Thread Hongtao Liu via Gcc-patches

On Tue, Nov 16, 2021 at 4:23 PM Kong, Lingling via Gcc-patches wrote: > > Hi, > > This patch is to add alias for f*mul_*ch intrinsics. > > Ok for master? This patch just adds some macro definitions (new aliases for intrinsic) to the header file, and I think this should be low risk. And considering

Re: [PATCH] i386: vcvtph2ps and vcvtps2ph should be used to convert _Float16 to SFmode with -mf16c [PR 102811]

2021-11-16 Thread Hongtao Liu via Gcc-patches

On Tue, Nov 16, 2021 at 4:15 PM Kong, Lingling via Gcc-patches wrote: > > Hi, > > vcvtph2ps and vcvtps2ph should be used to convert _Float16 to SFmode with > -mf16c. So added define_insn extendhfsf2 and truncsfhf2 for target_f16c. > > OK for master? > > gcc/ChangeLog: > > PR target/102811

[i386] [PATCH] Fix ICE when lhs is NULL [PR target/100660]

2021-05-19 Thread Hongtao Liu via Gcc-patches

Hi: In folding target-specific builtin, when lhs is NULL, create a temporary variable for it. Bootstrapped and regtested on x86_64-linux-gnu{-m32,} gcc/ChangeLog: PR target/100660 * config/i386/i386.c (ix86_gimple_fold_builtin): Create a tmp variable for lhs when it doe

Re: [i386] [PATCH] Fix ICE when lhs is NULL [PR target/100660]

2021-05-20 Thread Hongtao Liu via Gcc-patches

On Thu, May 20, 2021 at 4:06 PM Richard Biener wrote: > > On Thu, May 20, 2021 at 8:54 AM Hongtao Liu wrote: > > > > Hi: > > In folding target-specific builtin, when lhs is NULL, create a > > temporary variable for it. > > Bootstrapped and regtested on

Re: [PATCH] i386: Optimize vpblendvb on inverted mask register to vpblendvb on swapping the order of operand 1 and operand 2. [PR target/99908]

2021-05-20 Thread Hongtao Liu via Gcc-patches

On Thu, May 13, 2021 at 8:43 AM Hongtao Liu wrote: > > On Wed, May 12, 2021 at 8:38 PM Uros Bizjak wrote: > > > > On Wed, May 12, 2021 at 1:42 PM Hongtao Liu wrote: > > > > > > On Wed, May 12, 2021 at 4:36 PM Uros Bizjak wrote: > > > > >

Re: [i386] [PATCH] Fix ICE when lhs is NULL [PR target/100660]

2021-05-20 Thread Hongtao Liu via Gcc-patches

On Thu, May 20, 2021 at 4:30 PM Richard Biener wrote: > > On Thu, May 20, 2021 at 10:15 AM Hongtao Liu wrote: > > > > On Thu, May 20, 2021 at 4:06 PM Richard Biener > > wrote: > > > > > > On Thu, May 20, 2021 at 8:54 AM Hongtao Liu wrote: >

[PATCH 1/2] [i386] Fold blendv builtins into gimple.

2021-05-23 Thread Hongtao Liu via Gcc-patches

Hi: This patch is about to Fold __builtin_ia32_pblendvb128 (a, b, c) as VEC_COND_EXPR (c < 0, b, a), similar for float version but with mask operand VIEW_CONVERT_EXPR to same sized integer vectype. After folding, blendv related patterns can be redefined as vec_merge since all elements of mask
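The equivalence behind this fold can be checked on scalars. The sketch below is an assumption-laden emulation, not GCC code: `blendv_bytes` models the byte-wise blend (pick `b` where the top bit of the mask byte is set) and `vec_cond_bytes` models the `c < 0 ? b : a` form with the mask viewed as signed bytes. All function names are hypothetical.

```c
#include <assert.h>
#include <stdint.h>

#define BYTES 16  /* one 128-bit vector of byte lanes */

/* Blend semantics: per byte, pick b when the mask's top bit is set. */
void blendv_bytes(const uint8_t *a, const uint8_t *b, const uint8_t *c,
                  uint8_t *out)
{
    for (int i = 0; i < BYTES; i++)
        out[i] = (c[i] & 0x80) ? b[i] : a[i];
}

/* Folded form: c < 0 ? b : a, with each mask byte viewed as signed.
   The uint8_t -> int8_t conversion is implementation-defined in ISO C;
   GCC (and other two's-complement targets) wrap it modulo 256. */
void vec_cond_bytes(const uint8_t *a, const uint8_t *b, const uint8_t *c,
                    uint8_t *out)
{
    for (int i = 0; i < BYTES; i++)
        out[i] = ((int8_t)c[i] < 0) ? b[i] : a[i];
}

/* Returns 1 if both forms agree for every possible mask byte value. */
int blend_forms_match(void)
{
    uint8_t a[BYTES], b[BYTES], c[BYTES], r1[BYTES], r2[BYTES];
    for (int base = 0; base < 256; base += BYTES) {
        for (int i = 0; i < BYTES; i++) {
            a[i] = (uint8_t)i;
            b[i] = (uint8_t)(0xF0 + i);
            c[i] = (uint8_t)(base + i);  /* covers mask values 0..255 */
        }
        blendv_bytes(a, b, c, r1);
        vec_cond_bytes(a, b, c, r2);
        for (int i = 0; i < BYTES; i++)
            if (r1[i] != r2[i])
                return 0;
    }
    return 1;
}

/* Self-check wrapper so the equivalence can be asserted in one call. */
void check_blend_forms(void) { assert(blend_forms_match()); }
```

Only the sign bit of each mask element matters, which is why a signed comparison against zero expresses the same selection.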

[PATCH 2/2] [i386] For 128/256-bit vec_cond_expr, When mask operands is lt reg const0_rtx, blendv can be used instead of avx512 mask. [PR target/100648]

2021-05-23 Thread Hongtao Liu via Gcc-patches

Hi: This patch is about to add define_insn_and_split to convert avx512 mask mov back to pblendv instructions when mask operand is (lt: reg const0_rtx). Bootstrapped and regtested on x86_64-linux-gnu{-m32,}. Ok for trunk? gcc/ChangeLog: PR target/100648 * config/i386/sse.md

[PATCH] [i386] Support avx512 vector shift with vector [PR98434]

2021-05-24 Thread Hongtao Liu via Gcc-patches

Hi: This patch is about to add expanders for vashl, vlshr, vashr and vashr. Besides there's some assumption in expand_mult_const that mul and add must be available at the same time, but for i386, addv8qi is restricted under TARGET_64BIT, but mulv8qi not, that could cause ICE. So restrict mulv8qi

[PATCH] Extend is_cond_scalar_reduction to handle nop_expr after/before scalar reduction.[PR98365]

2021-05-24 Thread Hongtao Liu via Gcc-patches

Hi: Details described in PR. Bootstrapped and regtest on x86_64-linux-gnu{-m32,}/x86_64-linux-gnu{-m32\ -march=cascadelake,-march=cascadelake} Ok for trunk? gcc/ChangeLog: PR tree-optimization/pr98365 * tree-if-conv.c (strip_nop_cond_scalar_reduction): New function.

[PATCH][i386] Split not+broadcast+pand to broadcast+pandn.

2021-05-24 Thread Hongtao Liu via Gcc-patches

Hi: This patch performs the transformation shown below. Bootstrapped and regtested on x86_64-linux-gnu{-m32,}. Ok for trunk? from notl %edi vpbroadcastd %edi, %xmm0 vpand %xmm1, %xmm0, %xmm0 to vpbroadcastd %edi, %xmm0 vpandn %xmm1, %xmm0
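The rewrite relies on the identity broadcast(~x) & y == ~broadcast(x) & y. A scalar sketch of both instruction sequences, assuming four 32-bit lanes and using hypothetical function names (this is an emulation, not the patch itself):

```c
#include <assert.h>
#include <stdint.h>

#define LANES 4  /* four 32-bit lanes, as in a 128-bit xmm register */

/* Original sequence: invert the scalar, broadcast it, then AND. */
void not_broadcast_and(uint32_t x, const uint32_t *y, uint32_t *out)
{
    uint32_t inverted = ~x;           /* notl */
    for (int i = 0; i < LANES; i++)
        out[i] = inverted & y[i];     /* vpbroadcastd + vpand */
}

/* Rewritten sequence: broadcast first, then AND-NOT in each lane. */
void broadcast_andn(uint32_t x, const uint32_t *y, uint32_t *out)
{
    for (int i = 0; i < LANES; i++) {
        uint32_t lane = x;            /* vpbroadcastd */
        out[i] = ~lane & y[i];        /* vpandn */
    }
}

/* Returns 1 if both sequences produce the same lanes for x and y. */
int sequences_match(uint32_t x, const uint32_t *y)
{
    uint32_t r1[LANES], r2[LANES];
    not_broadcast_and(x, y, r1);
    broadcast_andn(x, y, r2);
    for (int i = 0; i < LANES; i++)
        if (r1[i] != r2[i])
            return 0;
    return 1;
}

/* Self-check with one fixed input; the values are arbitrary. */
void check_sequences(void)
{
    const uint32_t y[LANES] = { 0xffffffffu, 0x12345678u, 0u, 0xdeadbeefu };
    assert(sequences_match(0x0f0f0f0fu, y));
}
```

Folding the NOT into the and-not operation is what saves the separate scalar instruction.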

Re: [PATCH] [i386] Fix _mm256_zeroupper to notify LRA that vzeroupper will kill sse registers. [PR target/82735]

2021-05-24 Thread Hongtao Liu via Gcc-patches

On Tue, May 18, 2021 at 11:18 PM Richard Sandiford wrote: > > Hongtao Liu via Gcc-patches writes: > > On Mon, May 17, 2021 at 5:56 PM Richard Sandiford > > wrote: > >> It looks like the rtx “used” flag is unused for INSNs, so we could > >> use that as a CALL_

Re: [PATCH][i386] Split not+broadcast+pand to broadcast+pandn.

2021-05-24 Thread Hongtao Liu via Gcc-patches

On Tue, May 25, 2021 at 2:11 PM Andrew Pinski wrote: > > On Mon, May 24, 2021 at 11:03 PM Hongtao Liu via Gcc-patches > wrote: > > > > Hi: > > This patch is about to do transformation like below. > > Bootstrapped and regtested on x86_64-linux-gnu{-m32,}.

Re: [PATCH] [i386] Fix _mm256_zeroupper to notify LRA that vzeroupper will kill sse registers. [PR target/82735]

2021-05-24 Thread Hongtao Liu via Gcc-patches

On Tue, May 25, 2021 at 2:04 PM Hongtao Liu wrote: > > On Tue, May 18, 2021 at 11:18 PM Richard Sandiford > wrote: > > > > Hongtao Liu via Gcc-patches writes: > > > On Mon, May 17, 2021 at 5:56 PM Richard Sandiford > > > wrote: > > >> It loo

Re: [PATCH][i386] Split not+broadcast+pand to broadcast+pandn.

2021-05-24 Thread Hongtao Liu via Gcc-patches

On Tue, May 25, 2021 at 2:29 PM Andrew Pinski wrote: > > On Mon, May 24, 2021 at 11:23 PM Hongtao Liu wrote: > > > > On Tue, May 25, 2021 at 2:11 PM Andrew Pinski wrote: > > > > > > On Mon, May 24, 2021 at 11:03 PM Hongtao Liu via Gcc-patches > >

Re: [PATCH][i386] Split not+broadcast+pand to broadcast+pandn.

2021-05-25 Thread Hongtao Liu via Gcc-patches

Update patch: The new patch simplifies (vec_duplicate (not (nonimmediate_operand))) to (not (vec_duplicate (nonimmediate_operand))). This is not a straightforward simplification, just adding some tendency to pull not out of vec_duplicate. For i386, it will enable the optimization below, from notl

Re: [PATCH] Extend is_cond_scalar_reduction to handle nop_expr after/before scalar reduction.[PR98365]

2021-05-25 Thread Hongtao Liu via Gcc-patches

On Tue, May 25, 2021 at 6:24 PM Richard Biener wrote: > > On Mon, May 24, 2021 at 11:52 AM Hongtao Liu wrote: > > > > Hi: > > Details described in PR. > > Bootstrapped and regtest on > > x86_64-linux-gnu{-m32,}/x86_64-linux-gnu{-m32\ > > -march=ca

Re: [PATCH][i386] Split not+broadcast+pand to broadcast+pandn.

2021-05-25 Thread Hongtao Liu via Gcc-patches

On Wed, May 26, 2021 at 12:12 PM Andrew Pinski wrote: > > On Tue, May 25, 2021 at 6:17 PM Hongtao Liu wrote: > > > > Update patch: > > The new patch simplifies (vec_duplicate (not (nonimmediate_operand))) > > to (not (vec_duplicate (nonimmediate_operand))). Thi

Re: [PATCH] [i386] Fix _mm256_zeroupper to notify LRA that vzeroupper will kill sse registers. [PR target/82735]

2021-05-26 Thread Hongtao Liu via Gcc-patches

Hi: This is an updated patch which implements vzeroupper as a call_insn with a special vzeroupper ABI. In this patch I also reverted r11-7684, r10-6451 and r10-3677, which seem to fix the same issue but in a different way. Bootstrapped and regtested on x86_64-linux-gnu{-m32,} and x86_64-linux-

Re: [PATCH] Extend is_cond_scalar_reduction to handle nop_expr after/before scalar reduction.[PR98365]

2021-05-27 Thread Hongtao Liu via Gcc-patches

On Wed, May 26, 2021 at 8:41 PM Richard Biener wrote: > > On Wed, May 26, 2021 at 7:06 AM Hongtao Liu wrote: > > > > On Tue, May 25, 2021 at 6:24 PM Richard Biener > > wrote: > > > > > > On Mon, May 24, 2021 at 11:52 AM Hongtao Liu wrote: > >

Re: [PATCH] Extend is_cond_scalar_reduction to handle nop_expr after/before scalar reduction.[PR98365]

2021-05-31 Thread Hongtao Liu via Gcc-patches

On Mon, May 31, 2021 at 6:14 PM Richard Biener wrote: > > On Thu, May 27, 2021 at 9:05 AM Hongtao Liu wrote: > > > > On Wed, May 26, 2021 at 8:41 PM Richard Biener > > wrote: > > > > > > On Wed, May 26, 2021 at 7:06 AM Hongtao Liu wrote: > > >

Re: [PATCH] [i386] Fix _mm256_zeroupper to notify LRA that vzeroupper will kill sse registers. [PR target/82735]

2021-05-31 Thread Hongtao Liu via Gcc-patches

On Thu, May 27, 2021 at 6:50 PM Richard Sandiford wrote: > > Jakub Jelinek writes: > > On Thu, May 27, 2021 at 01:07:09PM +0800, Hongtao Liu via Gcc-patches wrote: > >> + /* Flag used for call_insn indicates it's a fake call. */ > >> + RTX_FLAG (insn, used

Re: [PATCH] [i386] Fix _mm256_zeroupper to notify LRA that vzeroupper will kill sse registers. [PR target/82735]

2021-05-31 Thread Hongtao Liu via Gcc-patches

On Thu, May 27, 2021 at 3:05 PM Uros Bizjak wrote: > > On Thu, May 27, 2021 at 7:03 AM Hongtao Liu wrote: > > > > Hi: > > This is an updated patch which implements vzeroupper as call_insn > > which has a special vzeroupper ABI, also in this patch I reverted

Re: [PATCH] [i386] Fix _mm256_zeroupper to notify LRA that vzeroupper will kill sse registers. [PR target/82735]

2021-05-31 Thread Hongtao Liu via Gcc-patches

On Tue, Jun 1, 2021 at 10:22 AM Hongtao Liu wrote: > > On Thu, May 27, 2021 at 6:50 PM Richard Sandiford > wrote: > > > > Jakub Jelinek writes: > > > On Thu, May 27, 2021 at 01:07:09PM +0800, Hongtao Liu via Gcc-patches > > > wrote: > > >>

[PATCH] Simplify (view_convert ~a) < 0 to (view_convert a) >= 0 [PR middle-end/100738]

2021-05-31 Thread Hongtao Liu via Gcc-patches

Hi: This patch is about to simplify (view_convert:type ~a) < 0 to (view_convert:type a) >= 0 when type is signed integer. Similar for (view_convert:type ~a) >= 0. Bootstrapped and regtested on x86_64-linux-gnu{-m32,}. Ok for the trunk? gcc/ChangeLog: PR middle-end/100738 * m
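For a signed integer, the bitwise NOT flips the sign bit, so testing `~a < 0` is the same as testing `a >= 0`. A scalar sketch of that identity, assuming plain `int` in place of the view-converted type and hypothetical function names (it does not model the view_convert itself):

```c
#include <assert.h>
#include <limits.h>

/* Left-hand side of the simplification: test the sign of ~a. */
int not_is_negative(int a) { return ~a < 0; }

/* Right-hand side: the equivalent test without the bitwise NOT. */
int is_non_negative(int a) { return a >= 0; }

/* Returns 1 if both forms agree on a handful of boundary values. */
int forms_agree(void)
{
    const int samples[] = { INT_MIN, -2, -1, 0, 1, 2, INT_MAX };
    for (unsigned i = 0; i < sizeof samples / sizeof samples[0]; i++)
        if (not_is_negative(samples[i]) != is_non_negative(samples[i]))
            return 0;
    return 1;
}

/* Self-check wrapper so the identity can be asserted in one call. */
void check_forms_agree(void) { assert(forms_agree()); }
```

The mirrored case mentioned in the message, `~a >= 0`, corresponds to `a < 0` by the same argument.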

Re: [PATCH][i386] Split not+broadcast+pand to broadcast+pandn.

2021-06-01 Thread Hongtao Liu via Gcc-patches

On Wed, May 26, 2021 at 1:17 PM Hongtao Liu wrote: > > On Wed, May 26, 2021 at 12:12 PM Andrew Pinski wrote: > > > > On Tue, May 25, 2021 at 6:17 PM Hongtao Liu wrote: > > > > > > Update patch: > > > The new patch simplify (vec_duplicat

Re: [PATCH] Simplify (view_convert ~a) < 0 to (view_convert a) >= 0 [PR middle-end/100738]

2021-06-01 Thread Hongtao Liu via Gcc-patches

On Tue, Jun 1, 2021 at 1:29 PM Andrew Pinski wrote: > > On Mon, May 31, 2021 at 10:21 PM Hongtao Liu via Gcc-patches > wrote: > > > > Hi: > > This patch is about to simplify (view_convert:type ~a) < 0 to > > (view_convert:type a) >= 0 w

Re: [PATCH v2] Add vec_const_duplicate optab and TARGET_GEN_MEMSET_SCRATCH_RTX

2021-06-01 Thread Hongtao Liu via Gcc-patches

On Wed, Jun 2, 2021 at 7:07 AM H.J. Lu via Gcc-patches wrote: > > On Tue, Jun 1, 2021 at 7:21 AM Jeff Law wrote: > > > > > > > > On 6/1/2021 7:29 AM, H.J. Lu via Gcc-patches wrote: > > > On Tue, Jun 1, 2021 at 6:25 AM Richard Biener > > > wrote: > > >> On Tue, Jun 1, 2021 at 3:05 PM H.J. Lu wro

Re:

2021-06-01 Thread Hongtao Liu via Gcc-patches

Please discard this one, sorry for disturbing. Obviously I'm new to git send-email. On Wed, Jun 2, 2021 at 1:40 PM liuhongt via Gcc-patches wrote: > > This is the updated patch. > > -- BR, Hongtao

Re: [PATCH 1/2] CALL_INSN may not be a real function call.

2021-06-03 Thread Hongtao Liu via Gcc-patches

Ping. This is a split-out middle-end patch as a follow-up to https://gcc.gnu.org/pipermail/gcc-patches/2021-June/571544.html On Thu, Jun 3, 2021 at 2:54 PM liuhongt via Gcc-patches wrote: > > Use "used" flag for CALL_INSN to indicate it's a fake call. If it's a > fake call, it won't have its own f

Re: [PATCH 2/2] Fix _mm256_zeroupper by representing the instructions as call_insns in which the call has a special vzeroupper ABI.

2021-06-03 Thread Hongtao Liu via Gcc-patches

Ping. This is a split-out backend patch as a follow-up to https://gcc.gnu.org/pipermail/gcc-patches/2021-June/571545.html On Thu, Jun 3, 2021 at 2:55 PM liuhongt via Gcc-patches wrote: > > When __builtin_ia32_vzeroupper is called explicitly, the corresponding > vzeroupper pattern does not carry an

Re: [PATCH] Simplify (view_convert ~a) < 0 to (view_convert a) >= 0 [PR middle-end/100738]

2021-06-03 Thread Hongtao Liu via Gcc-patches

On Tue, Jun 1, 2021 at 6:17 PM Marc Glisse wrote: > > On Tue, 1 Jun 2021, Hongtao Liu via Gcc-patches wrote: > > > Hi: > > This patch is about to simplify (view_convert:type ~a) < 0 to > > (view_convert:type a) >= 0 when type is signed integer. Similar fo

Re: [PATCH 1/2] [i386] Fold blendv builtins into gimple.

2021-06-03 Thread Hongtao Liu via Gcc-patches

ping On Mon, May 24, 2021 at 12:56 PM Hongtao Liu wrote: > > Hi: > This patch is about to Fold __builtin_ia32_pblendvb128 (a, b, c) as > VEC_COND_EXPR (c < 0, b, a), similar for float version but with > mask operand VIEW_CONVERT_EXPR to same sized integer vectype. >

Re: [PATCH 2/2] Fix _mm256_zeroupper by representing the instructions as call_insns in which the call has a special vzeroupper ABI.

2021-06-03 Thread Hongtao Liu via Gcc-patches

On Fri, Jun 4, 2021 at 2:27 PM Uros Bizjak via Gcc-patches wrote: > > On Thu, Jun 3, 2021 at 8:54 AM liuhongt wrote: > > > > When __builtin_ia32_vzeroupper is called explicitly, the corresponding > > vzeroupper pattern does not carry any CLOBBERS or SETs before LRA, > > which leads to incorrect o

Re: [PATCH] Simplify (view_convert ~a) < 0 to (view_convert a) >= 0 [PR middle-end/100738]

2021-06-04 Thread Hongtao Liu via Gcc-patches

On Fri, Jun 4, 2021 at 1:01 PM Hongtao Liu wrote: > > On Tue, Jun 1, 2021 at 6:17 PM Marc Glisse wrote: > > > > On Tue, 1 Jun 2021, Hongtao Liu via Gcc-patches wrote: > > > > > Hi: > > > This patch is about to simplify (view_convert:type ~a) < 0 to

Re: [PATCH] Simplify (view_convert ~a) < 0 to (view_convert a) >= 0 [PR middle-end/100738]

2021-06-06 Thread Hongtao Liu via Gcc-patches

On Fri, Jun 4, 2021 at 4:18 PM Marc Glisse wrote: > > On Fri, 4 Jun 2021, Hongtao Liu via Gcc-patches wrote: > > > On Tue, Jun 1, 2021 at 6:17 PM Marc Glisse wrote: > >> > >> On Tue, 1 Jun 2021, Hongtao Liu via Gcc-patches wrote: > >> >

Re: [PATCH] Simplify (view_convert ~a) < 0 to (view_convert a) >= 0 [PR middle-end/100738]

2021-06-07 Thread Hongtao Liu via Gcc-patches

On Mon, Jun 7, 2021 at 2:22 PM Hongtao Liu wrote: > > On Fri, Jun 4, 2021 at 4:18 PM Marc Glisse wrote: > > > > On Fri, 4 Jun 2021, Hongtao Liu via Gcc-patches wrote: > > > > > On Tue, Jun 1, 2021 at 6:17 PM Marc Glisse wrote: > > >> > > >

Re: [PATCH v3 1/2] x86: Convert CONST_WIDE_INT/CONST_VECTOR to broadcast

2021-06-09 Thread Hongtao Liu via Gcc-patches

On Wed, Jun 9, 2021 at 2:02 AM H.J. Lu via Gcc-patches wrote: > > 1. Update move expanders to convert the CONST_WIDE_INT and CONST_VECTO > operands to vector broadcast from an integer with AVX2. > 2. Add ix86_gen_scratch_sse_rtx to return a scratch SSE register which > won't increase stack alignme

Re: [PATCH] [GCC12] Mention Intel AVX512-FP16 and _Float16 support.

2021-10-07 Thread Hongtao Liu via Gcc-patches

On Fri, Oct 1, 2021 at 6:13 PM Gerald Pfeifer wrote: > > On Fri, 24 Sep 2021, Hongtao Liu via Gcc-patches wrote: > > + New ISA extension support for Intel AVX512-FP16 was added to GCC. > > + AVX512FP16 intrinsics are available [...] > > So, is it AVX512-FP16 or A

Re: [PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-10-07 Thread Hongtao Liu via Gcc-patches

On Thu, Oct 7, 2021 at 11:38 PM H.J. Lu via Gcc-patches wrote: > > On Thu, Oct 7, 2021 at 8:35 AM Martin Liška wrote: > > > > Hello. > > > > The patch is approved, are you planning committing the changes? Committed. > > > > Thanks, > > Martin > > Hongtao is on holiday. He will be back later toda

Re: [PATCH] Refine movhfcc.

2021-10-08 Thread Hongtao Liu via Gcc-patches

On Fri, Oct 8, 2021 at 5:31 PM liuhongt wrote: > > For AVX512-FP16, HFmode only supports vcmpsh whose dest is mask > register, so for movhfcc, it's > > vcmpsh op2, op1, %k1 > vmovsh op1, op2{%k1} > mov op2, dest > > gcc/ChangeLog: > > PR target/102639 > * config/i386/i386-expand.c

Re: [r12-4240 Regression] FAIL: libgomp.c++/scan-9.C scan-tree-dump-times vect "vectorized [2-6] loops" 2 on Linux/x86_64

2021-10-08 Thread Hongtao Liu via Gcc-patches

On Fri, Oct 8, 2021 at 8:02 PM sunil.k.pandey via Gcc-patches wrote: > > On Linux/x86_64, > > 2b8453c401b699ed93c085d0413ab4b5030bcdb8 is the first bad commit > commit 2b8453c401b699ed93c085d0413ab4b5030bcdb8 > Author: liuhongt > Date: Mon Sep 6 13:48:49 2021 +0800 > > Enable auto-vectoriza

Re: [r12-4240 Regression] FAIL: libgomp.c++/scan-9.C scan-tree-dump-times vect "vectorized [2-6] loops" 2 on Linux/x86_64

2021-10-08 Thread Hongtao Liu via Gcc-patches

On Sat, Oct 9, 2021 at 1:27 AM sunil.k.pandey via Gcc-patches wrote: > > On Linux/x86_64, > > 2b8453c401b699ed93c085d0413ab4b5030bcdb8 is the first bad commit > commit 2b8453c401b699ed93c085d0413ab4b5030bcdb8 > Author: liuhongt > Date: Mon Sep 6 13:48:49 2021 +0800 > > Enable auto-vectoriza

Re: [r12-4240 Regression] FAIL: libgomp.c++/scan-9.C scan-tree-dump-times vect "vectorized [2-6] loops" 2 on Linux/x86_64

2021-10-08 Thread Hongtao Liu via Gcc-patches

On Sat, Oct 9, 2021 at 10:53 AM Hongtao Liu wrote: > > On Sat, Oct 9, 2021 at 1:27 AM sunil.k.pandey via Gcc-patches > wrote: > > > > On Linux/x86_64, > > > > 2b8453c401b699ed93c085d0413ab4b5030bcdb8 is the first bad commit > > commit 2b8453c401b699ed93c085

Re: [PATCH] testsuite: Add missing comment for some dg-warning

2021-10-09 Thread Hongtao Liu via Gcc-patches

On Sat, Oct 9, 2021 at 3:51 PM Kewen.Lin via Gcc-patches wrote: > > Hi, > > This patch fixes the typos introduced by commit r12-4240. > > The dg-warning format looks like: > > { dg-warning regexp [comment [{ target/xfail selector } [line] ]] } > > Some dg-warnings such as: > > { dg-warning "\\\[-W

Re: [PATCH] Adjust testcase for O2 vectorization enabling

2021-10-10 Thread Hongtao Liu via Gcc-patches

On Mon, Oct 11, 2021 at 10:23 AM Kewen.Lin via Gcc-patches wrote: > > Hi Hongtao, > > on 2021/10/11 10:10 AM, liuhongt via Gcc-patches wrote: > > libgomp/ChangeLog: > > > > * testsuite/libgomp.graphite/force-parallel-8.c: Add > > -fno-tree-vectorize. > > --- > > libgomp/testsuite/libgomp.gr

Re: [PATCH] x86_64: Some SUBREG related optimization tweaks to i386 backend.

2021-10-11 Thread Hongtao Liu via Gcc-patches

On Mon, Oct 11, 2021 at 4:55 PM Roger Sayle wrote: > > > This patch contains two SUBREG-related optimization enabling tweaks to > the x86 backend. > > The first change, to ix86_expand_vector_extract, cures the strange > -march=cascadelake related non-determinism that affected my new test > cases l

Re: [PATCH] vectorizer: Fix up -fsimd-cost-model= handling

2021-10-11 Thread Hongtao Liu via Gcc-patches

On Monday, October 11, 2021, Jakub Jelinek via Gcc-patches < gcc-patches@gcc.gnu.org> wrote: > On Mon, Oct 11, 2021 at 11:28:55AM +0200, Jakub Jelinek via Gcc-patches > wrote: > > I don't think this is the right thing to do. > > This just means that at some point between 2013 when -fsimd-cost-mode

Re: [PATCH] rs6000/test: Adjust some cases due to O2 vect [PR102658]

2021-10-11 Thread Hongtao Liu via Gcc-patches

On Tue, Oct 12, 2021 at 4:08 AM Martin Sebor via Gcc-patches wrote: > > On 10/11/21 11:43 AM, Segher Boessenkool wrote: > > On Mon, Oct 11, 2021 at 10:23:03AM -0600, Martin Sebor wrote: > >> On 10/11/21 9:30 AM, Segher Boessenkool wrote: > >>> On Mon, Oct 11, 2021 at 10:47:00AM +0800, Kewen.Lin wr

Re: [PATCH] rs6000/test: Adjust some cases due to O2 vect [PR102658]

2021-10-12 Thread Hongtao Liu via Gcc-patches

On Tue, Oct 12, 2021 at 11:49 PM Martin Sebor wrote: > > On 10/11/21 8:31 PM, Hongtao Liu wrote: > > On Tue, Oct 12, 2021 at 4:08 AM Martin Sebor via Gcc-patches > > wrote: > >> > >> On 10/11/21 11:43 AM, Segher Boessenkool wrote: > >>> On Mon

Re: [PATCH] rs6000/test: Adjust some cases due to O2 vect [PR102658]

2021-10-12 Thread Hongtao Liu via Gcc-patches

On Wed, Oct 13, 2021 at 11:34 AM Hongtao Liu wrote: > > On Tue, Oct 12, 2021 at 11:49 PM Martin Sebor wrote: > > > > On 10/11/21 8:31 PM, Hongtao Liu wrote: > > > On Tue, Oct 12, 2021 at 4:08 AM Martin Sebor via Gcc-patches > > > wrote: > > >>

Re: [PATCH] AVX512FP16: Adjust builtin for mask complex fma

2021-10-13 Thread Hongtao Liu via Gcc-patches

On Wed, Oct 13, 2021 at 5:07 PM Hongyu Wang via Gcc-patches wrote: > > Hi, > > Current mask/mask3 implementation for complex fma contains > duplicated parameter in macro, which may cause error at -O0. > Refactor macro implementation to builtins to avoid potential > error. > > For round intrinsic w

Re: [PATCH] AVX512FP16: Support vector shuffle builtins

2021-10-13 Thread Hongtao Liu via Gcc-patches

On Thu, Oct 14, 2021 at 10:39 AM Hongyu Wang via Gcc-patches wrote: > > Hi, > > This patch supports HFmode vector shuffle by creating HImode subreg when > expanding permutation expr. > > Bootstrapped/regtested on x86_64-pc-linux-gnu{-m32,} and sde{-m32,} > OK for master? > > gcc/ChangeLog: > >

Re: [PATCH] AVX512FP16: Fix ICE for 2 v4hf vector concat

2021-10-14 Thread Hongtao Liu via Gcc-patches

On Fri, Oct 15, 2021 at 1:07 PM Hongyu Wang via Gcc-patches wrote: > > Hi, > > For V4HFmode, doing vector concat like > > __builtin_shufflevector (a, b, {0, 1, 2, 3, 4, 5, 6, 7}) > > could trigger ICE since it is not handled in ix86_vector_init (). > > Handle HFmode like HImode to avoid such ICE.

Re: [PATCH] AVX512FP16: Support vector shuffle builtins

2021-10-14 Thread Hongtao Liu via Gcc-patches

On Fri, Oct 15, 2021 at 1:37 PM Hongyu Wang wrote: > > > This part seems not related to vector shuffle. > Yes, have separated this part to another patch and checked-in. > > Updated patch. Ok for this one? > > Hongtao Liu via Gcc-patches wrote on Thu, Oct 14, 2021 at 2:33 PM: > >

Re: [PATCH] AVX512FP16: Support vector shuffle builtins

2021-10-14 Thread Hongtao Liu via Gcc-patches

checks for NULL seems reasonable according to documents, > > op0,op1,target maybe NULL. > Thanks for pointing it out, didn't realize the difference between > these 2 functions. LGTM. > > Updated patch. > > Hongtao Liu wrote on Fri, Oct 15, 2021 at 1:54 PM: > > > > On Fri, O

Re: [PATCH] AVX512FP16: Add *_set1_pch intrinsics.

2021-10-17 Thread Hongtao Liu via Gcc-patches

On Fri, Oct 15, 2021 at 4:38 PM dianhong.xu--- via Gcc-patches wrote: > > From: dianhong xu > > Add *_set1_pch (_Float16 _Complex A) intrinsics. > > gcc/ChangeLog: > > * config/i386/avx512fp16intrin.h: > (_mm512_set1_pch): New intrinsic. > * config/i386/avx512fp16vlintrin.

Re: [PATCH] Adjust testcase for O2 vectorization.

2021-10-17 Thread Hongtao Liu via Gcc-patches

On Fri, Oct 15, 2021 at 11:37 PM Martin Sebor wrote: > > On 10/14/21 1:11 AM, liuhongt wrote: > > Hi Kewen: > >Could you help to verify if this patch fixes those regressions > > for rs6000 port. > > > > As discussed in [1], this patch adds xfail/target selector to those > > testcases, also make a

Re: [PATCH] Adjust testcase for O2 vectorization.

2021-10-17 Thread Hongtao Liu via Gcc-patches

On Fri, Oct 15, 2021 at 3:11 PM Kewen.Lin via Gcc-patches wrote: > > on 2021/10/14 6:56 PM, Kewen.Lin via Gcc-patches wrote: > > Hi Hongtao, > > > > on 2021/10/14 3:11 PM, liuhongt wrote: > >> Hi Kewen: > >> Could you help to verify if this patch fixes those regressions > >> for rs6000 port. > >> >

Re: [PATCH] Adjust testcase for O2 vectorization.

2021-10-20 Thread Hongtao Liu via Gcc-patches

On Wed, Oct 20, 2021 at 7:34 PM Christophe Lyon via Gcc-patches wrote: > > Hi, > > > On Tue, Oct 19, 2021 at 11:03 AM liuhongt via Gcc-patches < > gcc-patches@gcc.gnu.org> wrote: > > > updated patch: > > 1. Add documents in doc/sourcebuild.texi (Effective-Target Keywords). > > 2. Reduce -novec

Re: [PATCH] Adjust testcase for O2 vectorization.

2021-10-20 Thread Hongtao Liu via Gcc-patches

On Thu, Oct 21, 2021 at 9:20 AM Hongtao Liu wrote: > > On Wed, Oct 20, 2021 at 7:34 PM Christophe Lyon via Gcc-patches > wrote: > > > > Hi, > > > > > > On Tue, Oct 19, 2021 at 11:03 AM liuhongt via Gcc-patches < > > gcc-patches@gcc.gnu.org

Re: [PATCH] Adjust testcase for O2 vectorization.

2021-10-20 Thread Hongtao Liu via Gcc-patches

On Thu, Oct 21, 2021 at 10:06 AM Hongtao Liu wrote: > > On Thu, Oct 21, 2021 at 9:20 AM Hongtao Liu wrote: > > > > On Wed, Oct 20, 2021 at 7:34 PM Christophe Lyon via Gcc-patches > > wrote: > > > > > > Hi, > > > > > > > > >

Re: [PATCH] i386: Fix wrong codegen for V8HF move without TARGET_AVX512F

2021-10-20 Thread Hongtao Liu via Gcc-patches

On Wed, Oct 20, 2021 at 1:31 PM Hongyu Wang via Gcc-patches wrote: > > Since _Float16 type is enabled under sse2 target, returning > V8HFmode vector without AVX512F target would generate wrong > vmovdqa64 instruction. Adjust ix86_get_ssemov to avoid this. > > Bootstrapped/regtested on x86_64-pc-lin
