On Mon, May 9, 2022 at 2:43 PM liuhongt via Gcc-patches
wrote:
>
> Clean up of 16-bit uppers is not needed for pmovzxbq/pmovsxbq.
>
> Bootstrapped and regtested on x86_64-pc-linux-gnu{-m32,}.
> Ok for trunk?
>
> gcc/ChangeLog:
>
> PR target/105072
> * config/i386/sse.md (*sse4_1_v2
On Mon, May 9, 2022 at 4:19 PM Uros Bizjak wrote:
>
> On Mon, May 9, 2022 at 7:24 AM Hongtao Liu wrote:
> >
> > On Mon, May 9, 2022 at 1:22 PM liuhongt via Gcc-patches
> > wrote:
> > >
> > > pand/pandn may be used to clear upper/lower bits of the oper
On Mon, May 9, 2022 at 4:28 PM Uros Bizjak wrote:
>
> On Mon, May 9, 2022 at 4:03 AM liuhongt wrote:
> >
> > Similarly optimize movl + vmovq to vmovd.
> >
> > Bootstrapped and regtested on x86_64-pc-linux-gnu{-m32,}.
> > Ok for trunk?
> >
> > gcc/ChangeLog:
> >
> > PR target/104915
> >
On Tue, May 10, 2022 at 2:54 PM Richard Biener via Gcc-patches
wrote:
>
> On Mon, May 9, 2022 at 7:11 AM liuhongt via Gcc-patches
> wrote:
> >
> > Here's adjused patch.
> > Ok for trunk?
> >
> > Optimize
> >
> > _4 = VEC_PERM_EXPR <_1, _1, { 4, 5, 6, 7, 4, 5, 6, 7 }>;
> > _5 = BIT_FIELD_REF <
On Wed, May 11, 2022 at 4:45 PM Richard Biener via Gcc-patches
wrote:
>
> On Mon, May 9, 2022 at 7:19 AM liuhongt wrote:
> >
> > This patch will enable below optimization:
> >
> > {
> > - int bit;
> > - long long unsigned int _1;
> > - long long unsigned int _2;
> > -
> > [local count: 46
ping.
On Sat, May 7, 2022 at 1:05 PM liuhongt via Gcc-patches
wrote:
>
> This is adjusted patch only for OImode.
>
> Bootstrapped and regtested on x86_64-pc-linux-gnu{-m32,}.
> Ok for trunk?
>
> gcc/ChangeLog:
>
> PR target/104610
> * config/i386/i386-expand.cc (ix86_expand_branch
On Mon, May 16, 2022 at 5:21 PM Uros Bizjak via Gcc-patches
wrote:
>
> On Sat, May 7, 2022 at 7:05 AM liuhongt wrote:
> >
> > This is adjusted patch only for OImode.
> >
> > Bootstrapped and regtested on x86_64-pc-linux-gnu{-m32,}.
> > Ok for trunk?
> >
> > gcc/ChangeLog:
> >
> > PR targe
I've committed the patch.
On Fri, May 13, 2022 at 5:22 PM liuhongt via Gcc-patches
wrote:
>
> Here's updated patch which adds ix86_pre_reload_split () to those 2
> define_insn_and_splits.
>
> Assembly Optimization like:
> - vmovq %xmm0, %xmm2
> - vmovdqa .LC0(%rip), %xmm0
>
On Tue, May 17, 2022 at 11:06 AM liuhongt via Gcc-patches
wrote:
>
> backend has
>
> 16550(define_insn "*bmi2_bzhi_3_2"
> 16551 [(set (match_operand:SWI48 0 "register_operand" "=r")
> 16552(and:SWI48
> 16553 (plus:SWI48
> 16554(ashift:SWI48 (const_int 1)
> 16555
thanks.
On Tue, May 17, 2022 at 3:09 PM Jakub Jelinek via Gcc-patches
wrote:
>
> Hi!
>
> When looking around the spot of the PR105591 fix, I've noticed a typo
> and incorrectly formatted comment.
>
> Bootstrapped/regtested on x86_64-linux and i668-linux, committed to
> trunk as obvious.
>
> 2022-
On Fri, May 13, 2022 at 7:16 PM Richard Biener
wrote:
>
> On Fri, May 13, 2022 at 5:37 AM Hongtao Liu wrote:
> >
> > On Wed, May 11, 2022 at 4:45 PM Richard Biener via Gcc-patches
> > wrote:
> > >
> > > On Mon, May 9, 2022 at 7:19 AM liuhongt wrote:
&g
On Tue, May 17, 2022 at 6:03 PM Uros Bizjak wrote:
>
> On Tue, May 17, 2022 at 3:33 AM Hongtao Liu wrote:
> >
> > On Mon, May 16, 2022 at 5:21 PM Uros Bizjak via Gcc-patches
> > wrote:
> > >
> > > On Sat, May 7, 2022 at 7:05 AM liuhongt wrote:
>
On Tue, May 17, 2022 at 6:07 PM Uros Bizjak via Gcc-patches
wrote:
>
> On Tue, May 17, 2022 at 5:06 AM liuhongt wrote:
> >
> > backend has
> >
> > 16550(define_insn "*bmi2_bzhi_3_2"
> > 16551 [(set (match_operand:SWI48 0 "register_operand" "=r")
> > 16552(and:SWI48
> > 16553 (pl
On Wed, May 25, 2022 at 11:39 AM liuhongt via Gcc-patches
wrote:
>
> Rigt now, mem_cost for separate mem alternative is 1 * frequency which
> is pretty small and caused the unnecessary SSE spill in the PR, I've tried
> to rework backend cost model, but RA still not happy with that(regress
> somewh
On Fri, Oct 30, 2020 at 1:00 AM Richard Sandiford
wrote:
>
> I guess my main objection is that we have a special memory constraint
> that isn't in fact matching a MEM (at least not directly). That seems
> odd and feels like it's going to come back to bite us.
>
> From an RTL perspective, the MEM
rnal/us/en/documents/architecture-instruction-set-extensions-programming-reference.pdf
> >
> > Bootstrap ok, regression test on i386/x86 backend is ok.
> >
> > OK for master?
> >
> > 2020-10-13 Hongtao Liu
> > Hongyu Wang
> >
> > gcc/
>
On Tue, Nov 3, 2020 at 9:51 PM Richard Sandiford
wrote:
>
> Vladimir Makarov via Gcc-patches writes:
> > On 2020-10-27 2:53 a.m., Hongtao Liu wrote:
> >> Hi:
> >>For inline asm, there could be an operand like (not (mem:)), it's
> >> not
Hi:
When programmers explicitly use mask loaded intrinsics, don't
transform the instruction to vpblend{b,w,d,q} since If mem_addr points
to a memory region with less than whole vector size of accessible
memory, the mask would prevent reading the inaccessible bytes which
could avoid fault.
Boo
it |= OPTION_MASK_ISA2_AVX2_UNSET;
> }
>return true;
>
Yes.
> No review for the sse.md and for testcases.
>
> Uros.
Update the patch based on latest trunk.
--
BR,
Hongtao
From 881868b8c9f5925c63a953454f45f5e0a3c8ea4f Mon Sep 17 00:00:00 2001
From: liuhongt
Date: T
On Mon, Nov 9, 2020 at 8:26 PM Uros Bizjak wrote:
>
> On Mon, Nov 9, 2020 at 11:31 AM Hongtao Liu wrote:
> >
> > >
> > > + /* Support unified builtin. */
> > > + || (mask2 == OPTION_MASK_ISA2_AVXVNNI)
> > >
> > > I don
On Tue, Nov 10, 2020 at 3:22 AM Jason Merrill via Gcc-patches
wrote:
>
> This patch was also applied to the GCC 9 and 10 branches and breaks those
> builds, because PTA_CLDEMOTE is not defined.
>
Mine, let me fix it, sorry for that.
--
BR,
Hongtao
On Tue, Nov 10, 2020 at 4:17 PM Hongtao Liu wrote:
>
> On Tue, Nov 10, 2020 at 3:22 AM Jason Merrill via Gcc-patches
> wrote:
> >
> > This patch was also applied to the GCC 9 and 10 branches and breaks those
> > builds, because PTA_CLDEMOTE is not defined.
> >
ping ^3
Rebase patch on latest trunk.
On Tue, Oct 27, 2020 at 3:51 PM Hongtao Liu wrote:
>
> ping^1
>
> On Tue, Oct 20, 2020 at 3:36 PM Richard Biener
> wrote:
> >
> > On Tue, Oct 20, 2020 at 4:35 AM Hongtao Liu wrote:
> > >
> > > On Mon, Oct 19
[GCC-11] Mention Intel AVX-VNNI and add it to ALDERLAKE and SAPPIRERAPIDS,
also add HRESET to ALDERLAKE.
diff --git a/htdocs/gcc-11/changes.html b/htdocs/gcc-11/changes.html
index d7a3a1f9..fc4c74f4 100644
--- a/htdocs/gcc-11/changes.html
+++ b/htdocs/gcc-11/changes.html
@@ -267,15 +267,20 @@ a wo
On Wed, Nov 11, 2020 at 4:45 PM Uros Bizjak wrote:
>
> > gcc/ChangeLog:
> >
> > PR target/97194
> > * config/i386/i386-expand.c (ix86_expand_vector_set_var): New function.
> > * config/i386/i386-protos.h (ix86_expand_vector_set_var): New Decl.
> > * config/i386/predicates.md (vec_setm_operand): Ne
On Thu, Nov 12, 2020 at 4:21 PM Uros Bizjak wrote:
>
> On Thu, Nov 12, 2020 at 3:04 AM Hongtao Liu wrote:
>
> > > > gcc/ChangeLog:
> > > >
> > > > PR target/97194
> > > > * config/i386/i386-expand.c (ix86_expand_vector_se
On Thu, Nov 12, 2020 at 5:12 PM Hongtao Liu wrote:
>
> On Thu, Nov 12, 2020 at 4:21 PM Uros Bizjak wrote:
> >
> > On Thu, Nov 12, 2020 at 3:04 AM Hongtao Liu wrote:
> >
> > > > > gcc/ChangeLog:
> > > > >
> > > > > PR target/97
On Thu, Nov 12, 2020 at 5:15 PM Hongtao Liu wrote:
>
> On Thu, Nov 12, 2020 at 5:12 PM Hongtao Liu wrote:
> >
> > On Thu, Nov 12, 2020 at 4:21 PM Uros Bizjak wrote:
> > >
> > > On Thu, Nov 12, 2020 at 3:04 AM Hongtao Liu wrote:
> > >
> > >
Got it.
On Fri, Nov 13, 2020 at 3:26 AM Gerald Pfeifer wrote:
>
> On Wed, 11 Nov 2020, Hongtao Liu via Gcc-patches wrote:
> > + New ISA extension support for Intel AVX-VNNI was added to GCC.
>
> More for the future (i.e., no need to change that now): I suggest
> to skip &
On Fri, Nov 13, 2020 at 3:32 AM Gerald Pfeifer wrote:
>
> Per our discussion on the list (plus a grammer improvement in a
> section above).
>
> One question: why are the ISA extension lists not alphabetically
> sorted? Wouldn't that be beneficial for users? Easier to find
> something and also ea
On Tue, Nov 17, 2020 at 8:05 AM Jeff Law wrote:
>
>
> On 9/2/20 3:34 AM, Hongtao Liu via Gcc-patches wrote:
> > Hi:
> > Add define_peephole2 to eliminate potential redundant conversion
> > from mask to vector.
> > Bootstrap is ok, regression test is ok for
On Tue, Aug 31, 2021 at 7:56 PM Richard Biener
wrote:
>
> On Tue, Aug 31, 2021 at 12:18 PM Hongtao Liu wrote:
> >
> > On Mon, Aug 30, 2021 at 8:25 PM Richard Biener via Gcc-patches
> > wrote:
> > >
> > > On Fri, Aug 27, 2021 at 8:53 AM liuh
I'm going to check in the first 3 patches which are already approved.
Update hf soft-fp from glibc.
[i386] Enable _Float16 type for TARGET_SSE2 and above.
[i386] libgcc: Enable hfmode soft-sf/df/xf/tf extensions and
truncations.
On Mon, Aug 2, 2021 at 2:31 PM liuhongt wrote:
>
> Update
On Tue, Aug 24, 2021 at 5:39 PM Hongtao Liu wrote:
>
> On Tue, Aug 17, 2021 at 9:53 AM Hongtao Liu wrote:
> >
> > On Fri, Aug 6, 2021 at 2:06 PM Hongtao Liu wrote:
> > >
> > > On Tue, Aug 3, 2021 at 10:44 AM Hongtao Liu wrote:
> > > >
>
On Wed, Sep 1, 2021 at 8:52 PM Richard Sandiford
wrote:
>
> Richard Biener writes:
> > On Wed, Sep 1, 2021 at 8:28 AM Hongtao Liu wrote:
> >>
> >> On Tue, Aug 31, 2021 at 7:56 PM Richard Biener
> >> wrote:
> >> >
> >> > On Tue, Au
On Thursday, September 2, 2021, Iain Sandoe wrote:
> Hi Hongtao.
>
> > On 2 Sep 2021, at 07:06, Hongtao Liu via Gcc-patches <
> gcc-patches@gcc.gnu.org> wrote:
> >
> > I'm going to check in the first 3 patches which are already approved.
> >
> >
On Thursday, September 2, 2021, H.J. Lu wrote:
> On Wed, Sep 1, 2021 at 11:00 PM Hongtao Liu wrote:
> >
> > I'm going to check in the first 3 patches which are already approved.
> >
> > Update hf soft-fp from glibc.
> > [i386] Enable _Float16 type
On Fri, Sep 3, 2021 at 5:01 PM Jakub Jelinek via Gcc-patches
wrote:
>
> On Fri, Sep 03, 2021 at 03:41:13PM +0800, liuhongt via Gcc-patches wrote:
> > --- a/libgcc/config/i386/64/t-softfp
> > +++ b/libgcc/config/i386/64/t-softfp
> > @@ -1 +1,6 @@
> > softfp_extras := fixhfti fixunshfti floattihf f
On Fri, Sep 3, 2021 at 5:01 PM Jakub Jelinek via Gcc-patches
wrote:
>
> On Fri, Sep 03, 2021 at 03:41:13PM +0800, liuhongt via Gcc-patches wrote:
> > --- a/libgcc/config/i386/64/t-softfp
> > +++ b/libgcc/config/i386/64/t-softfp
> > @@ -1 +1,6 @@
> > softfp_extras := fixhfti fixunshfti floattihf f
On Fri, Sep 3, 2021 at 8:42 PM Jakub Jelinek via Gcc-patches
wrote:
>
> On Mon, Aug 02, 2021 at 02:31:12PM +0800, liuhongt via Gcc-patches wrote:
> > * doc/extend.texi (Half-Precision Floating Point): Documemt
> > _Float16 for x86.
>
> > --- a/gcc/doc/extend.texi
> > +++ b/gcc/doc/exte
On Sun, Sep 5, 2021 at 5:54 AM H.J. Lu via Gcc-patches
wrote:
>
> Add non-destructive source alternative to @xorsign3_1 for AVX.
LGTM.
>
> gcc/
>
> PR target/89984
> * config/i386/i386-expand.c (ix86_split_xorsign): Use operands[2].
> * config/i386/i386.md (@xorsign3_1): Ad
On Sun, Sep 5, 2021 at 5:56 AM H.J. Lu via Gcc-patches
wrote:
>
> Enable FMA in scalar/vector unsigned SI to SF expanders.
>
> gcc/
>
> PR target/85819
> * config/i386/i386-expand.c (ix86_expand_convert_uns_sisf_sse):
> Enable FMA.
> (ix86_expand_vector_convert_uns_
On Mon, Sep 6, 2021 at 4:46 PM liuhongt via Gcc-patches
wrote:
>
> Hi:
> As discussed in [1], most of (currently unopposed) targets want
> auto-vectorization at O2, and IMHO now would be a good time to enable O2
> vectorization for GCC trunk, so it would leave enough time to expose
> related iss
On Mon, Sep 6, 2021 at 2:54 PM Richard Biener
wrote:
>
> On Thu, Sep 2, 2021 at 7:54 PM Richard Sandiford
> wrote:
> >
> > Hongtao Liu via Gcc-patches writes:
> > > On Wed, Sep 1, 2021 at 8:52 PM Richard Sandiford
> > > wrote:
> > >>
> >
On Mon, Sep 6, 2021 at 5:19 PM Richard Biener via Gcc-patches
wrote:
>
> On Mon, Sep 6, 2021 at 10:47 AM liuhongt via Gcc-patches
> wrote:
> >
> > Hi:
> > As discussed in [1], most of (currently unopposed) targets want
> > auto-vectorization at O2, and IMHO now would be a good time to enable O2
On Mon, Sep 6, 2021 at 6:03 PM Richard Biener via Gcc-patches
wrote:
>
> On Mon, 6 Sep 2021, liuhongt wrote:
>
> > Hi:
> > For the conversion from _Float16 to int, if the corresponding optab
> > does not exist, the compiler will try the wider mode (SFmode here),
> > but when floatsfsi exists but
On Mon, Sep 6, 2021 at 5:42 PM Jakub Jelinek via Gcc-patches
wrote:
>
> On Mon, Sep 06, 2021 at 11:18:47AM +0200, Richard Biener wrote:
> > On Mon, Sep 6, 2021 at 10:47 AM liuhongt via Gcc-patches
> > wrote:
> > >
> > > Hi:
> > > As discussed in [1], most of (currently unopposed) targets want
>
On Mon, Sep 6, 2021 at 7:01 PM Jakub Jelinek wrote:
>
> On Mon, Sep 06, 2021 at 06:58:37PM +0800, Hongtao Liu wrote:
> > > process_options would mean it affects only the command line and not
> > > __attribute__((optimize ("O2", "ftree-vectorize")))
On Mon, Sep 6, 2021 at 8:35 PM H.J. Lu wrote:
>
> On Sun, Sep 5, 2021 at 10:34 PM Hongtao Liu wrote:
> >
> > On Sun, Sep 5, 2021 at 5:56 AM H.J. Lu via Gcc-patches
> > wrote:
> > >
> > > Enable FMA in scalar/vector unsigned SI to SF expanders.
>
On Mon, Sep 6, 2021 at 8:13 PM Jakub Jelinek wrote:
>
> On Mon, Sep 06, 2021 at 10:05:00AM +0800, Hongtao Liu wrote:
>
> > @@ -1076,9 +1076,11 @@ systems where @code{__float128} is supported.
> > The @code{_Float32}
> > type is supported on all systems supporting IE
On Tue, Sep 7, 2021 at 12:37 AM Joseph Myers wrote:
>
> On Mon, 6 Sep 2021, liuhongt via Gcc-patches wrote:
>
> > Hi:
> > As discussed in [1], most of (currently unopposed) targets want
> > auto-vectorization at O2, and IMHO now would be a good time to enable O2
> > vectorization for GCC trunk,
On Tue, Sep 7, 2021 at 3:18 PM Jakub Jelinek wrote:
>
> On Tue, Sep 07, 2021 at 09:52:57AM +0800, Hongtao Liu wrote:
> > Adjust the wording for x86 _Float16 type.
> >
> > gcc/ChangeLog:
> >
> > * doc/extend.texi: (@node Floating Types): Adjust the wordin
On Tue, Sep 7, 2021 at 8:29 PM Jakub Jelinek via Gcc-patches
wrote:
>
> On Mon, Sep 06, 2021 at 10:58:53AM +0200, Jakub Jelinek via Gcc-patches wrote:
> > On Mon, Sep 06, 2021 at 08:49:27AM +0100, Iain Sandoe wrote:
> > > > Ok. The *.ver changes are still needed (see above), but that can be
> >
On Wed, Sep 8, 2021 at 7:20 AM Segher Boessenkool
wrote:
>
> On Fri, Sep 03, 2021 at 05:05:47PM +0200, Andreas Schwab wrote:
> > On Sep 02 2021, Segher Boessenkool wrote:
> > > On Tue, Aug 31, 2021 at 07:17:49PM +0800, liuhongt via Gcc-patches wrote:
> > >>* emit-rtl.c (validate_subreg): Get r
On Wed, Sep 8, 2021 at 8:54 AM Hongtao Liu wrote:
>
> On Tue, Sep 7, 2021 at 8:29 PM Jakub Jelinek via Gcc-patches
> wrote:
> >
> > On Mon, Sep 06, 2021 at 10:58:53AM +0200, Jakub Jelinek via Gcc-patches
> > wrote:
> > > On Mon, Sep 06, 2021 at 08:49:27AM +0
On Wed, Jul 21, 2021 at 3:43 PM liuhongt wrote:
>
> Hi:
> As discussed in [1], this patch support _Float16 under target sse2
> and above, w/o avx512fp16, _Float16 type is storage only, all operations
> are emulated by soft-fp and float instructions. Soft-fp keeps the intermediate
> result of the
On Wed, Sep 8, 2021 at 10:54 AM Hongtao Liu wrote:
>
> On Wed, Jul 21, 2021 at 3:43 PM liuhongt wrote:
> >
> > Hi:
> > As discussed in [1], this patch support _Float16 under target sse2
> > and above, w/o avx512fp16, _Float16 type is storage only, all operations
&
On Wed, Sep 8, 2021 at 3:43 PM Jakub Jelinek via Gcc-patches
wrote:
>
> Hi!
>
> As the testcase shows, we miscompile @xorsign3_1 if both input
> operands are in the same register, because the splitter overwrites op1
> before with op1 & mask before using op0.
>
> For dest = xorsign op0, op0 we can
On Wed, Sep 8, 2021 at 5:09 PM Jakub Jelinek wrote:
>
> On Wed, Sep 08, 2021 at 10:37:17AM +0800, Hongtao Liu wrote:
> > Bootstrapped and regtested on x86_64-linux-gnu{-m32,}.
> > Ok for trunk?
> >
> > libgcc/ChangeLog:
> >
> > * config/
On Wed, Sep 8, 2021 at 5:33 PM Jakub Jelinek wrote:
>
> On Wed, Sep 08, 2021 at 05:23:40PM +0800, Hongtao Liu wrote:
> > > Bootstrapped/regtested on x86_64-linux and i686-linux, ok for trunk?
> > >
> > Patch LGTM.
>
> Thanks, committed.
>
> > PS:
&
On Wed, Sep 8, 2021 at 6:02 PM Jakub Jelinek wrote:
>
> On Wed, Sep 08, 2021 at 06:00:50PM +0800, Hongtao Liu wrote:
> > Yes, I think so.
> > And I find paradoxical subreg like (subreg:V4SF (reg:SF)) are not
> > allowed by validate_subreg until r11-621.
> > That
On Thu, Sep 9, 2021 at 3:17 AM Segher Boessenkool
wrote:
>
> On Wed, Sep 08, 2021 at 08:39:31PM +0200, Richard Biener wrote:
> > On September 8, 2021 7:08:09 PM GMT+02:00, Segher Boessenkool
> > wrote:
> > >It is not a good idea to do allow all those things. Most backends can
> > >only support
On Thu, Sep 9, 2021 at 3:17 AM Segher Boessenkool
wrote:
>
> On Wed, Sep 08, 2021 at 08:39:31PM +0200, Richard Biener wrote:
> > On September 8, 2021 7:08:09 PM GMT+02:00, Segher Boessenkool
> > wrote:
> > >It is not a good idea to do allow all those things. Most backends can
> > >only support
On Thu, Sep 9, 2021 at 11:21 AM H.J. Lu via Gcc-patches
wrote:
>
> 1. Add TARGET_AVX256_MOVE_BY_PIECES to perform move by-pieces operation
> with 256-bit AVX instructions.
> 2. Add TARGET_AVX256_STORE_BY_PIECES to perform move and store by-pieces
> operations with 256-bit AVX instructions.
>
> The
On Thu, Jul 1, 2021 at 2:17 PM liuhongt wrote:
>
> gcc/ChangeLog:
>
> * config.gcc: Add avx512fp16vlintrin.h.
> * config/i386/avx512fp16intrin.h: (_mm512_add_ph): New intrinsic.
> (_mm512_mask_add_ph): Likewise.
> (_mm512_maskz_add_ph): Likewise.
> (_mm512_s
On Thu, Sep 9, 2021 at 3:54 PM liuhongt wrote:
>
> Hi:
> As a follow up of [1], the patch removes all scalar mode copysign related
> post_reload splitter/define_insn and expand copysign directly into below using
> paradoxical subregs.
[1] https://gcc.gnu.org/pipermail/gcc-patches/2021-September
On Thu, Sep 9, 2021 at 11:31 PM H.J. Lu wrote:
>
> On Wed, Jul 21, 2021 at 12:44 AM liuhongt wrote:
> >
> > From: "H.J. Lu"
> >
> > Copied from regular XMM ABI tests. Only run AVX512FP16 ABI tests for ELF
> > targets.
> >
> > gcc/testsuite/ChangeLog:
> >
> > * gcc.target/x86_64/abi/avx51
On Fri, Sep 10, 2021 at 7:49 AM Segher Boessenkool
wrote:
>
> On Thu, Sep 09, 2021 at 08:16:16AM +0200, Richard Biener wrote:
> > > I think we should (longer term) get rid of the overloaded meanings and
> > > uses of subregs. One fairly simple thing is to make a new rtx code
> > > "bit_cast" (or
On Thu, Sep 9, 2021 at 4:00 PM Hongtao Liu wrote:
>
> On Thu, Sep 9, 2021 at 3:54 PM liuhongt wrote:
> >
> > Hi:
> > As a follow up of [1], the patch removes all scalar mode copysign related
> > post_reload splitter/define_insn and expand copysign directly into be
On Thu, Jul 1, 2021 at 2:17 PM liuhongt wrote:
>
> From: "H.J. Lu"
>
> gcc/ChangeLog:
>
> * config/i386/i386-expand.c
> (ix86_avx256_split_vector_move_misalign): Handle V16HF mode.
> * config/i386/i386.c
> (ix86_preferred_simd_mode): Handle HF mode.
> * con
On Fri, Sep 10, 2021 at 6:54 PM Richard Biener
wrote:
>
> On Fri, Sep 10, 2021 at 5:03 AM Hongtao Liu wrote:
> >
> > On Fri, Sep 10, 2021 at 7:49 AM Segher Boessenkool
> > wrote:
> > >
> > > On Thu, Sep 09, 2021 at 08:16:16AM +0200, Richard Biener wrote
On Fri, Sep 10, 2021 at 7:25 PM Hongtao Liu wrote:
>
> On Fri, Sep 10, 2021 at 6:54 PM Richard Biener
> wrote:
> >
> > On Fri, Sep 10, 2021 at 5:03 AM Hongtao Liu wrote:
> > >
> > > On Fri, Sep 10, 2021 at 7:49 AM Segher Boessenkool
> > > wrot
On Fri, Sep 10, 2021 at 9:16 PM Richard Biener via Gcc-patches
wrote:
>
> On Fri, Sep 10, 2021 at 2:58 PM liuhongt wrote:
> >
> > gcc/ChangeLog:
> >
> > * expmed.c (extract_bit_field_using_extv): validate_subreg
> > before call gen_lowpart.
> > ---
> > gcc/expmed.c | 6 +-
> >
On Fri, Sep 10, 2021 at 9:27 PM Hongtao Liu wrote:
>
> On Fri, Sep 10, 2021 at 9:16 PM Richard Biener via Gcc-patches
> wrote:
> >
> > On Fri, Sep 10, 2021 at 2:58 PM liuhongt wrote:
> > >
> > > gcc/ChangeLog:
> > >
> > > *
On Fri, Sep 10, 2021 at 9:32 PM Richard Biener
wrote:
>
> On September 10, 2021 3:27:09 PM GMT+02:00, Hongtao Liu
> wrote:
> >On Fri, Sep 10, 2021 at 9:16 PM Richard Biener via Gcc-patches
> > wrote:
> >>
> >> On Fri, Sep 10, 2021 at 2:58 PM liuho
On Fri, Sep 10, 2021 at 9:32 PM Richard Biener
wrote:
>
> On September 10, 2021 3:27:09 PM GMT+02:00, Hongtao Liu
> wrote:
> >On Fri, Sep 10, 2021 at 9:16 PM Richard Biener via Gcc-patches
> > wrote:
> >>
> >> On Fri, Sep 10, 2021 at 2:58 PM liuho
On Fri, Sep 10, 2021 at 10:08 PM David Edelsohn wrote:
>
> On Thu, Sep 9, 2021 at 11:03 PM Hongtao Liu wrote:
> >
> > On Fri, Sep 10, 2021 at 7:49 AM Segher Boessenkool
> > wrote:
> > >
> > > On Thu, Sep 09, 2021 at 08:16:16AM +0200, Richard Biener
On Fri, Sep 10, 2021 at 9:44 PM Hongtao Liu wrote:
>
> On Fri, Sep 10, 2021 at 9:32 PM Richard Biener
> wrote:
> >
> > On September 10, 2021 3:27:09 PM GMT+02:00, Hongtao Liu
> > wrote:
> > >On Fri, Sep 10, 2021 at 9:16 PM Richard Biener via Gcc-patches
&g
On Sat, Sep 11, 2021 at 5:21 AM Segher Boessenkool
wrote:
>
> On Fri, Sep 10, 2021 at 10:25:45PM +0800, Hongtao Liu wrote:
> > Updated patch.
> >
> > Bootstrapped and regtested on x86_64-linux-gnu{-m32,}, do I need to
> > run this patch on other targets machin
On Sat, Sep 11, 2021 at 8:29 AM Hongtao Liu wrote:
>
> On Sat, Sep 11, 2021 at 5:21 AM Segher Boessenkool
> wrote:
> >
> > On Fri, Sep 10, 2021 at 10:25:45PM +0800, Hongtao Liu wrote:
> > > Updated patch.
> > >
> > > Bootstrapped and reg
On Sat, Sep 11, 2021 at 4:25 PM Richard Biener via Gcc-patches
wrote:
>
> On September 10, 2021 11:27:16 PM GMT+02:00, Segher Boessenkool
> wrote:
> >On Fri, Sep 10, 2021 at 08:36:12PM +0200, Richard Biener wrote:
> >> On September 10, 2021 6:24:50 PM GMT+02:00, Segher Boessenkool
> >> wrote:
On Sat, Sep 11, 2021 at 5:51 PM Hongtao Liu wrote:
>
> On Sat, Sep 11, 2021 at 4:25 PM Richard Biener via Gcc-patches
> wrote:
> >
> > On September 10, 2021 11:27:16 PM GMT+02:00, Segher Boessenkool
> > wrote:
> > >On Fri, Sep 10, 2021 at 08:36:12PM +0
On Mon, Sep 13, 2021 at 2:11 PM Richard Biener via Gcc-patches
wrote:
>
> On Fri, Sep 10, 2021 at 2:58 PM liuhongt wrote:
> >
> > gcc/ChangeLog:
> >
> > * expmed.c (extract_bit_field_using_extv): validate_subreg
> > before call gen_lowpart.
> > ---
> > gcc/expmed.c | 6 +-
> >
On Mon, Sep 13, 2021 at 5:15 PM Richard Biener
wrote:
>
> On Mon, Sep 13, 2021 at 8:26 AM Hongtao Liu wrote:
> >
> > On Mon, Sep 13, 2021 at 2:11 PM Richard Biener via Gcc-patches
> > wrote:
> > >
> > > On Fri, Sep 10, 2021 at 2:58 PM liuh
On Mon, Sep 13, 2021 at 10:10 PM Jeff Law via Gcc-patches
wrote:
>
>
>
> On 9/9/2021 10:36 PM, liuhongt via Gcc-patches wrote:
> >Currently for (vec_concat:M (vec_select op0 idx1)(vec_select op0 idx2)),
> > optimizer wouldn't simplify if op0 has different mode with M, but that's too
> > restri
On Mon, Sep 13, 2021 at 10:10 PM Jeff Law via Gcc-patches
wrote:
>
>
>
> On 9/9/2021 10:36 PM, liuhongt via Gcc-patches wrote:
> >Currently for (vec_concat:M (vec_select op0 idx1)(vec_select op0 idx2)),
> > optimizer wouldn't simplify if op0 has different mode with M, but that's too
> > restri
On Tue, Sep 14, 2021 at 8:58 AM Andrew Pinski wrote:
>
> On Wed, Sep 8, 2021 at 2:55 AM Hongtao Liu via Gcc-patches
> wrote:
> >
> > On Wed, Sep 8, 2021 at 5:33 PM Jakub Jelinek wrote:
> > >
> > > On Wed, Sep 08, 2021 at 05:23:40PM +0800, Hongtao Liu wro
On Tue, Sep 14, 2021 at 10:06 AM Hongtao Liu wrote:
>
> On Tue, Sep 14, 2021 at 8:58 AM Andrew Pinski wrote:
> >
> > On Wed, Sep 8, 2021 at 2:55 AM Hongtao Liu via Gcc-patches
> > wrote:
> > >
> > > On Wed, Sep 8, 2021 at 5:33 PM Jakub Jelinek wrote:
i'm going to commit 8 patches:
[PATCH 16/62] AVX512FP16: Add vsqrtph/vrsqrtph/vsqrtsh/vrsqrtsh.
[PATCH 17/62] AVX512FP16: Add testcase for vsqrtph/vsqrtsh/vrsqrtph/vrsqrtsh.
[PATCH 18/62] AVX512FP16: Add vrcpph/vrcpsh/vscalefph/vscalefsh.
[PATCH 19/62] AVX512FP16: Add testcase for vrcpph/vrcpsh/vs
I'm going to check in 6 patches
[PATCH 24/62] AVX512FP16: Add vmovw/vmovsh.
[PATCH 25/62] AVX512FP16: Add testcase for vmovsh/vmovw.
[PATCH 26/62] AVX512FP16: Add
vcvtph2dq/vcvtph2qq/vcvtph2w/vcvtph2uw/vcvtph2uqq/vcvtph2udq
[PATCH 27/62] AVX512FP16: Add testcase for
vcvtph2w/vcvtph2uw/vcvtph2dq/vc
On Thu, Sep 16, 2021 at 4:23 PM Richard Biener via Gcc-patches
wrote:
>
> On Thu, 16 Sep 2021, liuhongt wrote:
>
> > Ping
> > rebased on latest trunk.
> >
> > gcc/ChangeLog:
> >
> > * common.opt (ftree-vectorize): Add Var(flag_tree_vectorize).
> > * doc/invoke.texi (Options That Contro
On Thu, Sep 16, 2021 at 8:31 PM Richard Biener wrote:
>
> On Thu, 16 Sep 2021, Hongtao Liu wrote:
>
> > On Thu, Sep 16, 2021 at 4:23 PM Richard Biener via Gcc-patches
> > wrote:
> > >
> > > On Thu, 16 Sep 2021, liuhongt wrote:
> > &
On Fri, Sep 17, 2021 at 3:47 PM Richard Biener wrote:
>
> On Fri, 17 Sep 2021, Hongtao Liu wrote:
>
> > On Thu, Sep 16, 2021 at 8:31 PM Richard Biener wrote:
> > >
> > > On Thu, 16 Sep 2021, Hongtao Liu wrote:
> > >
> > > > On Thu,
I'm going to check in 10 patches.
[PATCH 30/62] AVX512FP16: Add vcvtsh2si/vcvtsh2usi/vcvtsi2sh/vcvtusi2sh.
[PATCH 31/62] AVX512FP16: Add testcase for
vcvtsh2si/vcvtsh2usi/vcvtsi2sh/vcvtusi2sh.
[PATCH 32/62] AVX512FP16: Add
vcvttph2w/vcvttph2uw/vcvttph2dq/vcvttph2qq/vcvttph2udq/vcvttph2uqq
[PATCH 3
On Sat, Sep 18, 2021 at 7:50 AM Jakub Jelinek via Gcc-patches
wrote:
>
> On Fri, Sep 17, 2021 at 08:35:57AM +0200, Uros Bizjak via Gcc-patches wrote:
> > > > On Wed, Sep 15, 2021 at 10:10 AM wrote:
> > > > >
> > > > > From: "H.J. Lu"
> > > > >
> > > > > Check TARGET_USE_VECTOR_FP_CONVERTS or
> >
Hi:
I'm going to check the 9 patches below.
Bootstrapped and regtest on x86_64-pc-linux-gnu{-m32,}.
Newly added testcase passed on sde{-m32,}.
[PATCH 40/62] AVX512FP16: Add vfmaddsub[132, 213,
231]ph/vfmsubadd[132, 213, 231]ph.
[PATCH 41/62] AVX512FP16: Add testcase for vfmaddsub[132, 213,
2
On Sat, Sep 18, 2021 at 3:31 PM Jakub Jelinek wrote:
>
> On Sat, Sep 18, 2021 at 11:09:32AM +0800, liuhongt wrote:
> > Besides conversion instructions, pass_rpad also handles scalar
> > sqrt/rsqrt/rcp/round instructions, while r12-3614 should only want to
> > handle conversion instructions, so fix
On Sunday, September 19, 2021, Iain Sandoe
wrote:
> Hi Hongtao,
>
> > On 8 Sep 2021, at 10:31, Hongtao Liu wrote:
> >
> > On Wed, Sep 8, 2021 at 5:09 PM Jakub Jelinek wrote:
> >>
> >> On Wed, Sep 08, 2021 at 10:37:17AM +0800, Hongtao Liu wrote:
> &g
On Mon, Sep 20, 2021 at 4:13 AM Martin Sebor wrote:
>
> On 9/16/21 3:03 AM, Hongtao Liu via Gcc-patches wrote:
> > On Thu, Sep 16, 2021 at 4:23 PM Richard Biener via Gcc-patches
> > wrote:
> >>
> >> On Thu, 16 Sep 2021, liuhongt wrote:
> >
On Tue, Sep 21, 2021 at 10:55 PM H.J. Lu wrote:
>
> On Mon, Sep 20, 2021 at 5:15 AM Richard Biener via Gcc-patches
> wrote:
> >
> > This allows vectorization (in practice non-loop vectorization) to
> > have a stmt participate in different vector type vectorizations.
> > It allows us to remove vec
801 - 900 of 1392 matches
Mail list logo