[committed] CRIS: Add new peephole2 "lra_szext_decomposed_indir_plus"

2024-09-03 Thread Hans-Peter Nilsson
I thought I had already committed this, but it looks like it was left dangling when the make_more_copies patch (now committed) was in limbo and I disabled late-combine for (coremark) performance reasons. FWIW that's still a reason at r15-3386-gaf1500dd8c00 (2.6% regression). Tested cris-elf with/

[PATCH] i386: Support partial signbit/xorsign/copysign/abs/neg/and/xor/ior/andn for V2BF/V4BF

2024-09-03 Thread Levy Hsu
Hi This patch adds support for bf16 operations in V2BF and V4BF modes on i386, handling signbit, xorsign, copysign, abs, neg, and various logical operations. Bootstrapped and tested on x86-64-pc-linux-gnu. Ok for trunk? gcc/ChangeLog: * config/i386/i386.cc (ix86_build_const_vector): Ad

[PATCH] i386: Support partial vectorized FMA for V2BF/V4BF

2024-09-03 Thread Levy Hsu
Hi Bootstrapped and tested on x86-64-pc-linux-gnu. Ok for trunk? This patch introduces support for vectorized FMA operations for bf16 types in V2BF and V4BF modes on the i386 architecture. New mode iterators and define_expand entries for fma, fnma, fms, and fnms operations are added in mmx.md, e

Re: [PATCH] expand: Add dump for costing of positive divides

2024-09-03 Thread Richard Biener
> Am 04.09.2024 um 04:00 schrieb Andrew Pinski : > > While trying to understand PR 115910 I found it was useful to print out > the two costs of doing a signed and unsigned division just like was added in > r15-3272-g3c89c41991d8e8 for popcount==1. > > Bootstrapped and tested on x86_64-linux-g

[PATCH] Match: Fix ordered and nonequal

2024-09-03 Thread Hu, Lin1
Hi, all This patch is a fix patch. Need to add :c for bit_and, because bit_and is commutative. And is (ltgt @0 @1) is simpler than (bit_not (uneq @0 @1)). Bootstrapped/regtested on x86-64-pc-linux-gnu, OK for trunk? BRs, Lin gcc/ChangeLog: * match.pd: Fix match for (bit_and (ordered @

Re: [PATCH 1/2] split-paths: Move check for # of statements in join earlier

2024-09-03 Thread Kyrylo Tkachov
Hi Andrew, > On 3 Sep 2024, at 20:11, Andrew Pinski wrote: > > External email: Use caution opening links or attachments > > > This moves the check for # of statements to copy in join to > be the first check. This check is the cheapest check so it > should be first. Plus add a print to the dump

[PATCH v2] testsuite: introduce hostedlib effective target

2024-09-03 Thread Alexandre Oliva
On Nov 9, 2023, Mike Stump wrote: > On Nov 8, 2023, at 8:29 AM, Alexandre Oliva wrote: >> >> On Nov 5, 2023, Mike Stump wrote: >> >>> that, otherwise, I'll approve this version. >> >> FWIW, this version is not usable as is. Something went wrong > Updates and fixes to the original plan ar

Re: [PATCH] lto: Don't check obj.found for offload section

2024-09-03 Thread Richard Biener
On Tue, Sep 3, 2024 at 5:44 PM H.J. Lu wrote: > > On Fri, Aug 23, 2024 at 5:50 AM Richard Biener > wrote: > > > > On Fri, Aug 23, 2024 at 2:36 PM H.J. Lu wrote: > > > > > > obj.found is the number of LTO symbols. We should include the offload > > > section when it is used by linker even if ther

<    1   2