Re: [PATCH v4, rs6000] Add a combine pattern for CA minus one [PR95737]

2022-05-12 Thread Kewen.Lin via Gcc-patches
y for trunk? Any recommendations? Thanks a lot. > > ChangeLog > 2022-05-13 Haochen Gui > > gcc/ > PR target/95737 > * config/rs6000/rs6000.md (extenddi_ca_minus_one): Define. > Nit: (*extenddi_ca_minus_one): New define_insn_and_split. > gcc/tests

Re: [PATCH V2] powerpc: properly check for feenableexcept() on FreeBSD

2022-05-12 Thread Kewen.Lin via Gcc-patches
;t. But I'm not sure if it's the same as what Piotr's environments have. Hope it's similar. :-) [1] https://gcc.gnu.org/pipermail/gcc-patches/2022-April/593193.html [2] https://github.com/freebsd/freebsd-src/blob/main/lib/msun/powerpc/fenv.h [3] https://github.com/freebsd/freebsd-src/blob/main/lib/msun/x86/fenv.h BR, Kewen

[PATCH] rs6000: Handle unresolved overloaded builtin [PR105485]

2022-05-12 Thread Kewen.Lin via Gcc-patches
sion phase like "unresolved overload for builtin ...". Bootstrapped and regtested on powerpc64-linux-gnu P8 and powerpc64le-linux-gnu P9 and P10. Is it ok for trunk? BR, Kewen - PR target/105485 gcc/ChangeLog: * config/rs6000/rs6000-builtin.cc (rs6000_gimpl

[PATCH] Optimize vpermtiw/b to vpunpcklqdq for certain cases.

2022-05-13 Thread liuhongt via Gcc-patches
- .value 2 - .value 3 - .value 8 - .value 9 - .value 10 - .value 11 Bootstrapped and regtested on x86_64-pc-linux-gnu{-m32,}. Ok for trunk? gcc/ChangeLog: PR target/105033 * config/i386/sse.md (*vec_concatv4si): Extend to .. (*vec_concat

[PATCH v2] Optimize vpermtiw/b to vpunpcklqdq for certain cases.

2022-05-13 Thread liuhongt via Gcc-patches
%xmm0 + vpunpcklqdq %xmm1, %xmm0, %xmm0 ... -.LC0: - .value 0 - .value 1 - .value 2 - .value 3 - .value 8 - .value 9 - .value 10 - .value 11 gcc/ChangeLog: PR target/105033 * config/i386/sse.md (*vec_concatv4si): Exte

[PATCH] [i386] Fix ICE caused by wrong condition.

2022-05-13 Thread liuhongt via Gcc-patches
When d->perm[i] == d->perm[i-1] + 1 and d->perm[i] == nelt, it's not continuous. It should fail if there's more than 2 continuous areas. Bootstrapped and regtested on x86_64-pc-linux-gnu{-m32,}. Ok for trunk? gcc/ChangeLog: PR target/105587 * config/

Re: [PATCH v5, rs6000] Add a combine pattern for CA minus one [PR95737]

2022-05-15 Thread Kewen.Lin via Gcc-patches
y for trunk? Any recommendations? Thanks a lot. > > ChangeLog > 2022-05-16 Haochen Gui > > gcc/ > PR target/95737 > * config/rs6000/rs6000.md (subfsi3_carry_in_xx_64): New. (subfsi3_carry_in_xx_64) -> (*subfsi3_carry_in_xx_64) Sorry for nit-picking

[PATCH] Clamp vec_perm_expr index in simplify_bitfield_ref to avoid ICE.

2022-05-16 Thread liuhongt via Gcc-patches
Bootstrapped and regtested on x86_64-pc-linux-gnu{-m32,} Ok for trunk? gcc/ChangeLog: PR tree-optimization/105591 * tree-ssa-forwprop.cc (simplify_bitfield_ref): Clamp vec_perm_expr index. gcc/testsuite/ChangeLog: * gcc.dg/pr105591.c: New test. --- gcc

[committed] d: Merge upstream dmd 60bfa0ee7, druntime 94bd5bcb, phobos 3a1cd9a01.

2022-05-16 Thread ibuclaw--- via Gcc-patches
x86_64-linux-gnu/-m32/-mx32, and committed to mainline. Regards, Iain. --- gcc/d/ChangeLog: * dmd/MERGE: Merge upstream dmd 60bfa0ee7. * dmd/VERSION: Update version to v2.100.0. * d-builtins.cc (d_init_versions): Update for new front-end interface. * d-

[PATCH] [i386] recognize bzhi pattern when there's zero_extendsidi.

2022-05-16 Thread liuhongt via Gcc-patches
xtend:DI (and:SI (plus:SI (ashift:SI (const_int 1 [0x1]) 428(subreg:QI (reg:SI 98) 0)) 429(const_int -1 [0x])) 430(subreg:SI (reg:DI 95) 0 431(clobber (reg:CC 17 flags)) 432]) Add new define_insn

[PATCH v3] rs6000: Fix the check of bif argument number [PR104482]

2022-05-18 Thread Kewen.Lin via Gcc-patches
P9 and P10. v3: Update test case with dg-excess-errors. v2: Add one test case and refine commit logs. https://gcc.gnu.org/pipermail/gcc-patches/2022-April/593155.html v1: https://gcc.gnu.org/pipermail/gcc-patches/2022-March/591768.html Is it ok for trunk? BR, Kewen - PR target

[PATCH v3] rs6000: Adjust mov optabs for opaque modes [PR103353]

2022-05-18 Thread Kewen.Lin via Gcc-patches
ed on powerpc64-linux-gnu P8 and powerpc64le-linux-gnu P9 and P10. v3: Update test case with dg-excess-errors. v2: Polish some comments and add one test case as Will and Peter suggested. https://gcc.gnu.org/pipermail/gcc-patches/2022-April/592916.html v1: https://gcc.gnu.org/pipermail/gcc-pa

[PATCH, committed] testsuite/rs6000: Move pr83660.C to g++.target

2022-05-18 Thread Kewen.Lin via Gcc-patches
Hi, Move pr83660.C to g++.target. As comment #3 of PR83660, renaming it to c isn't one option. This test case isn't tested before, so this brings back the test coverage. Committed as r13-619-g297a69068ddfe2. BR, Kewen - gcc/testsuite/ChangeLog: * gcc.target/powerpc

[r13-456 Regression] FAIL: g++.dg/tsan/pr88018.C -O0 (test for excess errors) on Linux/x86_64

2022-05-18 Thread skpandey--- via Gcc-patches
(test for excess errors) with GCC configured with ../../gcc/configure --prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master/master/r13-456/usr --enable-clocale=gnu --with-system-zlib --with-demangler-in-ld --with-fpmath=sse --enable-languages=c,c++,fortran --enable-cet --without-isl --enable

Patch structure and timing for re-adding gcj

2022-05-19 Thread Zopolis0 via Gcc-patches
Over the past few months I have been working on re-adding gcj to gcc ( https://github.com/Zopolis4/gcj/tree/mster). It is now at the point where all of the code in gcc/java and the related additions to other gcc files compiles successfully, although libjava still breaks. Given that I do not know

[PATCH] Increase move cost between mask and gpr.

2022-05-19 Thread liuhongt via Gcc-patches
ted on x86_64-pc-linux-gnu{-m32,}. Ready to push to trunk. gcc/ChangeLog: * config/i386/x86-tune-costs.h (skylake_cost): Increase gpr <-> mask cost from 5 to 6. (icelake_cost): Ditto. gcc/testsuite/ChangeLog: * gcc.target/i386/spill_to_mask-1.c: New test. ---

Re: [PATCH] rs6000: add support for sanitizers on FreeBSD

2022-05-22 Thread Kewen.Lin via Gcc-patches
Hi Piotr, on 2022/5/23 05:36, Piotr Kubaj wrote: > Ping. > > FreeBSD has already enabled sanitizers in its base system in > https://cgit.freebsd.org/src/commit/?id=f5024381ac16ba43d37a8bd32d54c27f6a6afa66 > The upstream (LLVM) patch was reviewed by a couple of people already. > > Our llvm-devel

[r13-707 Regression] FAIL: gcc.target/i386/pr45685.c scan-assembler-times cmov 6 on Linux/x86_64

2022-05-23 Thread skpandey--- via Gcc-patches
/pr45685.c scan-assembler-times cmov 6 with GCC configured with ../../gcc/configure --prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master/master/r13-707/usr --enable-clocale=gnu --with-system-zlib --with-demangler-in-ld --with-fpmath=sse --enable-languages=c,c++,fortran --enable-cet

[PATCH, committed] rs6000: Skip debug insns for union [PR105627]

2022-05-23 Thread Kewen.Lin via Gcc-patches
ws up in function union_defs. PR target/105627 gcc/ChangeLog: * config/rs6000/rs6000-p8swap.cc (union_defs): Assert def_insn can't be a debug insn. (union_uses): Skip debug use_insn. gcc/testsuite/ChangeLog: * gcc.target/powerpc/pr105627.c: New test

[PATCH, committed] testsuite/rs6000: Adjust gcc.target/powerpc/pr78604.c [PR105706]

2022-05-23 Thread Kewen.Lin via Gcc-patches
in one more vect_model_simple_cost dumping for each function. Need to adjust the match count accordingly. PR testsuite/105706 gcc/testsuite/ChangeLog: * gcc.target/powerpc/pr78604.c: Adjust. --- gcc/testsuite/gcc.target/powerpc/pr78604.c | 4 +++- 1 file changed, 3 insertions

[r13-726 Regression] FAIL: libgomp.fortran/taskwait-depend-nowait-1.f90 -O execution test on Linux/x86_64

2022-05-24 Thread skpandey--- via Gcc-patches
-1.f90 -O execution test with GCC configured with ../../gcc/configure --prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master/master/r13-726/usr --enable-clocale=gnu --with-system-zlib --with-demangler-in-ld --with-fpmath=sse --enable-languages=c,c++,fortran --enable-cet --without-isl

[PATCH] Add a bit dislike for separate mem alternative when op is REG_P.

2022-05-24 Thread liuhongt via Gcc-patches
te 'm' alternative cost is too small, especially considering that the mov cost of gpr are 2(default for REGISTER_MOVE_COST). So this patch increase mem_cost to 2*frequency, also increase 1 for reg_class cost when m alternative. Bootstrapped and regtested on x86_64-pc-linux-gnu{-m32,}. Ok

[PATCH v3] pass: Run cleanup passes before SLP [PR96789]

2020-11-02 Thread Kewen.Lin via Gcc-patches
." is still not good to get it clear. You are right, if the bit set for the outermost loop (its father), it's not guaranteed that we will do the propagation for it (and its children), so it shouldn't clear father_bbs otherwise some expected propagation probably won't happen

PING^2 [PATCH v2] rs6000: Use direct move for char/short vector CTOR [PR96933]

2020-11-02 Thread Kewen.Lin via Gcc-patches
Hi, Gentle ping for the patch: https://gcc.gnu.org/pipermail/gcc-patches/2020-September/553555.html BR, Kewen on 2020/10/13 下午2:59, Kewen.Lin via Gcc-patches wrote: > Hi, > > I'd like to gentle ping this patch: > > https://gcc.gnu.org/pipermail/gcc-patches/2020-S

PING^4 [PATCH 1/4] unroll: Add middle-end unroll factor estimation

2020-11-02 Thread Kewen.Lin via Gcc-patches
Hi, Gentle ping^4 this: https://gcc.gnu.org/pipermail/gcc-patches/2020-May/546698.html BR, Kewen on 2020/10/13 下午3:06, Kewen.Lin via Gcc-patches wrote: > Hi, > > Gentle ping this: > > https://gcc.gnu.org/pipermail/gcc-patches/2020-May/546698.html > > BR, > Kewen

Re: [patch] Fix build when source directory includes @ character

2020-11-02 Thread FX via Gcc-patches
Here is an updated patch, that quotes all three makeinfo special characters: @ { } Tested on a build with source directory /tmp/foo@bar{gee}qux OK to commit? at.diff Description: Binary data at.ChangeLog Description: Binary data

Re: [PATCH v3] pass: Run cleanup passes before SLP [PR96789]

2020-11-02 Thread Kewen.Lin via Gcc-patches
Hi Richard, Thanks again for your review! on 2020/11/2 下午6:23, Richard Sandiford wrote: > "Kewen.Lin" writes: >> diff --git a/gcc/function.c b/gcc/function.c >> index 2c8fa217f1f..3e92ee9c665 100644 >> --- a/gcc/function.c >> +++ b

[PATCH v3] rs6000: Use direct move for char/short vector CTOR [PR96933]

2020-11-02 Thread Kewen.Lin via Gcc-patches
Hi David, Thanks for the review! > The patch looks fine to me, but I'll let Segher decide if it addresses > his requested changes. > > I'm trying to be stricter about the test cases. > > +++ b/gcc/testsuite/gcc.target/powerpc/pr96933-1.c > @@ -0,0 +1,14 @@ > +

[r11-4614 Regression] FAIL: libgomp.oacc-fortran/multidim-slice.f95 -DACC_DEVICE_TYPE_host=1 -DACC_MEM_SHARED=1 -foffload=disable -O3 -g execution test on Linux/x86_64

2020-11-03 Thread sunil.k.pandey via Gcc-patches
-slice.f95 -DACC_DEVICE_TYPE_host=1 -DACC_MEM_SHARED=1 -foffload=disable -O3 -g execution test with GCC configured with ../../gcc/configure --prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master/master/r11-4614/usr --enable-clocale=gnu --with-system-zlib --with-demangler-in-ld --with-

[r11-4664 Regression] FAIL: gcc.dg/guality/pr54519-4.c -O2 -flto -fuse-linker-plugin -fno-fat-lto-objects -DPREVENT_OPTIMIZATION line 22 y == 68 on Linux/x86_64

2020-11-03 Thread sunil.k.pandey via Gcc-patches
-use-linker-plugin -flto-partition=none -DPREVENT_OPTIMIZATION line 22 y == 68 FAIL: gcc.dg/guality/pr54519-4.c -O2 -flto -fuse-linker-plugin -fno-fat-lto-objects -DPREVENT_OPTIMIZATION line 22 y == 68 with GCC configured with ../../gcc/configure --prefix=/local/skpandey/gccwork/toolwork/gcc

Re: [PATCH, rs6000] Optimize pcrel access of globals (updated, ping)

2020-11-04 Thread acsawdey--- via Gcc-patches
that uses that external address. Produced by a cast of thousands: * Michael Meissner * Peter Bergner * Bill Schmidt * Alan Modra * Segher Boessenkool * Aaron Sawdey Passes bootstrap/regtest on ppc64le power10. OK for trunk? gcc/ChangeLog: * config.gcc: Add pcrel-opt.o

Re: [PATCH v3] pass: Run cleanup passes before SLP [PR96789]

2020-11-04 Thread Kewen.Lin via Gcc-patches
Hi Lyon, Thanks for reporting and sorry for the failure. >> The patch was updated as your comments above, re-tested on Power8 >> and committed in r11-4637. >> > > The new test gcc.dg/tree-ssa/pr96789.c fails on arm: > FAIL: gcc.dg/tree-ssa/pr96789.c scan-tree-dump dse3 "Deleted dead store:.*tmp"

Re: [PATCH v3] rs6000: Use direct move for char/short vector CTOR [PR96933]

2020-11-05 Thread Kewen.Lin via Gcc-patches
es not mean the OS (libc, > kernel, etc.) will actually save the full 64-bit registers -- making it > only useful on Darwin currently. > > (You *can* run all of the testsuite flawlessly on Linux with those > options, but that only works because those are small, short-running >

[PATCH] Clean up loop-closed PHIs at loopdone pass

2020-11-05 Thread guojiufu via Gcc-patches
pass_tree_loop_done. This patch passes bootstrap and regtest on ppc64le. Is this ok for trunk? gcc/ChangeLog 2020-10-05 Jiufu Guo * tree-ssa-loop.h (clean_up_loop_closed_phi): New declaration. * tree-ssa-loop.c (tree_ssa_loop_done): Call clean_up_loop_closed_phi. * tree

[r11-4733 Regression] FAIL: gcc.dg/guality/pr54519-4.c -O2 -flto -fuse-linker-plugin -fno-fat-lto-objects -DPREVENT_OPTIMIZATION line 17 y == 25 on Linux/x86_64

2020-11-05 Thread sunil.k.pandey via Gcc-patches
-fat-lto-objects -DPREVENT_OPTIMIZATION line 17 y == 25 with GCC configured with ../../gcc/configure --prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master/master/r11-4733/usr --enable-clocale=gnu --with-system-zlib --with-demangler-in-ld --with-fpmath=sse --enable-languages=c,c++,fortran

[PATCH]ira: recompute regstat as max_regno changes [PR97705]

2020-11-05 Thread Kewen.Lin via Gcc-patches
relevant data structures that is regstat_n_sets_and_refs and reg_info_p to ensure we won't access beyond some array bounds. Bootstrapped/regtested on powerpc64le-linux-gnu P9 and powerpc64-linux-gnu P8. Any thoughts? Is it a reasonable fix? BR, Kewen ----- gcc/ChangeLog: PR rtl-optimiz

[r11-4770 Regression] FAIL: gcc.dg/ipa/modref-2.c scan-ipa-dump modref "Parm 1 param offset:0 offset:0 size:-1 max_size:64" on Linux/x86_64

2020-11-06 Thread sunil.k.pandey via Gcc-patches
"Parm 1 param offset:0 offset:0 size:-1 max_size:64" with GCC configured with ../../gcc/configure --prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master/master/r11-4770/usr --enable-clocale=gnu --with-system-zlib --with-demangler-in-ld --with-fpmath=sse --enable-languages=c,c

[r11-4813 Regression] FAIL: c-c++-common/Wimplicit-fallthrough-20.c -std=gnu++98 (test for warnings, line 36) on Linux/x86_64

2020-11-07 Thread sunil.k.pandey via Gcc-patches
, line 30) FAIL: c-c++-common/Wimplicit-fallthrough-20.c -std=gnu++98 (test for warnings, line 32) FAIL: c-c++-common/Wimplicit-fallthrough-20.c -std=gnu++98 (test for warnings, line 36) with GCC configured with ../../gcc/configure --prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master

Re: [PATCH]ira: recompute regstat as max_regno changes [PR97705]

2020-11-08 Thread Kewen.Lin via Gcc-patches
nd_refs and reg_info_p >> to ensure we won't access beyond some array bounds. >> >> Bootstrapped/regtested on powerpc64le-linux-gnu P9 and >> powerpc64-linux-gnu P8. >> >> Any thoughts?  Is it a reasonable fix? >> > Sure, Kewen.  A bit unexpected to see l

Embedded Real-Time Operating Systems for the IoT Market Analysis: Here's the One Report That You Must Share With Your Management Team

2020-11-09 Thread garemwinmark via Gcc-patches
Hello Have a nice day ! This is JoJo from WMResearch. We recently published a new market research report: < Global Embedded Real-Time Operating Systems for the IoT Market Research Report 2015-2020>and kindly let us know if you are interested in it. If you are interested in getting more det

[r11-4852 Regression] FAIL: g++.dg/ubsan/pr61272.C (test for excess errors) on Linux/x86_64

2020-11-09 Thread sunil.k.pandey via Gcc-patches
errors, line 15) FAIL: g++.dg/ubsan/pr61272.C (test for excess errors) with GCC configured with ../../gcc/configure --prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master/master/r11-4852/usr --enable-clocale=gnu --with-system-zlib --with-demangler-in-ld --with-fpmath=sse --enable

testsuite: Adjust pr96789.c to exclude vect_load_lanes

2020-11-09 Thread Kewen.Lin via Gcc-patches
to teach the case to respect vect_load_lanes, meanwhile to guard the check only under vect_int. Is it ok for trunk? BR, Kewen - gcc/testsuite/ChangeLog: * gcc.dg/tree-ssa/pr96789.c: Adjusted by excluding vect_load_lanes. diff --git a/gcc/testsuite/gcc.dg/tree-ssa/pr96789.c b/gcc

[r11-4883 Regression] FAIL: gfortran.dg/gomp/workshare-reduction-57.f90 -O scan-tree-dump-times optimized "__builtin_GOMP_loop_dynamic_next " 1 on Linux/x86_64

2020-11-10 Thread sunil.k.pandey via Gcc-patches
90 -O scan-tree-dump-times optimized "__builtin_GOMP_doacross_post " 1 FAIL: gfortran.dg/gomp/workshare-reduction-57.f90 -O scan-tree-dump-times optimized "__builtin_GOMP_doacross_wait " 1 FAIL: gfortran.dg/gomp/workshare-reduction-57.f90 -O scan-tree-dump-times op

Re: testsuite: Adjust pr96789.c to exclude vect_load_lanes

2020-11-10 Thread Kewen.Lin via Gcc-patches
ange -ftree-vectorize to -ftree-slp-vectorize? > Or does that defeat the purpose of the test? It works, nice, thanks for the suggestion! I appended one explicit -fno-tree-loop-vectorize to avoid it to fail in case someone kicks off the testing with explicit -ftree-loop-vectorize. The updated v

[r11-4903 Regression] FAIL: gfortran.dg/gomp/workshare-reduction-57.f90 -O scan-tree-dump-times optimized "__builtin_GOMP_loop(?:_ull)_dynamic_next " 1 on Linux/x86_64

2020-11-11 Thread sunil.k.pandey via Gcc-patches
n-57.f90 -O scan-tree-dump-times optimized "__builtin_GOMP_doacross(?:_ull)_post " 1 FAIL: gfortran.dg/gomp/workshare-reduction-57.f90 -O scan-tree-dump-times optimized "__builtin_GOMP_doacross(?:_ull)_wait " 1 FAIL: gfortran.dg/gomp/workshare-reduction-57.f90 -O scan-tree-dump

[r11-4913 Regression] FAIL: 25_algorithms/merge/constrained.cc (test for excess errors) on Linux/x86_64

2020-11-11 Thread sunil.k.pandey via Gcc-patches
/modifiers/insert/const_iterator.cc (test for excess errors) FAIL: 23_containers/vector/types/1.cc (test for excess errors) FAIL: 25_algorithms/merge/constrained.cc (test for excess errors) with GCC configured with ../../gcc/configure --prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master

[r11-4958 Regression] FAIL: 30_threads/future/members/poll.cc execution test on Linux/x86_64

2020-11-12 Thread sunil.k.pandey via Gcc-patches
/members/poll.cc execution test with GCC configured with ../../gcc/configure --prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master/master/r11-4958/usr --enable-clocale=gnu --with-system-zlib --with-demangler-in-ld --with-fpmath=sse --enable-languages=c,c++,fortran --enable-cet --without-isl

[r10-9014 Regression] FAIL: std/ranges/iota/96042.cc (test for excess errors) on Linux/x86_64

2020-11-13 Thread sunil.k.pandey via Gcc-patches
/96042.cc (test for excess errors) with GCC configured with ../../gcc/configure --prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-10/releases/gcc-10/r10-9014/usr --enable-clocale=gnu --with-system-zlib --with-demangler-in-ld --with-fpmath=sse --enable-languages=c,c++,fortran --enable-cet

[PATCH] Add MODE_OPAQUE

2020-11-13 Thread acsawdey--- via Gcc-patches
the inline expansion of memcpy/memmove, allowing me to fix PR 96791. Regstrap in progress on ppc64le and x86_64, ok for trunk if successful? Thanks, Aaron gcc/ChangeLog PR target/96791 * mode-classes.def: Add MODE_OPAQUE. * machmode.def: Add OPAQUE_MODE. * tre

[r11-5029 Regression] FAIL: gcc.dg/guality/pr59776.c -Os -DPREVENT_OPTIMIZATION line pr59776.c:20 s2.f == 5.0 on Linux/x86_64

2020-11-14 Thread sunil.k.pandey via Gcc-patches
/pr59776.c -Os -DPREVENT_OPTIMIZATION line pr59776.c:20 s1.g == 6.0 FAIL: gcc.dg/guality/pr59776.c -Os -DPREVENT_OPTIMIZATION line pr59776.c:20 s2.f == 5.0 with GCC configured with ../../gcc/configure --prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master/master/r11-5029/usr

Re: [PATCH] Add MODE_OPAQUE

2020-11-16 Thread acsawdey--- via Gcc-patches
x86_64 and ppc64le? Thanks, Aaron gcc/ChangeLog PR target/96791 * mode-classes.def: Add MODE_OPAQUE. * machmode.def: Add OPAQUE_MODE. * tree.def: Add OPAQUE_TYPE for types that will use MODE_OPAQUE. * doc/generic.texi: Document OPAQUE_TYPE.

Re: [patch] Fix build when source directory includes @ character

2020-11-17 Thread FX via Gcc-patches
> OK. You have commit privs, right? Yes, and I did commit after Richard’s OK: https://gcc.gnu.org/git/?p=gcc.git;a=commit;h=034db20e2ea8301b5dc251bf10a97ce1cf90655f … but I forgot to send an email saying I had, sorry. FX

[PATCH] asan_test.C: disable -Wstringop-overflow, PR/93058

2020-01-23 Thread slyfox.inbox.ru via gcc-patches
san_test.cc:129:22: error: writing 1 byte into a region of size 0 [-Werror=stringop-overflow=] As there is no easy way to convey pvalloc()'s granularity to gcc let's just disable the warning for this test. * g++.dg/asan/asan_test.C: disable -Wstringop-overflow. --- gcc/testsuite/Ch

[PATCH] Fix target/101934: aarch64 memset code creates unaligned stores for -mstrict-align

2021-08-31 Thread apinski--- via Gcc-patches
From: Andrew Pinski The problem here is the aarch64_expand_setmem code did not check STRICT_ALIGNMENT if it is creating an overlapping store. This patch adds that check and the testcase works. gcc/ChangeLog: PR target/101934 * config/aarch64/aarch64.c (aarch64_expand_setmem

[PATCH] Add MIPS Linux support to gcc.misc-tests/linkage.c (testsuite/51748)

2021-08-31 Thread apinski--- via Gcc-patches
From: Andrew Pinski This adds MIPS Linux support to gcc.misc-tests/linkage.exp. Basically copying what was done for MIPS IRIX and changing the options to be correct. OK? gcc/testsuite/ChangeLog: PR testsuite/51748 * gcc.misc-tests/linkage.exp: Add mips*-linux-* support

[PATCH] rs6000: Fix some issues in rs6000_can_inline_p [PR102059]

2021-08-31 Thread Kewen.Lin via Gcc-patches
BR, Kewen - gcc/ChangeLog: PR ipa/102059 * config/rs6000/rs6000.c (rs6000_can_inline_p): Adjust with target_option_default_node and consider always_inline_safe flags. gcc/testsuite/ChangeLog: PR ipa/102059 * gcc.target/powerpc/pr102059-1.c

[PATCH] rs6000: Remove useless toc-fusion option

2021-08-31 Thread Kewen.Lin via Gcc-patches
Hi! Option toc-fusion was intended for Power9 toc fusion previously, but Power9 doesn't support fusion at all eventually, this patch is to remove this useless option. Is it ok for trunk? BR, Kewen ----- gcc/ChangeLog: * config/rs6000/rs6000.opt (-mtoc-fusion): Remove. --- gcc/c

[RFC/PATCH] ipa-inline: Add target info into fn summary [PR102059]

2021-09-01 Thread Kewen.Lin via Gcc-patches
ways_inline function, function ipa_merge_fn_summary_after_inlining deals with target information merging. The patch has been bootstrapped and regress-tested on powerpc64le-linux-gnu Power9. Is it on the right track? Any comments are highly appreciated! BR, Kewen -- gcc/ChangeLog: PR ipa/102059

[PATCH] C++: add type checking for static local vector variable in template

2021-09-01 Thread wangpc via Gcc-patches
From: wangpc --- gcc/cp/pt.c| 8 +++- .../aarch64/sve/static-var-in-template.C | 18 ++ 2 files changed, 25 insertions(+), 1 deletion(-) create mode 100644 gcc/testsuite/g++.target/aarch64/sve/static-var-in-template.C diff --git

[PATCH] Remove macro check for __AMX_BF16/INT8/TILE__ in header file.

2021-09-01 Thread liuhongt via Gcc-patches
Hi: Details discussed in PR. Bootstrapped and regtested on x86-64_linux-gnu{-m32,}. Pushed to master and GCC-11. gcc/ChangeLog: PR target/102166 * config/i386/amxbf16intrin.h : Remove macro check for __AMX_BF16__. * config/i386/amxint8intrin.h : Remove macro check

Re: [PATCH] avoid transform at run until wrap comparesion

2021-09-02 Thread Bin.Cheng via Gcc-patches
ootstrap and regtest pass on ppc64le. > > Is this ok for trunk? > > This looks like PR100740, see the discussion starting at > https://gcc.gnu.org/pipermail/gcc-patches/2021-June/571570.html > > We seem to be at a dead end figuring what's exactly required > to make th

Re: [RFC/PATCH] ipa-inline: Add target info into fn summary [PR102059]

2021-09-02 Thread Kewen.Lin via Gcc-patches
function parameter gimple* will be also enough for this example. :) IMHO, even with this target information collection, we are unable to check all ISA features, it can only work for some "dull" ISA features, like HTM on Power which can only be exploited by builtin (or inline asm), the

Re: [RFC/PATCH] ipa-inline: Add target info into fn summary [PR102059]

2021-09-02 Thread Kewen.Lin via Gcc-patches
> case > and that function gets inlined it might end up using VSX (as now active in the > caller) to vectorize the unprofitable case. But in general it should work The example seems to be what we want to avoid in PR70010 if non-VSX set for callee explicitly. :) btw, in the current implementati

[r12-3310 Regression] FAIL: gcc.dg/torture/fp-int-convert-float16-timode.c -Os execution test on Linux/x86_64

2021-09-02 Thread sunil.k.pandey via Gcc-patches
-convert-float16.c -Os execution test FAIL: gcc.dg/torture/fp-int-convert-float16-timode.c -Os execution test with GCC configured with To reproduce: $ cd {build_dir}/gcc && make check RUNTESTFLAGS="dg-torture.exp=gcc.dg/torture/fp-int-convert-float16.c --target_board='

[PATCH] Fix target/102173 ICE after error recovery

2021-09-02 Thread apinski--- via Gcc-patches
From: Andrew Pinski After the recent r12-3278-823685221de986a change, the testcase gcc.target/aarch64/sve/acle/general-c/type_redef_1.c started to ICE as the code was not ready for error_mark_node in the type. This fixes that and the testcase now passes. gcc/ChangeLog: * config

[PATCH] [aarch64] Fix target/95969: __builtin_aarch64_im_lane_boundsi interferes with gimple

2021-09-02 Thread apinski--- via Gcc-patches
From: Andrew Pinski This patch adds simple folding of __builtin_aarch64_im_lane_boundsi where we are not going to error out. It fixes the problem by the removal of the function from the IR. OK? Bootstrapped and tested on aarch64-linux-gnu with no regressions. gcc/ChangeLog: * config

Re: [RFC/PATCH] ipa-inline: Add target info into fn summary [PR102059]

2021-09-02 Thread Kewen.Lin via Gcc-patches
port ix86_can_inline_p checks x_ix86_target_flags, x_ix86_isa_flags, x_ix86_isa_flags2, arch and tune etc. now, one HOST_WIDE_INT seems not good to it, if it wants to check more. ;-) > >> --- a/gcc/config/rs6000/rs6000-call.c >> +++ b/gcc/config/rs6000/rs6000-call.c >> @@ -1364

[r12-3321 Regression] FAIL: gfortran.dg/PR100914.f90 -Os (test for excess errors) on Linux/x86_64

2021-09-02 Thread sunil.k.pandey via Gcc-patches
errors) FAIL: gfortran.dg/PR100914.f90 -O3 -g (test for excess errors) FAIL: gfortran.dg/PR100914.f90 -Os (test for excess errors) with GCC configured with To reproduce: $ cd {build_dir}/gcc && make check RUNTESTFLAGS="dg.exp=gfortran.dg/PR100914.f90 --target_board='

[PATCH] Fix some GC issues in the aarch64 back-end.

2021-09-02 Thread apinski--- via Gcc-patches
GTY. OK? Bootstrapped and tested on aarch64-linux-gnu with no regressions. gcc/ChangeLog: * config/aarch64/aarch64-builtins.c (struct aarch64_simd_type_info): Mark with GTY. (aarch64_simd_types): Likewise. (aarch64_simd_intOI_type_node): Likewise

[PATCH] Explicitly add -msse2 to compile HF related libgcc source file.

2021-09-03 Thread liuhongt via Gcc-patches
For 32-bit libgcc configure w/o sse2, there's would be an error since GCC only support _Float16 under sse2. Explicitly add -msse2 for those HF related libgcc functions, so users can still link them w/ the upper configuration. Bootstrapped and regtested on x86_64-linux-gnu{-m32,}. Ok for

[PATCHv2] [aarch64] Fix target/95969: __builtin_aarch64_im_lane_boundsi interferes with gimple

2021-09-03 Thread apinski--- via Gcc-patches
From: Andrew Pinski This patch adds simple folding of __builtin_aarch64_im_lane_boundsi where we are not going to error out. It fixes the problem by the removal of the function from the IR. OK? Bootstrapped and tested on aarch64-linux-gnu with no regressions. gcc/ChangeLog: PR target

[PATCH] Fix PR tree-opt/63184: add simplification of (& + A) != (& + B)

2021-09-05 Thread apinski--- via Gcc-patches
From: Andrew Pinski These two testcases have been failing since GCC 5 but things have improved such that adding a simplification to match.pd for this case is easier than before. In the end we have the following IR: _5 = &a[1] + _4; _7 = &a + _13; if (_5 != _7) So we can fol

[PATCH] Enable auto-vectorization at O2 with very-cheap cost model.

2021-09-06 Thread liuhongt via Gcc-patches
Hi: As discussed in [1], most of (currently unopposed) targets want auto-vectorization at O2, and IMHO now would be a good time to enable O2 vectorization for GCC trunk, so it would leave enough time to expose related issues and fix them. Bootstrapped and regtested on x86_64-linux-gnu{-m32

[PATCH] Adjust the wording for x86 _Float16 type.

2021-09-06 Thread liuhongt via Gcc-patches
Hi: As discussed in [1], adjust the layout for x86 _Float16 description. Bootstrappedn and regtested on x86_64-linux-gnu{-m32,}. Ok for trunk? gcc/ChangeLog: * doc/extend.texi: (@node Floating Types): Adjust the wording. (@node Half-Precision): Ditto. --- gcc/doc

[PATCH] Avoid FROM being overwritten in expand_fix.

2021-09-06 Thread liuhongt via Gcc-patches
for trunk? gcc/ChangeLog: PR middle-end/102182 * optabs.c (expand_fix): Add from1 to avoid from being overwritten. gcc/testsuite/ChangeLog: PR middle-end/102182 *gcc.target/i386/pr101282.c: New test. --- gcc/optabs.c | 10

[PATCH] C++: add type checking for static local vector variable in template

2021-09-06 Thread wangpc via Gcc-patches
This patch adds type checking for static local vector variable in C++ template, both AArch64 SVE and RISCV RVV are of sizeless type and thay all have this issue. 2021-08-06 wangpc gcc/cp/ChangeLog * pt.c (tsubst_decl): Add type checking. gcc/testsuite/ChangeLog * g++.target

[PATCH] [i386] Optimize v4sf reduction.

2021-09-07 Thread liuhongt via Gcc-patches
Hi: The optimization is decribled in PR. The two instruction sequences are almost as fast, but the optimized instruction sequences could be one mov instruction less on sse2 and 2 mov instruction less on sse3. Bootstrapped and regtested on x86_64-linux-gnu{-m32,}. gcc/ChangeLog: PR

[PATCH v4] rs6000: Add load density heuristic

2021-09-07 Thread Kewen.Lin via Gcc-patches
and got fixed by this patch, besides got extra 3.93% speedup against O2 and another bmk 554.roms_r got 3.24% speed up. In short, the Power10 evaluation result shows this patch is positive. > Otherwise I have one comment below... > > On 7/28/21 12:22 AM, Kewen.Lin wrote: >> Hi, >

Re: [PATCH v2] rs6000: Add load density heuristic

2021-09-08 Thread Kewen.Lin via Gcc-patches
Hi Segher, Thanks for the comments! on 2021/9/7 上午7:43, Segher Boessenkool wrote: > Hi! > > On Wed, Jul 28, 2021 at 10:59:50AM +0800, Kewen.Lin wrote: +/* As a visitor function for each statement cost entry handled in + function add_stmt_cost, gather some information and update its >

PING^3 [PATCH v2] combine: Tweak the condition of last_set invalidation

2021-09-08 Thread Kewen.Lin via Gcc-patches
Hi, Gentle ping this: https://gcc.gnu.org/pipermail/gcc-patches/2021-June/572555.html BR, Kewen on 2021/7/15 上午10:00, Kewen.Lin via Gcc-patches wrote: > Hi, > > Gentle ping this: > > https://gcc.gnu.org/pipermail/gcc-patches/2021-June/572555.html > > BR, > Kewen

[PATCH v2] ipa-inline: Add target info into fn summary [PR102059]

2021-09-08 Thread Kewen.Lin via Gcc-patches
ments are highly appreciated! [1] https://gcc.gnu.org/pipermail/gcc-patches/2021-September/578555.html BR, Kewen - gcc/ChangeLog: PR ipa/102059 * config/rs6000/rs6000-call.c (rs6000_fn_has_any_of_these_mask_bits): New function. * config/rs6000/rs6000-internal.h (rs

Re: [PATCH v4] rs6000: Add load density heuristic

2021-09-08 Thread Kewen.Lin via Gcc-patches
on 2021/9/8 下午2:57, Kewen.Lin via Gcc-patches wrote: > Hi Bill, > > Thanks for the review comments! > > on 2021/9/3 下午11:57, Bill Schmidt wrote: >> Hi Kewen, >> >> Sorry that we lost track of this patch!  The heuristic approach looks good.  >> It is limite

[PATCH] Optimize vec_extract for 256/512-bit vector when index exceeds the lower 128 bits.

2021-09-08 Thread liuhongt via Gcc-patches
$0x1, %ymm0, %xmm0 + valignq $2, %ymm0, %ymm0, %ymm0 - vextractf64x4 $0x1, %zmm0, %ymm0 - vextractf64x2 $0x1, %ymm0, %xmm0 - vunpckhpd %xmm0, %xmm0, %xmm0 + valignq $7, %zmm0, %zmm0, %zmm0 Bootstrapped and regtested on x86_64-linux-gnu{-m32,}. gcc

[PATCH] [i386] Remove copysign post_reload splitter for scalar modes.

2021-09-09 Thread liuhongt via Gcc-patches
ust like avx512dq-abs-copysign-1.c shows. Bootstrapped and regtested on x86_64-linux-gnu{-m32,}. gcc/ChangeLog: * config/i386/i386-expand.c (ix86_expand_copysign): Expand right into ANDNOT + AND + IOR, using paradoxical subregs. (ix86_split_copysign_const)

[r12-3425 Regression] FAIL: gcc.target/i386/vect-rebuild.c scan-assembler-not unpck on Linux/x86_64

2021-09-09 Thread sunil.k.pandey via Gcc-patches
: gcc.target/i386/vect-rebuild.c scan-assembler-not unpck FAIL: gcc.target/i386/vect-rebuild.c scan-assembler-times \tv?extractf128[ \t] 1 with GCC configured with ../../gcc/configure --prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master/master/r12-3425/usr --enable-clocale=gnu --with-system-zlib

[r12-3433 Regression] FAIL: gcc.target/i386/auto-init-padding-9.c scan-assembler rep stosq on Linux/x86_64

2021-09-09 Thread sunil.k.pandey via Gcc-patches
g-3.c scan-assembler movl\t\\$0, FAIL: gcc.target/i386/auto-init-padding-3.c scan-assembler movl\t\\$16, FAIL: gcc.target/i386/auto-init-padding-3.c scan-assembler rep stosq FAIL: gcc.target/i386/auto-init-padding-4.c scan-rtl-dump-times expand "0xfffe\\]\\) repeated x16" 1

Re: [PATCH v4] rs6000: Add load density heuristic

2021-09-09 Thread Kewen.Lin via Gcc-patches
Hi Segher and Bill, Thanks a lot for your reviews and helps! on 2021/9/10 上午1:19, Bill Schmidt wrote: > On 9/9/21 11:11 AM, Segher Boessenkool wrote: >> Hi! >> >> On Wed, Sep 08, 2021 at 02:57:14PM +0800, Kewen.Lin wrote: > +  /* If we have strided or elementwise loads into a vector, it's

Re: [PATCH v4] rs6000: Add load density heuristic

2021-09-09 Thread Kewen.Lin via Gcc-patches
on 2021/9/10 上午11:22, Kewen.Lin via Gcc-patches wrote: > Hi Segher and Bill, > > Thanks a lot for your reviews and helps! > > on 2021/9/10 上午1:19, Bill Schmidt wrote: >> On 9/9/21 11:11 AM, Segher Boessenkool wrote: >>> Hi! >>> >>> On Wed, S

[PATCH] Relax condition of (vec_concat:M(vec_select op0 idx0)(vec_select op0 idx1)) to allow different modes between op0 and M, but have same inner mode.

2021-09-09 Thread liuhongt via Gcc-patches
6_64-linux-gnu{-m32,}. Ok for trunk? gcc/ChangeLog: * simplify-rtx.c (simplify_context::simplify_binary_operation_1): Relax condition of simplifying (vec_concat:M (vec_select op0 index0)(vec_select op1 index1)) to allow different modes between op0 and M

[PATCH] Disallow paradoxical subregs when outer mode is SCALAR_FLOAT_MODE_P.

2021-09-09 Thread liuhongt via Gcc-patches
-linux-gnu-gcc w/o ICE. Ok for trunk? gcc/ChangeLog: PR middle-end/102254 PR middle-end/102154 * emit-rtl.c (validate_subreg): Disallow paradoxical subregs when omode is SCALAR_FLOAT_MODE_P. gcc/testsuite/ChangeLog: * gcc.target/i386/pr102254.c: New test

[PATCH 0/2] Revert r12-3277 since it caused regressions on many other targets.

2021-09-10 Thread liuhongt via Gcc-patches
Hi: Details discussed in https://gcc.gnu.org/pipermail/gcc-patches/2021-September/579170.html. Bootstrapped and regtested on x86_64-linux-gnu{-m32,}. Ok for trunk? liuhongt (2): Revert "Get rid of all float-int special cases in validate_subreg." validate_subreg b

[PATCH 1/2] Revert "Get rid of all float-int special cases in validate_subreg."

2021-09-10 Thread liuhongt via Gcc-patches
This reverts commit d2874d905647a1d146dafa60199d440e837adc4d. PR target/102254 PR target/102154 PR target/102211 --- gcc/emit-rtl.c | 40 1 file changed, 40 insertions(+) diff --git a/gcc/emit-rtl.c b/gcc/emit-rtl.c index 77ea8948ee8..ff3b4449b37 100644

[PATCH 2/2] validate_subreg before call gen_lowpart to avoid ICE.

2021-09-10 Thread liuhongt via Gcc-patches
gcc/ChangeLog: * expmed.c (extract_bit_field_using_extv): validate_subreg before call gen_lowpart. --- gcc/expmed.c | 6 +- 1 file changed, 5 insertions(+), 1 deletion(-) diff --git a/gcc/expmed.c b/gcc/expmed.c index 3143f38e057..10d62d857a8 100644 --- a/gcc/expmed.c +++ b

[PATCH] Fix PR lto/49664: liblto_plugin.so exports too many symbols

2021-09-12 Thread apinski--- via Gcc-patches
From: Andrew Pinski So right now liblto_plugin.so exports many libiberty symbols and simple_object file symbols but really it just needs to export onload. This fixes the problem by using "-export-symbols-regex onload" on the libtool link line. lto-plugin/ChangeLog: * Makefile.am: Expor

[PATCH] Remove UNSPEC_{COPYSIGN,XORSIGN}.

2021-09-12 Thread liuhongt via Gcc-patches
Hi: UNSPEC_COPYSIGN/XORSIGN are only used by related post_reload splitters which have been removed by r12-3417 and r12-3435. Bootstrapped and regtest on x86_64-linux-gnu{-m32,}. Pushed to trunk. gcc/ChangeLog: * config/i386/i386.md: (UNSPEC_COPYSIGN): Remove

[committed] rs6000: Remove typedef for struct rs6000_cost_data

2021-09-12 Thread Kewen.Lin via Gcc-patches
Hi, This patch follows Segher's suggestion here[1] to get rid of the typedef, it's pre-approved as [1]. Bootstrapped and regtested on powerpc64le-linux-gnu Power9. Pushed to trunk as r12-3468. [1] https://gcc.gnu.org/pipermail/gcc-patches/2021-September/579115.html BR, Kewen

[PATCH] Remove m32r{,le}-*-linux* support from GCC

2021-09-13 Thread apinski--- via Gcc-patches
-linux were rejected when building. contrib/ChangeLog: * config-list.mk: Remove m32r-linux and m32rle-linux from the list. gcc/ChangeLog: * config.gcc: Add m32r-*-linux* and m32rle-*-linux* to the Unsupported targets list. Remove support for m32r-*-linux

[r12-3495 Regression] FAIL: 29_atomics/atomic_flag/test_and_set/explicit-hle.cc (test for excess errors) on Linux/x86_64

2021-09-13 Thread sunil.k.pandey via Gcc-patches
excess errors) with GCC configured with ../../gcc/configure --prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master/master/r12-3495/usr --enable-clocale=gnu --with-system-zlib --with-demangler-in-ld --with-fpmath=sse --enable-languages=c,c++,fortran --enable-cet --without-isl --enable

Re: [PATCH v2] ipa-inline: Add target info into fn summary [PR102059]

2021-09-13 Thread Kewen.Lin via Gcc-patches
Replace the Power target-specific builtin machinery" and found that the existing interface bif_is_htm works perfectly and nothing need to be added for the HTM check here. Thanks for the efforts! btw, the diff I used for testing is listed below: diff --git a/gcc/config/rs6000/rs6000.c b/gc

[PATCH] Output vextract{i, f}{32x4, 64x2} for (vec_select:(reg:Vmode) idx) when byte_offset of idx % 16 == 0.

2021-09-14 Thread liuhongt via Gcc-patches
Hi: As describled in PR, use vextract instead on valign when byte_offset % 16 == 0. Bootstrapped and regtest on x86_64-linux-gnu{-m32,}. Pushed to trunk. 2020-09-13 Hongtao Liu Peter Cordes gcc/ChangeLog: PR target/91103 * config/i386/sse.md (extract_suf

<    10   11   12   13   14   15   16   17   18   19   >