y for trunk? Any recommendations? Thanks a lot.
>
> ChangeLog
> 2022-05-13 Haochen Gui
>
> gcc/
> PR target/95737
> * config/rs6000/rs6000.md (extenddi_ca_minus_one): Define.
>
Nit: (*extenddi_ca_minus_one): New define_insn_and_split.
> gcc/tests
;t. But I'm not sure if it's the same as what Piotr's
environments have. Hope it's similar. :-)
[1] https://gcc.gnu.org/pipermail/gcc-patches/2022-April/593193.html
[2] https://github.com/freebsd/freebsd-src/blob/main/lib/msun/powerpc/fenv.h
[3] https://github.com/freebsd/freebsd-src/blob/main/lib/msun/x86/fenv.h
BR,
Kewen
sion phase like "unresolved overload for builtin ...".
Bootstrapped and regtested on powerpc64-linux-gnu P8 and
powerpc64le-linux-gnu P9 and P10.
Is it ok for trunk?
BR,
Kewen
-
PR target/105485
gcc/ChangeLog:
* config/rs6000/rs6000-builtin.cc (rs6000_gimpl
- .value 2
- .value 3
- .value 8
- .value 9
- .value 10
- .value 11
Bootstrapped and regtested on x86_64-pc-linux-gnu{-m32,}.
Ok for trunk?
gcc/ChangeLog:
PR target/105033
* config/i386/sse.md (*vec_concatv4si): Extend to ..
(*vec_concat
%xmm0
+ vpunpcklqdq %xmm1, %xmm0, %xmm0
...
-.LC0:
- .value 0
- .value 1
- .value 2
- .value 3
- .value 8
- .value 9
- .value 10
- .value 11
gcc/ChangeLog:
PR target/105033
* config/i386/sse.md (*vec_concatv4si): Exte
When d->perm[i] == d->perm[i-1] + 1 and d->perm[i] == nelt, it's not
continuous. It should fail if there are more than 2 continuous areas.
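The continuity rule above can be illustrated with a small stand-alone sketch. It is not the code from the patch: count_contiguous_areas is a hypothetical helper, and perm and nelt only stand in for d->perm and the per-operand element count. A new area starts whenever an index does not follow its predecessor, or when it equals nelt and so steps from the first operand into the second.

```c
#include <stddef.h>

/* Hypothetical helper, not taken from the patch.  Counts the maximal
   contiguous areas in PERM (N selector entries), where NELT is the
   number of elements in each input vector.  */
static unsigned
count_contiguous_areas (const unsigned *perm, size_t n, unsigned nelt)
{
  if (n == 0)
    return 0;

  unsigned areas = 1;
  for (size_t i = 1; i < n; i++)
    {
      /* Consecutive indices continue the current area, unless the
         current index is NELT: that steps from the first operand
         into the second, so it is not continuous.  */
      int continues = perm[i] == perm[i - 1] + 1 && perm[i] != nelt;
      if (!continues)
        areas++;
    }
  return areas;
}
```

Under this sketch a caller would reject the permutation when the count exceeds 2. For example, { 0, 1, 2, 3, 8, 9, 10, 11 } with nelt == 8 gives two areas, and so does { 6, 7, 8, 9 }, because index 8 crosses the operand boundary.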
Bootstrapped and regtested on x86_64-pc-linux-gnu{-m32,}.
Ok for trunk?
gcc/ChangeLog:
PR target/105587
* config/
y for trunk? Any recommendations? Thanks a lot.
>
> ChangeLog
> 2022-05-16 Haochen Gui
>
> gcc/
> PR target/95737
> * config/rs6000/rs6000.md (subfsi3_carry_in_xx_64): New.
(subfsi3_carry_in_xx_64) -> (*subfsi3_carry_in_xx_64)
Sorry for nit-picking
Bootstrapped and regtested on x86_64-pc-linux-gnu{-m32,}
Ok for trunk?
gcc/ChangeLog:
PR tree-optimization/105591
* tree-ssa-forwprop.cc (simplify_bitfield_ref): Clamp
vec_perm_expr index.
gcc/testsuite/ChangeLog:
* gcc.dg/pr105591.c: New test.
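
A minimal sketch of what clamping a permute index can look like, assuming the selector is interpreted modulo twice the element count (my understanding of VEC_PERM_EXPR semantics, not something this snippet confirms). select_perm_element is a hypothetical name and is not the code in simplify_bitfield_ref.

```c
#include <stddef.h>

/* Hypothetical helper: pick the element that selector value SEL names
   from the concatenation of V0 and V1, each holding NELTS elements
   (NELTS must be nonzero).  SEL is first reduced modulo 2 * NELTS, so
   an out-of-range selector wraps around instead of indexing past
   either input.  */
static int
select_perm_element (const int *v0, const int *v1, size_t nelts, size_t sel)
{
  size_t idx = sel % (2 * nelts);   /* clamp into [0, 2 * nelts) */
  return idx < nelts ? v0[idx] : v1[idx - nelts];
}
```

With four-element inputs, selector 9 is reduced to 1 and therefore reads from the first input rather than past the end of the second.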
---
gcc
x86_64-linux-gnu/-m32/-mx32, and
committed to mainline.
Regards,
Iain.
---
gcc/d/ChangeLog:
* dmd/MERGE: Merge upstream dmd 60bfa0ee7.
* dmd/VERSION: Update version to v2.100.0.
* d-builtins.cc (d_init_versions): Update for new front-end interface.
* d-
xtend:DI (and:SI (plus:SI (ashift:SI (const_int 1 [0x1])
(subreg:QI (reg:SI 98) 0))
(const_int -1 [0x]))
(subreg:SI (reg:DI 95) 0
(clobber (reg:CC 17 flags))
])
Add new define_insn
P9 and P10.
v3: Update test case with dg-excess-errors.
v2: Add one test case and refine commit logs.
https://gcc.gnu.org/pipermail/gcc-patches/2022-April/593155.html
v1: https://gcc.gnu.org/pipermail/gcc-patches/2022-March/591768.html
Is it ok for trunk?
BR,
Kewen
-
PR target
ed on powerpc64-linux-gnu P8 and
powerpc64le-linux-gnu P9 and P10.
v3: Update test case with dg-excess-errors.
v2: Polish some comments and add one test case as Will and Peter suggested.
https://gcc.gnu.org/pipermail/gcc-patches/2022-April/592916.html
v1: https://gcc.gnu.org/pipermail/gcc-pa
Hi,
Move pr83660.C to g++.target. As noted in comment #3 of PR83660,
renaming it to c isn't an option.
This test case wasn't tested before, so this brings back
the test coverage.
Committed as r13-619-g297a69068ddfe2.
BR,
Kewen
-
gcc/testsuite/ChangeLog:
* gcc.target/powerpc
(test for excess errors)
with GCC configured with
../../gcc/configure
--prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master/master/r13-456/usr
--enable-clocale=gnu --with-system-zlib --with-demangler-in-ld
--with-fpmath=sse --enable-languages=c,c++,fortran --enable-cet --without-isl
--enable
Over the past few months I have been working on re-adding gcj to gcc (
https://github.com/Zopolis4/gcj/tree/mster). It is now at the point where
all of the code in gcc/java and the related additions to other gcc files
compiles successfully, although libjava still breaks.
Given that I do not know
ted on x86_64-pc-linux-gnu{-m32,}.
Ready to push to trunk.
gcc/ChangeLog:
* config/i386/x86-tune-costs.h (skylake_cost): Increase gpr
<-> mask cost from 5 to 6.
(icelake_cost): Ditto.
gcc/testsuite/ChangeLog:
* gcc.target/i386/spill_to_mask-1.c: New test.
---
Hi Piotr,
on 2022/5/23 05:36, Piotr Kubaj wrote:
> Ping.
>
> FreeBSD has already enabled sanitizers in its base system in
> https://cgit.freebsd.org/src/commit/?id=f5024381ac16ba43d37a8bd32d54c27f6a6afa66
> The upstream (LLVM) patch was reviewed by a couple of people already.
>
> Our llvm-devel
/pr45685.c scan-assembler-times cmov 6
with GCC configured with
../../gcc/configure
--prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master/master/r13-707/usr
--enable-clocale=gnu --with-system-zlib --with-demangler-in-ld
--with-fpmath=sse --enable-languages=c,c++,fortran --enable-cet
ws up in
function union_defs.
PR target/105627
gcc/ChangeLog:
* config/rs6000/rs6000-p8swap.cc (union_defs): Assert def_insn can't
be a debug insn.
(union_uses): Skip debug use_insn.
gcc/testsuite/ChangeLog:
* gcc.target/powerpc/pr105627.c: New test
in one more vect_model_simple_cost dumping for each
function. Need to adjust the match count accordingly.
PR testsuite/105706
gcc/testsuite/ChangeLog:
* gcc.target/powerpc/pr78604.c: Adjust.
---
gcc/testsuite/gcc.target/powerpc/pr78604.c | 4 +++-
1 file changed, 3 insertions
-1.f90 -O execution test
with GCC configured with
../../gcc/configure
--prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master/master/r13-726/usr
--enable-clocale=gnu --with-system-zlib --with-demangler-in-ld
--with-fpmath=sse --enable-languages=c,c++,fortran --enable-cet --without-isl
te 'm'
alternative cost is too small, especially considering that the mov cost
of gpr is 2 (default for REGISTER_MOVE_COST). So this patch increases mem_cost
to 2*frequency, and also increases the reg_class cost by 1 for the m alternative.
Bootstrapped and regtested on x86_64-pc-linux-gnu{-m32,}.
Ok
." is still not good to get it clear. You are right, if the bit
set for the outermost loop (its father), it's not guaranteed that we will
do the propagation for it (and its children), so it shouldn't clear
father_bbs otherwise some expected propagation probably won't happen
Hi,
Gentle ping for the patch:
https://gcc.gnu.org/pipermail/gcc-patches/2020-September/553555.html
BR,
Kewen
on 2020/10/13 下午2:59, Kewen.Lin via Gcc-patches wrote:
> Hi,
>
> I'd like to gentle ping this patch:
>
> https://gcc.gnu.org/pipermail/gcc-patches/2020-S
Hi,
Gentle ping^4 this:
https://gcc.gnu.org/pipermail/gcc-patches/2020-May/546698.html
BR,
Kewen
on 2020/10/13 下午3:06, Kewen.Lin via Gcc-patches wrote:
> Hi,
>
> Gentle ping this:
>
> https://gcc.gnu.org/pipermail/gcc-patches/2020-May/546698.html
>
> BR,
> Kewen
Here is an updated patch, that quotes all three makeinfo special characters: @
{ }
Tested on a build with source directory /tmp/foo@bar{gee}qux
OK to commit?
Hi Richard,
Thanks again for your review!
on 2020/11/2 下午6:23, Richard Sandiford wrote:
> "Kewen.Lin" writes:
>> diff --git a/gcc/function.c b/gcc/function.c
>> index 2c8fa217f1f..3e92ee9c665 100644
>> --- a/gcc/function.c
>> +++ b
Hi David,
Thanks for the review!
> The patch looks fine to me, but I'll let Segher decide if it addresses
> his requested changes.
>
> I'm trying to be stricter about the test cases.
>
> +++ b/gcc/testsuite/gcc.target/powerpc/pr96933-1.c
> @@ -0,0 +1,14 @@
> +
-slice.f95 -DACC_DEVICE_TYPE_host=1
-DACC_MEM_SHARED=1 -foffload=disable -O3 -g execution test
with GCC configured with
../../gcc/configure
--prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master/master/r11-4614/usr
--enable-clocale=gnu --with-system-zlib --with-demangler-in-ld
--with-
-use-linker-plugin
-flto-partition=none -DPREVENT_OPTIMIZATION line 22 y == 68
FAIL: gcc.dg/guality/pr54519-4.c -O2 -flto -fuse-linker-plugin
-fno-fat-lto-objects -DPREVENT_OPTIMIZATION line 22 y == 68
with GCC configured with
../../gcc/configure
--prefix=/local/skpandey/gccwork/toolwork/gcc
that uses that external address.
Produced by a cast of thousands:
* Michael Meissner
* Peter Bergner
* Bill Schmidt
* Alan Modra
* Segher Boessenkool
* Aaron Sawdey
Passes bootstrap/regtest on ppc64le power10. OK for trunk?
gcc/ChangeLog:
* config.gcc: Add pcrel-opt.o
Hi Lyon,
Thanks for reporting and sorry for the failure.
>> The patch was updated as your comments above, re-tested on Power8
>> and committed in r11-4637.
>>
>
> The new test gcc.dg/tree-ssa/pr96789.c fails on arm:
> FAIL: gcc.dg/tree-ssa/pr96789.c scan-tree-dump dse3 "Deleted dead store:.*tmp"
es not mean the OS (libc,
> kernel, etc.) will actually save the full 64-bit registers -- making it
> only useful on Darwin currently.
>
> (You *can* run all of the testsuite flawlessly on Linux with those
> options, but that only works because those are small, short-running
>
pass_tree_loop_done.
This patch passes bootstrap and regtest on ppc64le. Is this ok for trunk?
gcc/ChangeLog
2020-10-05 Jiufu Guo
* tree-ssa-loop.h (clean_up_loop_closed_phi): New declaration.
* tree-ssa-loop.c (tree_ssa_loop_done): Call clean_up_loop_closed_phi.
* tree
-fat-lto-objects -DPREVENT_OPTIMIZATION line 17 y == 25
with GCC configured with
../../gcc/configure
--prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master/master/r11-4733/usr
--enable-clocale=gnu --with-system-zlib --with-demangler-in-ld
--with-fpmath=sse --enable-languages=c,c++,fortran
relevant data
structures that is regstat_n_sets_and_refs and reg_info_p
to ensure we won't access beyond some array bounds.
Bootstrapped/regtested on powerpc64le-linux-gnu P9 and
powerpc64-linux-gnu P8.
Any thoughts? Is it a reasonable fix?
BR,
Kewen
-----
gcc/ChangeLog:
PR rtl-optimiz
"Parm 1 param offset:0
offset:0 size:-1 max_size:64"
with GCC configured with
../../gcc/configure
--prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master/master/r11-4770/usr
--enable-clocale=gnu --with-system-zlib --with-demangler-in-ld
--with-fpmath=sse --enable-languages=c,c
, line 30)
FAIL: c-c++-common/Wimplicit-fallthrough-20.c -std=gnu++98 (test for
warnings, line 32)
FAIL: c-c++-common/Wimplicit-fallthrough-20.c -std=gnu++98 (test for
warnings, line 36)
with GCC configured with
../../gcc/configure
--prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master
nd_refs and reg_info_p
>> to ensure we won't access beyond some array bounds.
>>
>> Bootstrapped/regtested on powerpc64le-linux-gnu P9 and
>> powerpc64-linux-gnu P8.
>>
>> Any thoughts? Is it a reasonable fix?
>>
> Sure, Kewen. A bit unexpected to see l
Hello
Have a nice day !
This is JoJo from WMResearch.
We recently published a new market research report:
< Global Embedded Real-Time Operating Systems for the IoT Market Research
Report 2015-2020>and kindly let us know if you are interested in it.
If you are interested in getting more det
errors, line 15)
FAIL: g++.dg/ubsan/pr61272.C (test for excess errors)
with GCC configured with
../../gcc/configure
--prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master/master/r11-4852/usr
--enable-clocale=gnu --with-system-zlib --with-demangler-in-ld
--with-fpmath=sse --enable
to teach the case to respect vect_load_lanes,
meanwhile to guard the check only under vect_int.
Is it ok for trunk?
BR,
Kewen
-
gcc/testsuite/ChangeLog:
* gcc.dg/tree-ssa/pr96789.c: Adjusted by excluding vect_load_lanes.
diff --git a/gcc/testsuite/gcc.dg/tree-ssa/pr96789.c
b/gcc
90 -O scan-tree-dump-times
optimized "__builtin_GOMP_doacross_post " 1
FAIL: gfortran.dg/gomp/workshare-reduction-57.f90 -O scan-tree-dump-times
optimized "__builtin_GOMP_doacross_wait " 1
FAIL: gfortran.dg/gomp/workshare-reduction-57.f90 -O scan-tree-dump-times
op
ange -ftree-vectorize to -ftree-slp-vectorize?
> Or does that defeat the purpose of the test?
It works, nice, thanks for the suggestion!
I appended one explicit -fno-tree-loop-vectorize to keep it from failing
in case someone kicks off the testing with explicit -ftree-loop-vectorize.
The updated v
n-57.f90 -O scan-tree-dump-times
optimized "__builtin_GOMP_doacross(?:_ull)_post " 1
FAIL: gfortran.dg/gomp/workshare-reduction-57.f90 -O scan-tree-dump-times
optimized "__builtin_GOMP_doacross(?:_ull)_wait " 1
FAIL: gfortran.dg/gomp/workshare-reduction-57.f90 -O scan-tree-dump
/modifiers/insert/const_iterator.cc (test for excess
errors)
FAIL: 23_containers/vector/types/1.cc (test for excess errors)
FAIL: 25_algorithms/merge/constrained.cc (test for excess errors)
with GCC configured with
../../gcc/configure
--prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master
/members/poll.cc execution test
with GCC configured with
../../gcc/configure
--prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master/master/r11-4958/usr
--enable-clocale=gnu --with-system-zlib --with-demangler-in-ld
--with-fpmath=sse --enable-languages=c,c++,fortran --enable-cet --without-isl
/96042.cc (test for excess errors)
with GCC configured with
../../gcc/configure
--prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-10/releases/gcc-10/r10-9014/usr
--enable-clocale=gnu --with-system-zlib --with-demangler-in-ld
--with-fpmath=sse --enable-languages=c,c++,fortran --enable-cet
the inline expansion
of memcpy/memmove, allowing me to fix PR 96791.
Regstrap in progress on ppc64le and x86_64, ok for trunk if successful?
Thanks,
Aaron
gcc/ChangeLog
PR target/96791
* mode-classes.def: Add MODE_OPAQUE.
* machmode.def: Add OPAQUE_MODE.
* tre
/pr59776.c -Os -DPREVENT_OPTIMIZATION line pr59776.c:20
s1.g == 6.0
FAIL: gcc.dg/guality/pr59776.c -Os -DPREVENT_OPTIMIZATION line pr59776.c:20
s2.f == 5.0
with GCC configured with
../../gcc/configure
--prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master/master/r11-5029/usr
x86_64 and ppc64le?
Thanks,
Aaron
gcc/ChangeLog
PR target/96791
* mode-classes.def: Add MODE_OPAQUE.
* machmode.def: Add OPAQUE_MODE.
* tree.def: Add OPAQUE_TYPE for types that will use
MODE_OPAQUE.
* doc/generic.texi: Document OPAQUE_TYPE.
> OK. You have commit privs, right?
Yes, and I did commit after Richard’s OK:
https://gcc.gnu.org/git/?p=gcc.git;a=commit;h=034db20e2ea8301b5dc251bf10a97ce1cf90655f
… but I forgot to send an email saying I had, sorry.
FX
san_test.cc:129:22: error: writing 1 byte into a region
of size 0 [-Werror=stringop-overflow=]
As there is no easy way to convey pvalloc()'s granularity
to gcc let's just disable the warning for this test.
* g++.dg/asan/asan_test.C: disable -Wstringop-overflow.
---
gcc/testsuite/Ch
From: Andrew Pinski
The problem here is the aarch64_expand_setmem code did not check
STRICT_ALIGNMENT if it is creating an overlapping store.
This patch adds that check and the testcase works.
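
As an illustration of the issue only, here is a stand-alone sketch of an overlapping-store fill with a strict-alignment guard. It is not the aarch64_expand_setmem code: set_8_to_16_bytes and its strict_alignment parameter are hypothetical, and the real expander works on RTL rather than on C pointers.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical helper, not the aarch64 expander: set N bytes
   (8 <= N <= 16) at DST to VALUE.  */
static void
set_8_to_16_bytes (unsigned char *dst, unsigned char value, size_t n,
                   int strict_alignment)
{
  if (strict_alignment)
    {
      /* The trailing store below would start at DST + N - 8, which is
         generally unaligned, so fall back to byte stores.  */
      for (size_t i = 0; i < n; i++)
        dst[i] = value;
      return;
    }

  /* Replicate VALUE into all eight bytes of a 64-bit pattern.  */
  uint64_t pattern = UINT64_C (0x0101010101010101) * value;

  /* Two 8-byte stores; the second overlaps the first when N < 16.  */
  memcpy (dst, &pattern, 8);
  memcpy (dst + n - 8, &pattern, 8);
}
```

The overlapping form saves stores for sizes that are not a multiple of the store width, but the second store starts at an address that depends on N, which is why a strict-alignment target needs the guard.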
gcc/ChangeLog:
PR target/101934
* config/aarch64/aarch64.c (aarch64_expand_setmem
From: Andrew Pinski
This adds MIPS Linux support to gcc.misc-tests/linkage.exp. Basically
copying what was done for MIPS IRIX and changing the options to be correct.
OK?
gcc/testsuite/ChangeLog:
PR testsuite/51748
* gcc.misc-tests/linkage.exp: Add mips*-linux-* support
BR,
Kewen
-
gcc/ChangeLog:
PR ipa/102059
* config/rs6000/rs6000.c (rs6000_can_inline_p): Adjust with
target_option_default_node and consider always_inline_safe flags.
gcc/testsuite/ChangeLog:
PR ipa/102059
* gcc.target/powerpc/pr102059-1.c
Hi!
Option toc-fusion was originally intended for Power9 toc fusion,
but Power9 ended up not supporting fusion at all, so this patch
removes this useless option.
Is it ok for trunk?
BR,
Kewen
-----
gcc/ChangeLog:
* config/rs6000/rs6000.opt (-mtoc-fusion): Remove.
---
gcc/c
ways_inline function,
function ipa_merge_fn_summary_after_inlining deals with target
information merging.
The patch has been bootstrapped and regress-tested on
powerpc64le-linux-gnu Power9.
Is it on the right track?
Any comments are highly appreciated!
BR,
Kewen
--
gcc/ChangeLog:
PR ipa/102059
From: wangpc
---
gcc/cp/pt.c| 8 +++-
.../aarch64/sve/static-var-in-template.C | 18 ++
2 files changed, 25 insertions(+), 1 deletion(-)
create mode 100644
gcc/testsuite/g++.target/aarch64/sve/static-var-in-template.C
diff --git
Hi:
Details discussed in PR.
Bootstrapped and regtested on x86_64-linux-gnu{-m32,}.
Pushed to master and GCC-11.
gcc/ChangeLog:
PR target/102166
* config/i386/amxbf16intrin.h : Remove macro check for __AMX_BF16__.
* config/i386/amxint8intrin.h : Remove macro check
ootstrap and regtest pass on ppc64le.
> > Is this ok for trunk?
>
> This looks like PR100740, see the discussion starting at
> https://gcc.gnu.org/pipermail/gcc-patches/2021-June/571570.html
>
> We seem to be at a dead end figuring what's exactly required
> to make th
function parameter gimple* will be also enough for
this example. :)
IMHO, even with this target information collection, we are unable to check all
ISA features, it can only work for some "dull" ISA features, like HTM on
Power which can only be exploited by builtin (or inline asm), the
> case
> and that function gets inlined it might end up using VSX (as now active in the
> caller) to vectorize the unprofitable case. But in general it should work
The example seems to be what we want to avoid in PR70010 if non-VSX set for
callee
explicitly. :) btw, in the current implementati
-convert-float16.c -Os execution test
FAIL: gcc.dg/torture/fp-int-convert-float16-timode.c -Os execution test
with GCC configured with
To reproduce:
$ cd {build_dir}/gcc && make check
RUNTESTFLAGS="dg-torture.exp=gcc.dg/torture/fp-int-convert-float16.c
--target_board='
From: Andrew Pinski
After the recent r12-3278-823685221de986a change, the testcase
gcc.target/aarch64/sve/acle/general-c/type_redef_1.c started
to ICE as the code was not ready for error_mark_node in the
type. This fixes that and the testcase now passes.
gcc/ChangeLog:
* config
From: Andrew Pinski
This patch adds simple folding of __builtin_aarch64_im_lane_boundsi where
we are not going to error out. It fixes the problem by the removal
of the function from the IR.
OK? Bootstrapped and tested on aarch64-linux-gnu with no regressions.
gcc/ChangeLog:
* config
port ix86_can_inline_p checks x_ix86_target_flags, x_ix86_isa_flags,
x_ix86_isa_flags2, arch and tune etc. now; one HOST_WIDE_INT does not seem
sufficient for it if it wants to check more. ;-)
>
>> --- a/gcc/config/rs6000/rs6000-call.c
>> +++ b/gcc/config/rs6000/rs6000-call.c
>> @@ -1364
errors)
FAIL: gfortran.dg/PR100914.f90 -O3 -g (test for excess errors)
FAIL: gfortran.dg/PR100914.f90 -Os (test for excess errors)
with GCC configured with
To reproduce:
$ cd {build_dir}/gcc && make check
RUNTESTFLAGS="dg.exp=gfortran.dg/PR100914.f90 --target_board='
GTY.
OK? Bootstrapped and tested on aarch64-linux-gnu with no regressions.
gcc/ChangeLog:
* config/aarch64/aarch64-builtins.c (struct aarch64_simd_type_info):
Mark with GTY.
(aarch64_simd_types): Likewise.
(aarch64_simd_intOI_type_node): Likewise
For a 32-bit libgcc configured w/o sse2, there would be an error since
GCC only supports _Float16 under sse2. Explicitly add -msse2 for those
HF related libgcc functions, so users can still link them w/ the
above configuration.
Bootstrapped and regtested on x86_64-linux-gnu{-m32,}.
Ok for
From: Andrew Pinski
This patch adds simple folding of __builtin_aarch64_im_lane_boundsi where
we are not going to error out. It fixes the problem by the removal
of the function from the IR.
OK? Bootstrapped and tested on aarch64-linux-gnu with no regressions.
gcc/ChangeLog:
PR target
From: Andrew Pinski
These two testcases have been failing since GCC 5 but things
have improved such that adding a simplification to match.pd
for this case is easier than before.
In the end we have the following IR:
_5 = &a[1] + _4;
_7 = &a + _13;
if (_5 != _7)
So we can fol
Hi:
As discussed in [1], most of (currently unopposed) targets want
auto-vectorization at O2, and IMHO now would be a good time to enable O2
vectorization for GCC trunk, so it would leave enough time to expose
related issues and fix them.
Bootstrapped and regtested on x86_64-linux-gnu{-m32
Hi:
As discussed in [1], adjust the layout for x86 _Float16 description.
Bootstrapped and regtested on x86_64-linux-gnu{-m32,}.
Ok for trunk?
gcc/ChangeLog:
* doc/extend.texi: (@node Floating Types): Adjust the wording.
(@node Half-Precision): Ditto.
---
gcc/doc
for trunk?
gcc/ChangeLog:
PR middle-end/102182
* optabs.c (expand_fix): Add from1 to avoid from being
overwritten.
gcc/testsuite/ChangeLog:
PR middle-end/102182
* gcc.target/i386/pr101282.c: New test.
---
gcc/optabs.c | 10
This patch adds type checking for static local vector variables in
C++ templates. Both AArch64 SVE and RISC-V RVV types are sizeless,
and they all have this issue.
2021-08-06 wangpc
gcc/cp/ChangeLog
* pt.c (tsubst_decl): Add type checking.
gcc/testsuite/ChangeLog
* g++.target
Hi:
The optimization is described in the PR.
The two instruction sequences are almost equally fast, but the optimized
instruction sequence could be one mov instruction shorter on sse2 and
2 mov instructions shorter on sse3.
Bootstrapped and regtested on x86_64-linux-gnu{-m32,}.
gcc/ChangeLog:
PR
and got fixed by this
patch, besides got extra 3.93% speedup
against O2 and another bmk 554.roms_r got 3.24% speed up.
In short, the Power10 evaluation result shows this patch is positive.
> Otherwise I have one comment below...
>
> On 7/28/21 12:22 AM, Kewen.Lin wrote:
>> Hi,
>
Hi Segher,
Thanks for the comments!
on 2021/9/7 上午7:43, Segher Boessenkool wrote:
> Hi!
>
> On Wed, Jul 28, 2021 at 10:59:50AM +0800, Kewen.Lin wrote:
+/* As a visitor function for each statement cost entry handled in
+ function add_stmt_cost, gather some information and update its
>
Hi,
Gentle ping this:
https://gcc.gnu.org/pipermail/gcc-patches/2021-June/572555.html
BR,
Kewen
on 2021/7/15 上午10:00, Kewen.Lin via Gcc-patches wrote:
> Hi,
>
> Gentle ping this:
>
> https://gcc.gnu.org/pipermail/gcc-patches/2021-June/572555.html
>
> BR,
> Kewen
ments are highly appreciated!
[1] https://gcc.gnu.org/pipermail/gcc-patches/2021-September/578555.html
BR,
Kewen
-
gcc/ChangeLog:
PR ipa/102059
* config/rs6000/rs6000-call.c (rs6000_fn_has_any_of_these_mask_bits):
New function.
* config/rs6000/rs6000-internal.h
(rs
on 2021/9/8 下午2:57, Kewen.Lin via Gcc-patches wrote:
> Hi Bill,
>
> Thanks for the review comments!
>
> on 2021/9/3 下午11:57, Bill Schmidt wrote:
>> Hi Kewen,
>>
>> Sorry that we lost track of this patch! The heuristic approach looks good.
>> It is limite
$0x1, %ymm0, %xmm0
+ valignq $2, %ymm0, %ymm0, %ymm0
- vextractf64x4 $0x1, %zmm0, %ymm0
- vextractf64x2 $0x1, %ymm0, %xmm0
- vunpckhpd %xmm0, %xmm0, %xmm0
+ valignq $7, %zmm0, %zmm0, %zmm0
Bootstrapped and regtested on x86_64-linux-gnu{-m32,}.
gcc
ust like avx512dq-abs-copysign-1.c
shows.
Bootstrapped and regtested on x86_64-linux-gnu{-m32,}.
gcc/ChangeLog:
* config/i386/i386-expand.c (ix86_expand_copysign): Expand
right into ANDNOT + AND + IOR, using paradoxical subregs.
(ix86_split_copysign_const)
: gcc.target/i386/vect-rebuild.c scan-assembler-not unpck
FAIL: gcc.target/i386/vect-rebuild.c scan-assembler-times \tv?extractf128[ \t] 1
with GCC configured with
../../gcc/configure
--prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master/master/r12-3425/usr
--enable-clocale=gnu --with-system-zlib
g-3.c scan-assembler movl\t\\$0,
FAIL: gcc.target/i386/auto-init-padding-3.c scan-assembler movl\t\\$16,
FAIL: gcc.target/i386/auto-init-padding-3.c scan-assembler rep stosq
FAIL: gcc.target/i386/auto-init-padding-4.c scan-rtl-dump-times expand
"0xfffe\\]\\) repeated x16" 1
Hi Segher and Bill,
Thanks a lot for your reviews and help!
on 2021/9/10 上午1:19, Bill Schmidt wrote:
> On 9/9/21 11:11 AM, Segher Boessenkool wrote:
>> Hi!
>>
>> On Wed, Sep 08, 2021 at 02:57:14PM +0800, Kewen.Lin wrote:
> + /* If we have strided or elementwise loads into a vector, it's
on 2021/9/10 上午11:22, Kewen.Lin via Gcc-patches wrote:
> Hi Segher and Bill,
>
> Thanks a lot for your reviews and helps!
>
> on 2021/9/10 上午1:19, Bill Schmidt wrote:
>> On 9/9/21 11:11 AM, Segher Boessenkool wrote:
>>> Hi!
>>>
>>> On Wed, S
6_64-linux-gnu{-m32,}.
Ok for trunk?
gcc/ChangeLog:
* simplify-rtx.c
(simplify_context::simplify_binary_operation_1): Relax
condition of simplifying (vec_concat:M (vec_select op0
index0)(vec_select op1 index1)) to allow different modes
between op0 and M
-linux-gnu-gcc w/o ICE.
Ok for trunk?
gcc/ChangeLog:
PR middle-end/102254
PR middle-end/102154
* emit-rtl.c (validate_subreg): Disallow paradoxical subregs
when omode is SCALAR_FLOAT_MODE_P.
gcc/testsuite/ChangeLog:
* gcc.target/i386/pr102254.c: New test
Hi:
Details discussed in
https://gcc.gnu.org/pipermail/gcc-patches/2021-September/579170.html.
Bootstrapped and regtested on x86_64-linux-gnu{-m32,}.
Ok for trunk?
liuhongt (2):
Revert "Get rid of all float-int special cases in validate_subreg."
validate_subreg b
This reverts commit d2874d905647a1d146dafa60199d440e837adc4d.
PR target/102254
PR target/102154
PR target/102211
---
gcc/emit-rtl.c | 40
1 file changed, 40 insertions(+)
diff --git a/gcc/emit-rtl.c b/gcc/emit-rtl.c
index 77ea8948ee8..ff3b4449b37 100644
gcc/ChangeLog:
* expmed.c (extract_bit_field_using_extv): validate_subreg
before call gen_lowpart.
---
gcc/expmed.c | 6 +-
1 file changed, 5 insertions(+), 1 deletion(-)
diff --git a/gcc/expmed.c b/gcc/expmed.c
index 3143f38e057..10d62d857a8 100644
--- a/gcc/expmed.c
+++ b
From: Andrew Pinski
So right now liblto_plugin.so exports many libiberty symbols and
simple_object file symbols but really it just needs to export onload.
This fixes the problem by using "-export-symbols-regex onload" on
the libtool link line.
lto-plugin/ChangeLog:
* Makefile.am: Expor
Hi:
UNSPEC_COPYSIGN/XORSIGN are only used by related post_reload splitters
which have been removed by r12-3417 and r12-3435.
Bootstrapped and regtested on x86_64-linux-gnu{-m32,}.
Pushed to trunk.
gcc/ChangeLog:
* config/i386/i386.md: (UNSPEC_COPYSIGN): Remove
Hi,
This patch follows Segher's suggestion here[1] to get rid of
the typedef; it was pre-approved in [1].
Bootstrapped and regtested on powerpc64le-linux-gnu Power9.
Pushed to trunk as r12-3468.
[1] https://gcc.gnu.org/pipermail/gcc-patches/2021-September/579115.html
BR,
Kewen
-linux were rejected
when building.
contrib/ChangeLog:
* config-list.mk: Remove m32r-linux and m32rle-linux
from the list.
gcc/ChangeLog:
* config.gcc: Add m32r-*-linux* and m32rle-*-linux*
to the Unsupported targets list.
Remove support for m32r-*-linux
excess errors)
with GCC configured with
../../gcc/configure
--prefix=/local/skpandey/gccwork/toolwork/gcc-bisect-master/master/r12-3495/usr
--enable-clocale=gnu --with-system-zlib --with-demangler-in-ld
--with-fpmath=sse --enable-languages=c,c++,fortran --enable-cet --without-isl
--enable
Replace the Power target-specific builtin machinery" and found
that the existing
interface bif_is_htm works perfectly and nothing needs to be added for the HTM
check here.
Thanks for the efforts! btw, the diff I used for testing is listed below:
diff --git a/gcc/config/rs6000/rs6000.c b/gc
Hi:
As described in the PR, use vextract instead of valign when
byte_offset % 16 == 0.
Bootstrapped and regtested on x86_64-linux-gnu{-m32,}.
Pushed to trunk.
2020-09-13 Hongtao Liu
Peter Cordes
gcc/ChangeLog:
PR target/91103
* config/i386/sse.md (extract_suf