[Bug middle-end/17886] variable rotate and unsigned long long rotate should be better optimized

2023-10-10 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=17886 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug rtl-optimization/110701] [14 Regression] Wrong code at -O1/2/3/s on x86_64-linux-gnu

2023-10-11 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110701 Roger Sayle changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug target/116275] [15 regression] ICE when building llvm-18.1.8 (convert_insn, at config/i386/i386-features.cc:1502)

2024-08-07 Thread roger at nextmovesoftware dot com via Gcc-bugs
|1 Assignee|unassigned at gcc dot gnu.org |roger at nextmovesoftware dot com Last reconfirmed||2024-08-07 Target Milestone|--- |15.0 --- Comment #1 from Roger Sayle --- Doh! This is almost certainly caused

[Bug target/116275] [15 regression] ICE when building llvm-18.1.8 (convert_insn, at config/i386/i386-features.cc:1502)

2024-08-08 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116275 --- Comment #4 from Roger Sayle --- Created attachment 58868 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=58868&action=edit proposed patch Here's my proposed fix (the first of two patches) that resolves the ICE with the testcase. The p

[Bug target/116576] `INT64_MAX` and `-Oz`

2024-09-09 Thread roger at nextmovesoftware dot com via Gcc-bugs
||roger at nextmovesoftware dot com Ever confirmed|0 |1 Last reconfirmed||2024-09-09 --- Comment #2 from Roger Sayle --- The constant ~0 can be materialized on x86 in only three bytes using either of the sequences "pu

[Bug tree-optimization/109424] ~((x > y) ? x : y) produces two not instructions

2023-04-07 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109424 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug rtl-optimization/109476] Missing optimization for 8bit/8bit multiplication / regression

2023-04-12 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109476 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug rtl-optimization/109476] Missing optimization for 8bit/8bit multiplication / regression

2023-04-13 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109476 --- Comment #14 from Roger Sayle --- My apologies for the delay/issues. My bootstrap and regression testing of this patch (on x86_64-pc-linux-gnu) revealed an issue or two (including the reported ICE). My plan was to fix/resolve all these befo

[Bug target/54816] [avr] shift is better than widening mul

2023-04-15 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=54816 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug target/54816] [avr] shift is better than widening mul

2023-04-16 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=54816 Roger Sayle changed: What|Removed |Added Target Milestone|--- |13.0 Resolution|---

[Bug target/66511] [avr] whole-byte shifts not optimized away for uint64_t

2023-04-16 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66511 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug tree-optimization/106888] [RISCV] Negative optimization that excess andi instructions are generated in gcc.dg/pr90838.c

2023-04-21 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106888 --- Comment #5 from Roger Sayle --- Created attachment 54905 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=54905&action=edit proposed patch This patch should fix this problem, by adding another pattern the machine description to also rec

[Bug middle-end/25186] (short)(((int)short_var) <<1) should be folded so that the shift is done in the short type

2023-04-22 Thread roger at nextmovesoftware dot com via Gcc-bugs
|RESOLVED Resolution|--- |FIXED CC||roger at nextmovesoftware dot com Known to work||13.0 --- Comment #10 from Roger Sayle --- This is now fixed (at the tree level) on

[Bug middle-end/19986] [meta-bug] fold missing optimizations (compared to RTL)

2023-04-22 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=19986 Bug 19986 depends on bug 25186, which changed state. Bug 25186 Summary: (short)(((int)short_var) <<1) should be folded so that the shift is done in the short type https://gcc.gnu.org/bugzilla/show_bug.cgi?id=25186 What|Removed

[Bug target/95674] Unnecessary move when doing division-by-multiplication

2023-04-22 Thread roger at nextmovesoftware dot com via Gcc-bugs
|--- |12.0 CC||roger at nextmovesoftware dot com Known to work||12.0 Status|NEW |RESOLVED --- Comment #6 from Roger Sayle --- This (testcase) has now been fixed on mainline

[Bug rtl-optimization/109476] Missing optimization for 8bit/8bit multiplication / regression

2023-04-23 Thread roger at nextmovesoftware dot com via Gcc-bugs
at gcc dot gnu.org |roger at nextmovesoftware dot com --- Comment #17 from Roger Sayle --- I've submitted an improved version of my patch for review: https://gcc.gnu.org/pipermail/gcc-patches/2023-April/616527.html

[Bug target/105991] [12 Regression] rldicl+sldi+add generated instead of rldimi

2023-04-26 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105991 Roger Sayle changed: What|Removed |Added Status|REOPENED|RESOLVED Resolution|---

[Bug rtl-optimization/109476] Missing optimization for 8bit/8bit multiplication / regression

2023-05-07 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109476 Roger Sayle changed: What|Removed |Added Status|ASSIGNED|RESOLVED Target Milestone|---

[Bug middle-end/109766] Passing doubles through the stack generates a stack adjustment per each such argument at -Os/-Oz.

2023-05-08 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109766 Roger Sayle changed: What|Removed |Added Last reconfirmed||2023-05-08 Ever confirmed|0

[Bug middle-end/109840] [14 Regression] internal compiler error: in expand_fn_using_insn, at internal-fn.cc:153 when building graphite2

2023-05-14 Thread roger at nextmovesoftware dot com via Gcc-bugs
at gcc dot gnu.org |roger at nextmovesoftware dot com --- Comment #4 from Roger Sayle --- Doh! The recent popcount(bswap(x)) optimizations shouldn't be changing the width of the popcount, i.e. the convert? extension needs to be re-inserted, it's only the bswap that gets elimina

[Bug target/114544] [x86] stv should transform (subreg DI (V1TI) 8) as (vec_select:DI (V2DI) (const_int 1))

2024-04-07 Thread roger at nextmovesoftware dot com via Gcc-bugs
|UNCONFIRMED |NEW Ever confirmed|0 |1 CC||roger at nextmovesoftware dot com

[Bug tree-optimization/114767] gfortran AVX2 complex multiplication by (0d0,1d0) suboptimal

2024-04-18 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114767 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug tree-optimization/114767] gfortran AVX2 complex multiplication by (0d0,1d0) suboptimal

2024-04-18 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114767 --- Comment #5 from Roger Sayle --- Another interesting (simpler) case of -ffast-math pessimization is: void foo(_Complex double *c) { for (int i=0; i<16; i++) c[i] += __builtin_complex(1.0,0.0); } Again without -ffast-math we vectori

[Bug middle-end/111701] [11/12/13/14 Regression] wrong code for __builtin_signbit(x*x)

2024-04-26 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111701 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug rtl-optimization/97756] [11/12/13 Regression] Inefficient handling of 128-bit arguments

2024-04-26 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=97756 Roger Sayle changed: What|Removed |Added Known to work||14.0 Summary|[11/12/13/14/15 Re

[Bug target/43644] __uint128_t missed optimizations.

2024-04-26 Thread roger at nextmovesoftware dot com via Gcc-bugs
||roger at nextmovesoftware dot com Status|NEW |RESOLVED Target Milestone|--- |14.0 --- Comment #6 from Roger Sayle --- This is now fixed on mainline (for GCC 14 and GCC 15).

[Bug tree-optimization/113673] [12/13/14/15 Regression] ICE: verify_flow_info failed: BB 5 cannot throw but has an EH edge with -Os -finstrument-functions -fnon-call-exceptions -ftrapv

2024-04-26 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113673 Roger Sayle changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |roger at nextmovesoftware dot com

[Bug target/113832] [14/15 Regression] 6% exec time regression of 464.h264ref on aarch64

2024-04-30 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113832 --- Comment #5 from Roger Sayle --- I'm trying to confirm that there are actually widening multiplications in 464.h264ref (on aarch64), but if anyone's already done an analysis of what might be causing these performance swings, please do post (a

[Bug middle-end/85559] [meta-bug] Improve conditional move

2024-05-06 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85559 Bug 85559 depends on bug 78947, which changed state. Bug 78947 Summary: sub-optimal code for (bool)(int ? int : int) https://gcc.gnu.org/bugzilla/show_bug.cgi?id=78947 What|Removed |Added ---

[Bug middle-end/78947] sub-optimal code for (bool)(int ? int : int)

2024-05-06 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=78947 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug rtl-optimization/115021] [14/15 regression] unnecessary spill for vpternlog

2024-05-10 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115021 Roger Sayle changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |roger at nextmovesoftware dot com

[Bug rtl-optimization/115021] [14/15 regression] unnecessary spill for vpternlog

2024-05-10 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115021 --- Comment #2 from Roger Sayle --- Here's a reduced test case that should be unaffected by the pending changes to how V8QI shifts are expanded. Note that the final "t -= t4" is required to convince the register allocator to "spill". typedef s

[Bug target/106060] Inefficient constant broadcast on x86_64

2024-05-12 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106060 Roger Sayle changed: What|Removed |Added Resolution|--- |FIXED Known to work|

[Bug target/115161] [15 Regression] highway-1.0.7 miscompilation of some SSE2 intrinsics

2024-05-20 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115161 Roger Sayle changed: What|Removed |Added Ever confirmed|0 |1 Status|UNCONFIRMED

[Bug target/115351] [14/15 regression] pointless movs when passing by value on x86-64

2024-06-05 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115351 Roger Sayle changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |roger at nextmovesoftware dot com

[Bug target/115351] [14 regression] pointless movs when passing by value on x86-64

2024-06-08 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115351 Roger Sayle changed: What|Removed |Added Assignee|roger at nextmovesoftware dot com |unassigned at gcc dot gnu.org

[Bug target/115397] [15 Regression] ICE 'during RTL pass: split1' on numpy-1.26.4 i686-linux '-fPIC -mavx512f' since r15-1100-gec985bc97a0157

2024-06-08 Thread roger at nextmovesoftware dot com via Gcc-bugs
||2024-06-08 Assignee|unassigned at gcc dot gnu.org |roger at nextmovesoftware dot com Status|UNCONFIRMED |ASSIGNED --- Comment #4 from Roger Sayle --- Created attachment 58386 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=58386&

[Bug bootstrap/111812] [14 regression] Can't build with gcc 4.8.5

2023-10-16 Thread roger at nextmovesoftware dot com via Gcc-bugs
| CC||roger at nextmovesoftware dot com Build|powerpc64-linux-gnu |

[Bug target/110551] [11/12/13/14 Regression] an extra mov when doing 128bit multiply

2023-10-18 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110551 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug rtl-optimization/111267] [14 Regression] Codegen regression from i386 argument passing changes

2023-10-20 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111267 --- Comment #2 from Roger Sayle --- Created attachment 56162 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=56162&action=edit proof-of-concept patch

[Bug rtl-optimization/111267] [14 Regression] Codegen regression from i386 argument passing changes

2023-10-20 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111267 --- Comment #3 from Roger Sayle --- This patch addresses the regression, but probably isn't the correct fix. The issue is that the backend now has a way of representing the concatenation of two registers (for example, TI is constructed for two

[Bug target/112103] [14 regression] gcc.target/powerpc/rlwinm-0.c fails after r14-4941-gd1bb9569d70304

2023-10-26 Thread roger at nextmovesoftware dot com via Gcc-bugs
|NEW CC||roger at nextmovesoftware dot com Last reconfirmed||2023-10-26

[Bug target/112298] Poor code for DImode operations on H8 port

2023-10-30 Thread roger at nextmovesoftware dot com via Gcc-bugs
||roger at nextmovesoftware dot com Ever confirmed|0 |1 Last reconfirmed||2023-10-30

[Bug c++/50755] [avr] ICE: tree check: expected class 'constant', have 'unary' (convert_expr)

2023-11-03 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=50755 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug rtl-optimization/112380] [14 regression] ICE when building Mesa (in combine, internal compiler error: in simplify_subreg) since r14-2526-g8911879415d6c2

2023-11-05 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112380 Roger Sayle changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |roger at nextmovesoftware dot com

[Bug rtl-optimization/91865] Combine misses opportunity to remove (sign_extend (zero_extend)) before searching for insn patterns

2023-11-12 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91865 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug target/110551] [11/12/13 Regression] an extra mov when doing 128bit multiply

2023-11-12 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110551 Roger Sayle changed: What|Removed |Added Target Milestone|11.5|14.0 Resolution|---

[Bug rtl-optimization/112380] [14 regression] ICE when building Mesa (in combine, internal compiler error: in simplify_subreg) since r14-2526-g8911879415d6c2

2023-11-12 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112380 --- Comment #12 from Roger Sayle --- Patch proposed (actually two alternatives proposed) at https://gcc.gnu.org/pipermail/gcc-patches/2023-November/636203.html

[Bug middle-end/109840] [14 Regression] internal compiler error: in expand_fn_using_insn, at internal-fn.cc:153 when building graphite2

2023-05-26 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109840 Roger Sayle changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug target/107812] [11/12/13/14 Regression] RTL SSA forwprop introduced regression since r11-6188

2023-05-29 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107812 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug target/109973] [13/14 Regression] Wrong code for AVX2 since 13.1 by combining VPAND and VPTEST since r13-2006-ga56c1641e9d25e

2023-05-29 Thread roger at nextmovesoftware dot com via Gcc-bugs
||https://gcc.gnu.org/bugzill ||a/show_bug.cgi?id=80040 Assignee|unassigned at gcc dot gnu.org |roger at nextmovesoftware dot com --- Comment #5 from Roger Sayle --- Many thanks to Benji for reporting this issue. I&#x

[Bug c++/110070] New: Code quality regression with for (int i: {1,2,4,6})

2023-06-01 Thread roger at nextmovesoftware dot com via Gcc-bugs
Component: c++ Assignee: unassigned at gcc dot gnu.org Reporter: roger at nextmovesoftware dot com Target Milestone: --- The fix for PR c++/70167 (in GCC 11.3) inadvertently introduced a code quality regression for simple range-for using initializer lists. The motivating example

[Bug target/110083] [14 Regression] ICEs for testcase on fp-int-convert*timode after r14-1466-g3635e8c67e1

2023-06-02 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110083 Roger Sayle changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |roger at nextmovesoftware dot com

[Bug target/110083] [14 Regression] ICEs for testcase on fp-int-convert*timode after r14-1466-g3635e8c67e1

2023-06-02 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110083 --- Comment #2 from Roger Sayle --- Created attachment 55241 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=55241&action=edit proposed patch This patch fixes the problem. Bootstrap and regression tests underway.

[Bug target/110104] gcc produces sub-optimal code for _addcarry_u64 chain

2023-06-04 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110104 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug middle-end/31985] Wide operations (i.e. adddi3) are split too late

2023-06-04 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=31985 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug target/30829] extra register zero extends on x86_64

2023-06-04 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=30829 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug target/110104] gcc produces sub-optimal code for _addcarry_u64 chain

2023-06-08 Thread roger at nextmovesoftware dot com via Gcc-bugs
nextmovesoftware dot com |unassigned at gcc dot gnu.org --- Comment #1 from Roger Sayle --- I proposed a fix at https://gcc.gnu.org/pipermail/gcc-patches/2023-June/620823.html but this was obsoleted by a much more comprehensive patch (for PR79193) proposed by Jakub just an hour earlier: https

[Bug target/110083] [14 Regression] ICEs for testcase on fp-int-convert*timode after r14-1466-g3635e8c67e1

2023-06-09 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110083 Roger Sayle changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug middle-end/109766] Passing doubles through the stack generates a stack adjustment per each such argument at -Os/-Oz.

2023-06-16 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109766 --- Comment #3 from Roger Sayle --- For the record a solution was proposed at https://gcc.gnu.org/pipermail/gcc-patches/2023-May/618197.html but this approach failed review at https://gcc.gnu.org/pipermail/gcc-patches/2023-May/618278.html

[Bug middle-end/88873] missing vectorization for decomposed operations on a vector type

2023-06-21 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88873 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug target/78904] zero-extracts are not effective

2023-06-23 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=78904 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug middle-end/31985] Wide operations (i.e. adddi3) are split too late

2023-06-30 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=31985 Roger Sayle changed: What|Removed |Added Target Milestone|--- |14.0 Resolution|---

[Bug middle-end/52070] missing integer comparison optimization

2023-07-01 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=52070 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug target/109973] [13 Regression] Wrong code for AVX2 since 13.1 by combining VPAND and VPTEST since r13-2006-ga56c1641e9d25e

2023-07-03 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109973 Roger Sayle changed: What|Removed |Added Status|ASSIGNED|NEW Summary|[13/14 Regression]

[Bug target/110533] [x86-64] naked with -O0 and register-passed struct/int128 clobbers parameters/callee-saved regs

2023-07-06 Thread roger at nextmovesoftware dot com via Gcc-bugs
|1 CC||roger at nextmovesoftware dot com Status|UNCONFIRMED |NEW --- Comment #3 from Roger Sayle --- The patch recently proposed at https://gcc.gnu.org/pipermail/gcc-patches/2023-July/623756.html would

[Bug tree-optimization/110362] Range information on lower bytes of __uint128_t

2023-07-06 Thread roger at nextmovesoftware dot com via Gcc-bugs
||roger at nextmovesoftware dot com --- Comment #2 from Roger Sayle --- The good news is that this has been fixed in the RTL optimizers/x86 backend, and GCC-14 currently produces the optimal "mov rax, rdx". However, I agree with Richard Biener that could/shou

[Bug target/91681] Missed optimization for 128 bit arithmetic operations

2023-07-07 Thread roger at nextmovesoftware dot com via Gcc-bugs
gcc dot gnu.org |roger at nextmovesoftware dot com CC||roger at nextmovesoftware dot com --- Comment #4 from Roger Sayle --- Advance warning, the testcase pr91681-1.c will start FAILing (temporarily) due to changes/improvements in __int128

[Bug target/82580] Optimize comparisons for __int128 on x86-64

2023-07-07 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82580 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug target/110588] btl (on x86_64) not always generated

2023-07-08 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110588 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug target/110598] [14 Regression] wrong code on llvm-14.0.6 due to memcmp being miscompiled

2023-07-09 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110598 Roger Sayle changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |roger at nextmovesoftware dot com

[Bug target/110598] [14 Regression] wrong code on llvm-14.0.6 due to memcmp being miscompiled

2023-07-12 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110598 Roger Sayle changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug target/110649] [14 Regression] 25% sphinx3 spec2006 regression on Ice Lake and zen between g:acaa441a98bebc52 (2023-07-06 11:36) and g:55900189ab517906 (2023-07-07 00:23)

2023-07-17 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110649 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug rtl-optimization/110587] [14 regression] 96% pr28071.c compile time regression since r14-2337-g37a231cc7594d1

2023-07-17 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110587 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug rtl-optimization/110587] [14 regression] 96% pr28071.c compile time regression since r14-2337-g37a231cc7594d1

2023-07-17 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110587 Roger Sayle changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |roger at nextmovesoftware dot com

[Bug c/106264] [10/11/12/13 Regression] spurious -Wunused-value on a folded frexp, modf, and remquo calls with unused result since r9-1295-g781ff3d80e88d7d0

2023-07-18 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106264 --- Comment #9 from Roger Sayle --- *** Bug 101090 has been marked as a duplicate of this bug. ***

[Bug c/101090] incorrect -Wunused-value warning on remquo with constant values

2023-07-18 Thread roger at nextmovesoftware dot com via Gcc-bugs
||roger at nextmovesoftware dot com Status|NEW |RESOLVED --- Comment #4 from Roger Sayle --- Many thanks to Vincent for spotting/confirming that his bug report is a duplicate of PR 106264, which was fixed in GCC 13. *** This bug has been

[Bug c/89180] [meta-bug] bogus/missing -Wunused warnings

2023-07-18 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89180 Bug 89180 depends on bug 101090, which changed state. Bug 101090 Summary: incorrect -Wunused-value warning on remquo with constant values https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101090 What|Removed |Added

[Bug rtl-optimization/110701] [14 Regression] Wrong code at -O1/2/3/s on x86_64-linux-gnu

2023-07-18 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110701 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug rtl-optimization/110701] [14 Regression] Wrong code at -O1/2/3/s on x86_64-linux-gnu

2023-07-18 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110701 Roger Sayle changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |roger at nextmovesoftware dot com

[Bug target/112992] New: Inefficient vector initialization using vec_duplicate/broadcast

2023-12-12 Thread roger at nextmovesoftware dot com via Gcc-bugs
Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: roger at nextmovesoftware dot com Target Milestone: --- The following four functions should in theory all produce the same code: typedef unsigned long long v4di __attribute

[Bug rtl-optimization/112380] [14 regression] ICE when building Mesa (in combine, internal compiler error: in simplify_subreg) since r14-2526-g8911879415d6c2

2023-12-16 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112380 Roger Sayle changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug rtl-optimization/104914] [MIPS] wrong comparison with scrabbled int value

2023-12-24 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104914 --- Comment #17 from Roger Sayle --- I think this patch might resolve the problem (or move it somewhere else): diff --git a/gcc/expr.cc b/gcc/expr.cc index 9fef2bf6585..218bca905f5 100644 --- a/gcc/expr.cc +++ b/gcc/expr.cc @@ -6274,10 +6274,7

[Bug rtl-optimization/104914] [MIPS] wrong comparison with scrabbled int value

2023-12-24 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104914 --- Comment #18 from Roger Sayle --- Please ignore comment #17, the above patch is completely bogus/broken.

[Bug rtl-optimization/104914] [MIPS] wrong comparison with scrabbled int value

2023-12-24 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104914 --- Comment #19 from Roger Sayle --- Created attachment 56930 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=56930&action=edit proposed patch And now for a patch that does (or should) work. This even contains an optimization, we middle-e

[Bug target/113231] x86_64 uses SSE instructions for `*mem <<= const` at -Os

2024-01-04 Thread roger at nextmovesoftware dot com via Gcc-bugs
at gcc dot gnu.org |roger at nextmovesoftware dot com --- Comment #4 from Roger Sayle --- I'm testing a patch, for more accurate conversion gains/costs in the scalar-to-vector pass. Adding -mno-stv will work around the problem.

[Bug target/113231] x86_64 uses SSE instructions for `*mem <<= const` at -Os

2024-01-09 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113231 Roger Sayle changed: What|Removed |Added Resolution|--- |FIXED Target Milestone|---

[Bug other/113336] New: libatomic (testsuite) regressions on armv6-linux-gnueabihf

2024-01-11 Thread roger at nextmovesoftware dot com via Gcc-bugs
Priority: P3 Component: other Assignee: unassigned at gcc dot gnu.org Reporter: roger at nextmovesoftware dot com Target Milestone: --- As suggested by Richard Earnshaw, this opens a bugzilla PR for tracking this issue. All the tests in libatomic currently fail on a

[Bug rtl-optimization/111267] [14 Regression] Codegen regression from i386 argument passing changes

2024-01-12 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111267 --- Comment #6 from Roger Sayle --- Sorry for the delay in replying/answering Jakub's questions/comments. Yes, using a define_insn_and_split in the backend fixes/works around the issue (and I agree your implementation/refinement in comment #5 i

[Bug rtl-optimization/111267] [14 Regression] Codegen regression from i386 argument passing changes

2024-01-13 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111267 --- Comment #7 from Roger Sayle --- Very many thanks to Jeff Law for pointing me to fwprop. The following simple patch also fixes this regression. diff --git a/gcc/fwprop.cc b/gcc/fwprop.cc index 0c588f8..cbba44e 100644 --- a/gcc/fwprop.cc +++

[Bug target/112992] Inefficient vector initialization using vec_duplicate/broadcast

2024-01-14 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112992 Roger Sayle changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---

[Bug target/106060] Inefficient constant broadcast on x86_64

2024-01-14 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106060 Roger Sayle changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |roger at nextmovesoftware dot com

[Bug rtl-optimization/111267] [14 Regression] Codegen regression from i386 argument passing changes

2024-01-14 Thread roger at nextmovesoftware dot com via Gcc-bugs
at gcc dot gnu.org |roger at nextmovesoftware dot com --- Comment #8 from Roger Sayle --- Now we're in stage4, I'll take this. I'm bootstrapping and regression testing a variant of my tweak to try_fwprop_subst_pattern. The change in comment #7 can hang loop inde

[Bug other/113336] libatomic (testsuite) regressions on armv6-linux-gnueabihf

2024-01-14 Thread roger at nextmovesoftware dot com via Gcc-bugs
|UNCONFIRMED |ASSIGNED Ever confirmed|0 |1 Assignee|unassigned at gcc dot gnu.org |roger at nextmovesoftware dot com --- Comment #1 from Roger Sayle --- As there's a patch for this regression (awaiting review), I should upgrade the PR stat

[Bug rtl-optimization/111267] [14 Regression] Codegen regression from i386 argument passing changes

2024-01-15 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111267 --- Comment #10 from Roger Sayle --- A revised and improved patch has been posted for review at https://gcc.gnu.org/pipermail/gcc-patches/2024-January/643062.html

[Bug target/91681] Missed optimization for 128 bit arithmetic operations

2024-01-21 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91681 Roger Sayle changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug rtl-optimization/113533] [14 Regression] Code generation regression after change for pr111267

2024-01-22 Thread roger at nextmovesoftware dot com via Gcc-bugs
|UNCONFIRMED |NEW CC||roger at nextmovesoftware dot com Ever confirmed|0 |1 --- Comment #6 from Roger Sayle --- To help diagnose the problem, I came up with this simple patch: diff --git a/gcc/fwprop.cc b/gcc

[Bug rtl-optimization/113542] New: gcc.target/arm/bics_3.c regression after change for pr111267

2024-01-22 Thread roger at nextmovesoftware dot com via Gcc-bugs
Priority: P3 Component: rtl-optimization Assignee: unassigned at gcc dot gnu.org Reporter: roger at nextmovesoftware dot com Target Milestone: --- This patch is a placeholder for tracking the reported failures of FAIL: gcc.target/arm/bics_3.c scan-assembler-times

[Bug rtl-optimization/113533] [14 Regression] Code generation regression after change for pr111267

2024-01-22 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113533 --- Comment #8 from Roger Sayle --- Created attachment 57190 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=57190&action=edit proposed patch Proposed patch to provide a sane/saner set of rtx_costs for SH. There's plenty more that could b

[Bug rtl-optimization/113533] [14 Regression] Code generation regression after change for pr111267

2024-01-22 Thread roger at nextmovesoftware dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113533 --- Comment #10 from Roger Sayle --- Hi Oleg. Great question. The "speed" parameter passed to rtx_costs, and address_cost indicates whether the middle-end is optimizing for peformance, and interested in the nummber of cycles taken by each inst

<    1   2   3   4   5   >