from:"roger at nextmovesoftware dot com"

[Bug target/120601] New: Add support for nvptx's adc, add.cc and friends via uaddc{si,di}5 optabs

2025-06-09 Thread roger at nextmovesoftware dot com via Gcc-bugs

erity: enhancement Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: roger at nextmovesoftware dot com Target Milestone: --- Target: nvptx-*-* nvptx supports instructions for integer addition and subtraction optio

[Bug tree-optimization/120296] Missed VRP with __builtin_popcount() when bitshift with __builtin_{ctzg, clzg}

2025-05-19 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120296 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug target/115024] [14/15 regression] 128 bit division performance regression, x86, between gcc-14 and gcc-13 using target clones on skylake platform

2025-03-07 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115024 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug middle-end/118608] [14/15 regression][mips64] Lack of sign extension with -Os/-O1 after r14-6915

2025-02-06 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=118608 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug rtl-optimization/117012] [15 Regression] incorrect RTL simplification around vector AND and shifts

2024-10-08 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117012 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug target/116576] `INT64_MAX` and `-Oz`

2024-09-09 Thread roger at nextmovesoftware dot com via Gcc-bugs

||roger at nextmovesoftware dot com Ever confirmed|0 |1 Last reconfirmed||2024-09-09 --- Comment #2 from Roger Sayle --- The constant ~0 can be materialized on x86 in only three bytes using either of the sequences "pu

[Bug target/116275] [15 regression] ICE when building llvm-18.1.8 (convert_insn, at config/i386/i386-features.cc:1502)

2024-08-08 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116275 --- Comment #4 from Roger Sayle --- Created attachment 58868 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=58868&action=edit proposed patch Here's my proposed fix (the first of two patches) that resolves the ICE with the testcase. The p

[Bug target/116275] [15 regression] ICE when building llvm-18.1.8 (convert_insn, at config/i386/i386-features.cc:1502)

2024-08-07 Thread roger at nextmovesoftware dot com via Gcc-bugs

|1 Assignee|unassigned at gcc dot gnu.org |roger at nextmovesoftware dot com Last reconfirmed||2024-08-07 Target Milestone|--- |15.0 --- Comment #1 from Roger Sayle --- Doh! This is almost certainly caused

[Bug middle-end/114661] Bit operations not optimized to multiplication

2024-07-18 Thread roger at nextmovesoftware dot com via Gcc-bugs

|RESOLVED CC||roger at nextmovesoftware dot com Resolution|--- |FIXED Known to work||15.0 --- Comment #4 from Roger Sayle --- This should now be fixed/implemented on

[Bug target/115751] [15 Regression] ICE building 521.wrf_r

2024-07-17 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115751 Roger Sayle changed: What|Removed |Added Known to work||15.0 Resolution|---

[Bug target/115749] Non optimal assembly for integer modulo by a constant on x86-64 CPUs

2024-07-16 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115749 --- Comment #12 from Roger Sayle --- I owe Kim an apology. It does appear that modern x86_64 processors perform (many) multiplications faster than the latencies given in the Intel/AMD/Agner Fog documentation.

[Bug target/115756] default tuning for x86_64 produces shifts for `*240`

2024-07-02 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115756 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug target/115751] ICE building 521.wrf_r

2024-07-02 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115751 --- Comment #4 from Roger Sayle --- Created attachment 58567 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=58567&action=edit proposed patch Here's my proposed patch.

[Bug target/115751] ICE building 521.wrf_r

2024-07-02 Thread roger at nextmovesoftware dot com via Gcc-bugs

|1 Assignee|unassigned at gcc dot gnu.org |roger at nextmovesoftware dot com CC||roger at nextmovesoftware dot com Status|UNCONFIRMED |ASSIGNED --- Comment #3 from Roger Sayle --- Doh

[Bug tree-optimization/113673] [12/13/14 Regression] ICE: verify_flow_info failed: BB 5 cannot throw but has an EH edge with -Os -finstrument-functions -fnon-call-exceptions -ftrapv

2024-06-26 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113673 Roger Sayle changed: What|Removed |Added Assignee|roger at nextmovesoftware dot com |unassigned at gcc dot gnu.org

[Bug c/109618] [12/13/14/15 Regression] ICE: tree check: expected class ‘type’, have ‘exceptional’ (error_mark) in generic_simplify_CONVERT_EXPR, at generic-match.cc since r12-3278-g823685221de986

2024-06-23 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109618 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug rtl-optimization/115565] [11/12/13/14/15 Regression] CSE: Comparison incorrectly evaluated as constant causing optimization to produce wrong code

2024-06-20 Thread roger at nextmovesoftware dot com via Gcc-bugs

|UNCONFIRMED |NEW CC||roger at nextmovesoftware dot com Ever confirmed|0 |1 --- Comment #3 from Roger Sayle --- Doh! I hadn't noticed (twenty years ago) that -1 was used to represent an invalid quantity numb

[Bug tree-optimization/115489] [12/13/14/15 regression] ICE: tree check: expected class 'type', have 'exceptional' (error_mark) in create_tmp_from_val, at gimplify.cc:589 since r12-3278-g823685221de98

2024-06-17 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115489 Roger Sayle changed: What|Removed |Added Component|c |tree-optimization --- Comment #3 from Rog

[Bug rtl-optimization/115021] [14 regression] unnecessary spill for vpternlog

2024-06-17 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115021 Roger Sayle changed: What|Removed |Added Summary|[14/15 regression] |[14 regression] unnecessary

[Bug target/115397] [15 Regression] ICE 'during RTL pass: split1' on numpy-1.26.4 i686-linux '-fPIC -mavx512f' since r15-1100-gec985bc97a0157

2024-06-17 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115397 Roger Sayle changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug target/115478] [15 Regression] gcc.target/aarch64/bitint-args.c fails

2024-06-13 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115478 --- Comment #3 from Roger Sayle --- Hi Jeff, many thanks for looking into this/assigning the PR to yourself. I'd suggest that the fix is to add a define_code_iterator to aarch64.md called any_or_plus matching the definition in i386.md. (define_c

[Bug target/115397] [15 Regression] ICE 'during RTL pass: split1' on numpy-1.26.4 i686-linux '-fPIC -mavx512f' since r15-1100-gec985bc97a0157

2024-06-08 Thread roger at nextmovesoftware dot com via Gcc-bugs

||2024-06-08 Assignee|unassigned at gcc dot gnu.org |roger at nextmovesoftware dot com Status|UNCONFIRMED |ASSIGNED --- Comment #4 from Roger Sayle --- Created attachment 58386 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=58386&

[Bug target/115351] [14 regression] pointless movs when passing by value on x86-64

2024-06-08 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115351 Roger Sayle changed: What|Removed |Added Assignee|roger at nextmovesoftware dot com |unassigned at gcc dot gnu.org

[Bug target/115351] [14/15 regression] pointless movs when passing by value on x86-64

2024-06-05 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115351 Roger Sayle changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |roger at nextmovesoftware dot com

[Bug target/115161] [15 Regression] highway-1.0.7 miscompilation of some SSE2 intrinsics

2024-05-20 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115161 Roger Sayle changed: What|Removed |Added Ever confirmed|0 |1 Status|UNCONFIRMED

[Bug target/106060] Inefficient constant broadcast on x86_64

2024-05-12 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106060 Roger Sayle changed: What|Removed |Added Resolution|--- |FIXED Known to work|

[Bug rtl-optimization/115021] [14/15 regression] unnecessary spill for vpternlog

2024-05-10 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115021 --- Comment #2 from Roger Sayle --- Here's a reduced test case that should be unaffected by the pending changes to how V8QI shifts are expanded. Note that the final "t -= t4" is required to convince the register allocator to "spill". typedef s

[Bug rtl-optimization/115021] [14/15 regression] unnecessary spill for vpternlog

2024-05-10 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115021 Roger Sayle changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |roger at nextmovesoftware dot com

[Bug middle-end/78947] sub-optimal code for (bool)(int ? int : int)

2024-05-06 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=78947 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug middle-end/85559] [meta-bug] Improve conditional move

2024-05-06 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85559 Bug 85559 depends on bug 78947, which changed state. Bug 78947 Summary: sub-optimal code for (bool)(int ? int : int) https://gcc.gnu.org/bugzilla/show_bug.cgi?id=78947 What|Removed |Added ---

[Bug target/113832] [14/15 Regression] 6% exec time regression of 464.h264ref on aarch64

2024-04-30 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113832 --- Comment #5 from Roger Sayle --- I'm trying to confirm that there are actually widening multiplications in 464.h264ref (on aarch64), but if anyone's already done an analysis of what might be causing these performance swings, please do post (a

[Bug tree-optimization/113673] [12/13/14/15 Regression] ICE: verify_flow_info failed: BB 5 cannot throw but has an EH edge with -Os -finstrument-functions -fnon-call-exceptions -ftrapv

2024-04-26 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113673 Roger Sayle changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |roger at nextmovesoftware dot com

[Bug target/43644] __uint128_t missed optimizations.

2024-04-26 Thread roger at nextmovesoftware dot com via Gcc-bugs

||roger at nextmovesoftware dot com Status|NEW |RESOLVED Target Milestone|--- |14.0 --- Comment #6 from Roger Sayle --- This is now fixed on mainline (for GCC 14 and GCC 15).

[Bug rtl-optimization/97756] [11/12/13 Regression] Inefficient handling of 128-bit arguments

2024-04-26 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=97756 Roger Sayle changed: What|Removed |Added Known to work||14.0 Summary|[11/12/13/14/15 Re

[Bug middle-end/111701] [11/12/13/14 Regression] wrong code for __builtin_signbit(x*x)

2024-04-26 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111701 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug tree-optimization/114767] gfortran AVX2 complex multiplication by (0d0,1d0) suboptimal

2024-04-18 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114767 --- Comment #5 from Roger Sayle --- Another interesting (simpler) case of -ffast-math pessimization is: void foo(_Complex double *c) { for (int i=0; i<16; i++) c[i] += __builtin_complex(1.0,0.0); } Again without -ffast-math we vectori

[Bug tree-optimization/114767] gfortran AVX2 complex multiplication by (0d0,1d0) suboptimal

2024-04-18 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114767 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug target/114544] [x86] stv should transform (subreg DI (V1TI) 8) as (vec_select:DI (V2DI) (const_int 1))

2024-04-07 Thread roger at nextmovesoftware dot com via Gcc-bugs

|UNCONFIRMED |NEW Ever confirmed|0 |1 CC||roger at nextmovesoftware dot com

[Bug middle-end/114552] [13/14 Regression] wrong code at -O1 and above on x86_64-linux-gnu since r13-990

2024-04-02 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114552 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug target/114284] [14 Regression] arm: Load of volatile short gets miscompiled (loaded twice) since r14-8319

2024-03-09 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114284 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug target/114187] [14 regression] bizarre register dance on x86_64 for pass-by-value struct since r14-2526

2024-03-01 Thread roger at nextmovesoftware dot com via Gcc-bugs

at gcc dot gnu.org |roger at nextmovesoftware dot com --- Comment #4 from Roger Sayle --- Created attachment 57587 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=57587&action=edit proposed patch Proposed fix attached. Currently bootstrapping and regression testing. The prob

[Bug target/114187] [14 regression] bizarre register dance on x86_64 for pass-by-value struct since r14-2526

2024-03-01 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114187 Roger Sayle changed: What|Removed |Added Last reconfirmed||2024-03-01 Status|UNCONFIRMED

[Bug other/113336] [14 Regression] libatomic (testsuite) regressions on arm

2024-02-17 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113336 Roger Sayle changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug target/106060] Inefficient constant broadcast on x86_64

2024-02-16 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106060 Roger Sayle changed: What|Removed |Added Target Milestone|--- |15.0 --- Comment #5 from Roger Sayle ---

[Bug rtl-optimization/111267] [14 Regression] Codegen regression from i386 argument passing changes

2024-02-16 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111267 Roger Sayle changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug target/113690] [13 Regression] ICE: in as_a, at machmode.h:381 with -O2 -fno-dce -fno-forward-propagate -fno-split-wide-types -funroll-loops

2024-02-16 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113690 Roger Sayle changed: What|Removed |Added Summary|[13/14 Regression] ICE: in |[13 Regression] ICE: in

[Bug tree-optimization/112508] [14 Regression] Size regression when using -Os starting with r14-4089-gd45ddc2c04e

2024-02-15 Thread roger at nextmovesoftware dot com via Gcc-bugs

|1 CC||roger at nextmovesoftware dot com Last reconfirmed||2024-02-15 --- Comment #2 from Roger Sayle --- The issue appears to be with (poor costing in) loop invariant store motion. Adding the command line

[Bug target/113764] [X86] __builtin_clz generates lzcnt when bsr is sufficient

2024-02-11 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113764 Roger Sayle changed: What|Removed |Added Summary|[X86] Generates lzcnt when |[X86] __builtin_clz |bs

[Bug target/113764] [X86] Generates lzcnt when bsr is sufficient

2024-02-09 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113764 --- Comment #2 from Roger Sayle --- Investigating further, the thinking behind GCC's current behaviour can be found in Agner Fog's instruction tables; on many architectures BSR is much slower than LZCNT. Legacy AMD: BSR=4 cycles, LZCNT=2

[Bug tree-optimization/113673] [12/13/14 Regression] ICE: verify_flow_info failed: BB 5 cannot throw but has an EH edge with -Os -finstrument-functions -fnon-call-exceptions -ftrapv

2024-02-08 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113673 --- Comment #4 from Roger Sayle --- The identified patch implements += the same way as |=. Presumably a version of the test case replacing "m += *data++;" with "m |= *data++;" would be more useful at identifying a patch that actually changed EH

[Bug target/113832] [14 Regression] 6% exec time regression of 464.h264ref on aarch64

2024-02-08 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113832 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug target/113764] [X86] Generates lzcnt when bsr is sufficient

2024-02-07 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113764 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug tree-optimization/113759] [14 regression] ICE when building fdk-aac-2.0.3 since r14-8680

2024-02-06 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113759 --- Comment #9 from Roger Sayle --- Many thanks Jakub. Sorry again for the inconvenience.

[Bug target/113720] [14 Regression] internal compiler error: in extract_insn, at recog.cc:2812 targeting alpha-linux-gnu

2024-02-02 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113720 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug target/113690] [13/14 Regression] ICE: in as_a, at machmode.h:381 with -O2 -fno-dce -fno-forward-propagate -fno-split-wide-types -funroll-loops

2024-02-01 Thread roger at nextmovesoftware dot com via Gcc-bugs

at gcc dot gnu.org |roger at nextmovesoftware dot com --- Comment #4 from Roger Sayle --- I'm bootstrapping and regression testing a fix.

[Bug target/113701] Issues with __int128 argument passing

2024-02-01 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113701 Roger Sayle changed: What|Removed |Added See Also||https://gcc.gnu.org/bugzill

[Bug other/113336] [14 Regression] libatomic (testsuite) regressions on arm

2024-01-28 Thread roger at nextmovesoftware dot com via Gcc-bugs

at gcc dot gnu.org |roger at nextmovesoftware dot com Target Milestone|--- |14.0 --- Comment #7 from Roger Sayle --- A revised patch has been posted for review/approval to gcc-patches: https://gcc.gnu.org/pipermail/gcc-patches/2024-January/644147.html

[Bug target/113560] Strange code generated when optimizing a multiplication on x86_64

2024-01-28 Thread roger at nextmovesoftware dot com via Gcc-bugs

at gcc dot gnu.org |roger at nextmovesoftware dot com --- Comment #7 from Roger Sayle --- I'm bootstrapping and regression testing a patch.

[Bug rtl-optimization/113533] [14 Regression] Code generation regression after change for pr111267

2024-01-27 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113533 --- Comment #14 from Roger Sayle --- My apologies for not keeping folks updated on my thinking. Following Oleg's feedback, I've decided to slim down my proposed fix to the bare minimum, and postpone the other rtx_costs improvements until GCC 15

[Bug rtl-optimization/113533] [14 Regression] Code generation regression after change for pr111267

2024-01-26 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113533 Roger Sayle changed: What|Removed |Added See Also||https://gcc.gnu.org/bugzill

[Bug other/113336] libatomic (testsuite) regressions on arm

2024-01-25 Thread roger at nextmovesoftware dot com via Gcc-bugs

nextmovesoftware dot com |unassigned at gcc dot gnu.org Summary|libatomic (testsuite) |libatomic (testsuite) |regressions on |regressions on arm |armv6-linux-gnueabihf | --- Comment #4 from Roger Sayle --- Hi Victor, Yes, I agree

[Bug target/113560] Strange code generated when optimizing a multiplication on x86_64

2024-01-24 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113560 --- Comment #6 from Roger Sayle --- In the .optimized dump, we have: __int128 unsigned __res; __int128 unsigned _12; ... __res_11 = in_2(D) w* 184467440738; _12 = __res_11 & 18446744073709551615; __res_7 = _12 * 100; So the first mu

[Bug target/113560] Strange code generated when optimizing a multiplication on x86_64

2024-01-24 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113560 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug rtl-optimization/113533] [14 Regression] Code generation regression after change for pr111267

2024-01-22 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113533 --- Comment #10 from Roger Sayle --- Hi Oleg. Great question. The "speed" parameter passed to rtx_costs, and address_cost indicates whether the middle-end is optimizing for peformance, and interested in the nummber of cycles taken by each inst

[Bug rtl-optimization/113533] [14 Regression] Code generation regression after change for pr111267

2024-01-22 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113533 --- Comment #8 from Roger Sayle --- Created attachment 57190 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=57190&action=edit proposed patch Proposed patch to provide a sane/saner set of rtx_costs for SH. There's plenty more that could b

[Bug rtl-optimization/113542] New: gcc.target/arm/bics_3.c regression after change for pr111267

2024-01-22 Thread roger at nextmovesoftware dot com via Gcc-bugs

Priority: P3 Component: rtl-optimization Assignee: unassigned at gcc dot gnu.org Reporter: roger at nextmovesoftware dot com Target Milestone: --- This patch is a placeholder for tracking the reported failures of FAIL: gcc.target/arm/bics_3.c scan-assembler-times

[Bug rtl-optimization/113533] [14 Regression] Code generation regression after change for pr111267

2024-01-22 Thread roger at nextmovesoftware dot com via Gcc-bugs

|UNCONFIRMED |NEW CC||roger at nextmovesoftware dot com Ever confirmed|0 |1 --- Comment #6 from Roger Sayle --- To help diagnose the problem, I came up with this simple patch: diff --git a/gcc/fwprop.cc b/gcc

[Bug target/91681] Missed optimization for 128 bit arithmetic operations

2024-01-21 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91681 Roger Sayle changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug rtl-optimization/111267] [14 Regression] Codegen regression from i386 argument passing changes

2024-01-15 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111267 --- Comment #10 from Roger Sayle --- A revised and improved patch has been posted for review at https://gcc.gnu.org/pipermail/gcc-patches/2024-January/643062.html

[Bug other/113336] libatomic (testsuite) regressions on armv6-linux-gnueabihf

2024-01-14 Thread roger at nextmovesoftware dot com via Gcc-bugs

|UNCONFIRMED |ASSIGNED Ever confirmed|0 |1 Assignee|unassigned at gcc dot gnu.org |roger at nextmovesoftware dot com --- Comment #1 from Roger Sayle --- As there's a patch for this regression (awaiting review), I should upgrade the PR stat

[Bug rtl-optimization/111267] [14 Regression] Codegen regression from i386 argument passing changes

2024-01-14 Thread roger at nextmovesoftware dot com via Gcc-bugs

at gcc dot gnu.org |roger at nextmovesoftware dot com --- Comment #8 from Roger Sayle --- Now we're in stage4, I'll take this. I'm bootstrapping and regression testing a variant of my tweak to try_fwprop_subst_pattern. The change in comment #7 can hang loop inde

[Bug target/106060] Inefficient constant broadcast on x86_64

2024-01-14 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106060 Roger Sayle changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |roger at nextmovesoftware dot com

[Bug target/112992] Inefficient vector initialization using vec_duplicate/broadcast

2024-01-14 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112992 Roger Sayle changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---

[Bug rtl-optimization/111267] [14 Regression] Codegen regression from i386 argument passing changes

2024-01-13 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111267 --- Comment #7 from Roger Sayle --- Very many thanks to Jeff Law for pointing me to fwprop. The following simple patch also fixes this regression. diff --git a/gcc/fwprop.cc b/gcc/fwprop.cc index 0c588f8..cbba44e 100644 --- a/gcc/fwprop.cc +++

[Bug rtl-optimization/111267] [14 Regression] Codegen regression from i386 argument passing changes

2024-01-12 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111267 --- Comment #6 from Roger Sayle --- Sorry for the delay in replying/answering Jakub's questions/comments. Yes, using a define_insn_and_split in the backend fixes/works around the issue (and I agree your implementation/refinement in comment #5 i

[Bug other/113336] New: libatomic (testsuite) regressions on armv6-linux-gnueabihf

2024-01-11 Thread roger at nextmovesoftware dot com via Gcc-bugs

Priority: P3 Component: other Assignee: unassigned at gcc dot gnu.org Reporter: roger at nextmovesoftware dot com Target Milestone: --- As suggested by Richard Earnshaw, this opens a bugzilla PR for tracking this issue. All the tests in libatomic currently fail on a

[Bug target/113231] x86_64 uses SSE instructions for `*mem <<= const` at -Os

2024-01-09 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113231 Roger Sayle changed: What|Removed |Added Resolution|--- |FIXED Target Milestone|---

[Bug target/113231] x86_64 uses SSE instructions for `*mem <<= const` at -Os

2024-01-04 Thread roger at nextmovesoftware dot com via Gcc-bugs

at gcc dot gnu.org |roger at nextmovesoftware dot com --- Comment #4 from Roger Sayle --- I'm testing a patch, for more accurate conversion gains/costs in the scalar-to-vector pass. Adding -mno-stv will work around the problem.

[Bug rtl-optimization/104914] [MIPS] wrong comparison with scrabbled int value

2023-12-24 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104914 --- Comment #19 from Roger Sayle --- Created attachment 56930 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=56930&action=edit proposed patch And now for a patch that does (or should) work. This even contains an optimization, we middle-e

[Bug rtl-optimization/104914] [MIPS] wrong comparison with scrabbled int value

2023-12-24 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104914 --- Comment #18 from Roger Sayle --- Please ignore comment #17, the above patch is completely bogus/broken.

[Bug rtl-optimization/104914] [MIPS] wrong comparison with scrabbled int value

2023-12-24 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104914 --- Comment #17 from Roger Sayle --- I think this patch might resolve the problem (or move it somewhere else): diff --git a/gcc/expr.cc b/gcc/expr.cc index 9fef2bf6585..218bca905f5 100644 --- a/gcc/expr.cc +++ b/gcc/expr.cc @@ -6274,10 +6274,7

[Bug rtl-optimization/112380] [14 regression] ICE when building Mesa (in combine, internal compiler error: in simplify_subreg) since r14-2526-g8911879415d6c2

2023-12-16 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112380 Roger Sayle changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug target/112992] New: Inefficient vector initialization using vec_duplicate/broadcast

2023-12-12 Thread roger at nextmovesoftware dot com via Gcc-bugs

Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: roger at nextmovesoftware dot com Target Milestone: --- The following four functions should in theory all produce the same code: typedef unsigned long long v4di __attribute

[Bug rtl-optimization/112380] [14 regression] ICE when building Mesa (in combine, internal compiler error: in simplify_subreg) since r14-2526-g8911879415d6c2

2023-11-12 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112380 --- Comment #12 from Roger Sayle --- Patch proposed (actually two alternatives proposed) at https://gcc.gnu.org/pipermail/gcc-patches/2023-November/636203.html

[Bug target/110551] [11/12/13 Regression] an extra mov when doing 128bit multiply

2023-11-12 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110551 Roger Sayle changed: What|Removed |Added Target Milestone|11.5|14.0 Resolution|---

[Bug rtl-optimization/91865] Combine misses opportunity to remove (sign_extend (zero_extend)) before searching for insn patterns

2023-11-12 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91865 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug rtl-optimization/112380] [14 regression] ICE when building Mesa (in combine, internal compiler error: in simplify_subreg) since r14-2526-g8911879415d6c2

2023-11-05 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112380 Roger Sayle changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |roger at nextmovesoftware dot com

[Bug c++/50755] [avr] ICE: tree check: expected class 'constant', have 'unary' (convert_expr)

2023-11-03 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=50755 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug target/112298] Poor code for DImode operations on H8 port

2023-10-30 Thread roger at nextmovesoftware dot com via Gcc-bugs

||roger at nextmovesoftware dot com Ever confirmed|0 |1 Last reconfirmed||2023-10-30

[Bug target/112103] [14 regression] gcc.target/powerpc/rlwinm-0.c fails after r14-4941-gd1bb9569d70304

2023-10-26 Thread roger at nextmovesoftware dot com via Gcc-bugs

|NEW CC||roger at nextmovesoftware dot com Last reconfirmed||2023-10-26

[Bug rtl-optimization/111267] [14 Regression] Codegen regression from i386 argument passing changes

2023-10-20 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111267 --- Comment #3 from Roger Sayle --- This patch addresses the regression, but probably isn't the correct fix. The issue is that the backend now has a way of representing the concatenation of two registers (for example, TI is constructed for two

[Bug rtl-optimization/111267] [14 Regression] Codegen regression from i386 argument passing changes

2023-10-20 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111267 --- Comment #2 from Roger Sayle --- Created attachment 56162 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=56162&action=edit proof-of-concept patch

[Bug target/110551] [11/12/13/14 Regression] an extra mov when doing 128bit multiply

2023-10-18 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110551 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug bootstrap/111812] [14 regression] Can't build with gcc 4.8.5

2023-10-16 Thread roger at nextmovesoftware dot com via Gcc-bugs

| CC||roger at nextmovesoftware dot com Build|powerpc64-linux-gnu |

[Bug rtl-optimization/110701] [14 Regression] Wrong code at -O1/2/3/s on x86_64-linux-gnu

2023-10-11 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110701 Roger Sayle changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug middle-end/17886] variable rotate and unsigned long long rotate should be better optimized

2023-10-10 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=17886 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug tree-optimization/111519] [13/14 Regression] Wrong code at -O3 on x86_64-linux-gnu since r13-455-g1fe04c497d

2023-10-09 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111519 --- Comment #2 from Roger Sayle --- Complicated. Things have gone wrong before the strlen pass which is given: _73 = e; _72 = *_73; ... *_73 = prephitmp_23; d = _72; Here the assignment to *_73 overwrites the value of f (at *e) which

[Bug target/71749] Define _REENTRANT on ARC when -pthread is passed

2023-09-28 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71749 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug target/91251] Revision 272645 on top of 9.1.0 caused ICE: in extract_insn, at recog.c:2310

2023-09-23 Thread roger at nextmovesoftware dot com via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91251 Roger Sayle changed: What|Removed |Added CC||roger at nextmovesoftware dot com

[Bug target/91591] Arc: ICE in trunc_int_for_mode, at explow.c:60

2023-09-23 Thread roger at nextmovesoftware dot com via Gcc-bugs

|--- |8.4 CC||roger at nextmovesoftware dot com Status|UNCONFIRMED |RESOLVED --- Comment #6 from Roger Sayle --- As reported by Giulio, this bug has now been fixed.

1 2 3 4 5 >

1 - 100 of 429 matches

Mail list logo