[Bug tree-optimization/116265] New: Missing optimization: Vectorization of modulo operator

2024-08-06 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116265 Bug ID: 116265 Summary: Missing optimization: Vectorization of modulo operator Product: gcc Version: 15.0 Status: UNCONFIRMED Keywords: missed-optimization Severity: enh

[Bug tree-optimization/116265] Missing optimization: Vectorization of modulo operator

2024-08-06 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116265 Jennifer Schmitz changed: What|Removed |Added Status|UNCONFIRMED |ASSIGNED Ever confirmed|0

[Bug tree-optimization/101390] Expand vector mod as vector div + multiply-subtract

2024-08-07 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101390 Jennifer Schmitz changed: What|Removed |Added Status|NEW |ASSIGNED Assignee|unassign

[Bug tree-optimization/101390] Expand vector mod as vector div + multiply-subtract

2024-08-07 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101390 --- Comment #7 from Jennifer Schmitz --- Thank you for the reply. Seems like I have been looking in the right places. I'm a new member of the GCC community, so I'm still getting familiar with many parts of the code base. I have been trying to fi

[Bug target/116365] Add user-friendly arguments to --param aarch64-autovec-preference=N

2024-08-22 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116365 Jennifer Schmitz changed: What|Removed |Added Status|NEW |RESOLVED Resolution|---

[Bug tree-optimization/101390] Expand vector mod as vector div + multiply-subtract

2024-08-22 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101390 Jennifer Schmitz changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug tree-optimization/53947] [meta-bug] vectorizer missed-optimizations

2024-08-22 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=53947 Bug 53947 depends on bug 101390, which changed state. Bug 101390 Summary: Expand vector mod as vector div + multiply-subtract https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101390 What|Removed |Added -

[Bug tree-optimization/116569] [15 Regression] ICE in to_constant, at poly-int.h:592

2024-09-06 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116569 --- Comment #5 from Jennifer Schmitz --- I looked into the issue and summarize below what I found: My current fix that checks for the support of the mod optab for vectors looks like this: @@ -894,7 +894,9 @@ DEFINE_INT_AND_FLOAT_ROUND_FN (RINT

[Bug tree-optimization/116569] [15 Regression] ICE in to_constant, at poly-int.h:592

2024-09-06 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116569 --- Comment #7 from Jennifer Schmitz --- Thanks for the quick reply. I tried (simplify (minus (convert1? @0) (convert2? (mult:c (trunc_div @@0 @@1) @1))) (if (INTEGRAL_TYPE_P (type) || (VECTOR_INTEGER_TYPE_P (type) && optimize

[Bug tree-optimization/116831] [15 Regression] ICE with trunc mod vectorising for SVE

2024-10-10 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116831 Jennifer Schmitz changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug tree-optimization/86710] 3 missing logarithm optimizations

2024-10-11 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=86710 Jennifer Schmitz changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug tree-optimization/116826] Optimise log (1.0 / x) into -log (x)

2024-10-11 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116826 Jennifer Schmitz changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug tree-optimization/117093] Missing detection of REV64 vector permute

2024-10-31 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117093 Jennifer Schmitz changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |jschmitz at gcc dot gnu.org

[Bug tree-optimization/116826] Optimise log (1.0 / x) into -log (x)

2024-09-24 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116826 Jennifer Schmitz changed: What|Removed |Added Last reconfirmed||2024-09-24 Status|UNCONF

[Bug tree-optimization/116569] [15 Regression] ICE in to_constant, at poly-int.h:592

2024-09-18 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116569 Jennifer Schmitz changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug tree-optimization/86710] 3 missing logarithm optimizations

2024-09-25 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=86710 Jennifer Schmitz changed: What|Removed |Added CC||jschmitz at gcc dot gnu.org

[Bug tree-optimization/116831] [15 Regression] ICE with trunc mod vectorising for SVE

2024-10-02 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116831 Jennifer Schmitz changed: What|Removed |Added Status|NEW |ASSIGNED Assignee|unassign

[Bug target/106329] No optimization for SVE pfalse predicate

2024-10-24 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106329 Jennifer Schmitz changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |jschmitz at gcc dot gnu.org

[Bug testsuite/117704] gcc.dg/tree-ssa/pow_fold_1.c FAILs on 32-bit x86

2024-11-28 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117704 Jennifer Schmitz changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug testsuite/117704] gcc.dg/tree-ssa/pow_fold_1.c FAILs on 32-bit x86

2024-11-20 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117704 Jennifer Schmitz changed: What|Removed |Added Last reconfirmed||2024-11-20 Ever confirmed|0

[Bug tree-optimization/117093] Missing detection of REV64 vector permute

2024-11-15 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117093 Jennifer Schmitz changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug tree-optimization/117093] Missing detection of REV64 vector permute

2024-11-16 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117093 --- Comment #9 from Jennifer Schmitz --- Thanks for reporting it, I'll look into it on Monday.

[Bug target/106329] No optimization for SVE pfalse predicate

2024-12-05 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106329 Jennifer Schmitz changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug tree-optimization/114999] A few missing optimizations due to `a - b` and `b - a` not being detected as negatives of each other

2024-12-11 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114999 Jennifer Schmitz changed: What|Removed |Added CC||jschmitz at gcc dot gnu.org --- Comm

[Bug target/117978] Optimise 128-bit-predicated SVE loads to Advanced SIMD LDRs

2024-12-12 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117978 Jennifer Schmitz changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |jschmitz at gcc dot gnu.org

[Bug tree-optimization/114999] A few missing optimizations due to `a - b` and `b - a` not being detected as negatives of each other

2025-01-14 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114999 --- Comment #12 from Jennifer Schmitz --- Created attachment 60149 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=60149&action=edit Proposed patch for detecting abs diff for signed integers

[Bug target/119009] New: AArch64: Commit 'Node clones share order' causes regression in Snappy workload for -mcpu=neoverse-v2 with LTO

2025-02-25 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119009 Bug ID: 119009 Summary: AArch64: Commit 'Node clones share order' causes regression in Snappy workload for -mcpu=neoverse-v2 with LTO Product: gcc Version: 15.0

[Bug target/118999] New: AArch64: Switching off early scheduling causes regressions in Snappy workload for -mcpu=neoverse-v2

2025-02-24 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=118999 Bug ID: 118999 Summary: AArch64: Switching off early scheduling causes regressions in Snappy workload for -mcpu=neoverse-v2 Product: gcc Version: 15.0 Status: UNCONFIRME

[Bug tree-optimization/114999] A few missing optimizations due to `a - b` and `b - a` not being detected as negatives of each other

2025-02-20 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114999 --- Comment #13 from Jennifer Schmitz --- Created attachment 60540 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=60540&action=edit Patch for improving codegen of absolute differences of unsigned integers in aarch64 This patch builds on t

[Bug target/118999] [15 regression] AArch64: Switching off early scheduling (r15-6661-gc5db3f50bdf34e) causes regressions in Snappy workload for -mcpu=neoverse-v2

2025-03-10 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=118999 --- Comment #2 from Jennifer Schmitz --- Thanks for looking into this. The regression looks to have been resolved by: AArch64: Enable early scheduling for -O3 and higher (PR118351) On our machines, the runtimes are back to normal. Do you still s

[Bug ipa/119009] [15 regression] AArch64: Commit 'Node clones share order' (r15-6345-g0895aef01c64c3) causes regression in Snappy workload for -mcpu=neoverse-v2 with LTO

2025-03-05 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119009 --- Comment #4 from Jennifer Schmitz --- Thanks for looking into this. Indeed, the runtime has recovered in the meantime. From our side, we can close the PR.

[Bug target/117978] Optimise 128-bit-predicated SVE loads to Advanced SIMD LDRs

2025-03-17 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117978 --- Comment #6 from Jennifer Schmitz --- Created attachment 60790 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=60790&action=edit Proposed patch for folding SVE load/store with certain ptrue patterns to LDR/STR

[Bug tree-optimization/119706] [12/13/14 regression] ICE in gimple pass 'dom' for -O3 -mcpu=grace --param=aarch64-autovec-preference=sve-only

2025-04-10 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119706 --- Comment #7 from Jennifer Schmitz --- Great, thanks a lot for the quick fix!

[Bug tree-optimization/119706] New: [15 regression] ICE in gimple pass 'dom' for -O3 -mcpu=grace --param=aarch64-autovec-preference=sve-only

2025-04-10 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119706 Bug ID: 119706 Summary: [15 regression] ICE in gimple pass 'dom' for -O3 -mcpu=grace --param=aarch64-autovec-preference=sve-only Product: gcc Version: 15.0

[Bug tree-optimization/119606] New: [15 regression] Commit 'Optimize string constructor' causes regression in Snappy workload for -mcpu=neoverse-v2 with LTO

2025-04-03 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119606 Bug ID: 119606 Summary: [15 regression] Commit 'Optimize string constructor' causes regression in Snappy workload for -mcpu=neoverse-v2 with LTO Product: gcc Ver

[Bug libstdc++/119606] [15 regression] Commit 'Optimize string constructor' causes regression in Snappy workload for -mcpu=neoverse-v2 with LTO

2025-04-03 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119606 --- Comment #6 from Jennifer Schmitz --- (In reply to Jan Hubicka from comment #5) > the patch to string constructor should be kind of orthogonal to PR86590. > I downloaded snappy and perfed it on znver3 machine and while I see there > are some

[Bug libstdc++/119606] [15/16 regression] Commit 'Optimize string constructor' causes regression in Snappy workload for -mcpu=neoverse-v2 with LTO

2025-04-24 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119606 --- Comment #7 from Jennifer Schmitz --- For another regression in the Snappy workload (https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119910), we found that it was caused by an alignment issue. I added -falign-functions=32 -falign-loops=32 -falig

[Bug target/119910] [15 regression] Commit 'combine: Allow 2->2 combinations...' causes regression in Snappy workload for -mcpu=neoverse-v2

2025-04-24 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119910 --- Comment #3 from Jennifer Schmitz --- Yes, it seems to be an alignment problem: I took a look with perf at the hot sections and the assembly sequence is the same. But objdump of the benchmark executable showed that the number of nops differs

[Bug rtl-optimization/119910] New: [15 regression] Commit 'combine: Allow 2->2 combinations...' causes regression in Snappy workload for -mcpu=neoverse-v2

2025-04-23 Thread jschmitz at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119910 Bug ID: 119910 Summary: [15 regression] Commit 'combine: Allow 2->2 combinations...' causes regression in Snappy workload for -mcpu=neoverse-v2 Product: gcc Vers