[Bug tree-optimization/119706] [12/13 regression] ICE in gimple pass 'dom' for -O3 -mcpu=grace --param=aarch64-autovec-preference=sve-only

2025-04-22 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119706 --- Comment #11 from ktkachov at gcc dot gnu.org --- (In reply to GCC Commits from comment #10) > The releases/gcc-14 branch has been updated by Richard Biener > : > > https://gcc.gnu.org/g:2bb4a431eace7e77562e686ecc9c9504045da003

[Bug tree-optimization/119706] [15 regression] ICE in gimple pass 'dom' for -O3 -mcpu=grace --param=aarch64-autovec-preference=sve-only

2025-04-10 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119706 ktkachov at gcc dot gnu.org changed: What|Removed |Added Known to fail||15.0 Target Milestone

[Bug tree-optimization/119706] [15 regression] ICE in gimple pass 'dom' for -O3 -mcpu=grace --param=aarch64-autovec-preference=sve-only

2025-04-10 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119706 ktkachov at gcc dot gnu.org changed: What|Removed |Added Status|UNCONFIRMED |NEW Ever confirmed|0

[Bug tree-optimization/119187] vectorizer should be able to SLP already vectorized code

2025-04-07 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119187 ktkachov at gcc dot gnu.org changed: What|Removed |Added CC||ktkachov at gcc dot

[Bug tree-optimization/119351] [15 Regression] Wrong code in GROMACS for AArch64 generic SVE VLS target

2025-04-02 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119351 --- Comment #11 from ktkachov at gcc dot gnu.org --- (In reply to Jakub Jelinek from comment #10) > Has this worked in GCC 14? If so, has it been bisected what commit caused > this (or made a bug no longer latent)? Yes,

[Bug target/119572] [15 Regression] Recent change triggers regression on RISC-V vector test

2025-04-01 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119572 ktkachov at gcc dot gnu.org changed: What|Removed |Added CC||ktkachov at gcc dot

[Bug rtl-optimization/97286] simplified subreg used outside of the loop can cause conflict and cause an extra move inside the loop

2025-04-01 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=97286 ktkachov at gcc dot gnu.org changed: What|Removed |Added CC||rsandifo at gcc dot gnu.org

[Bug middle-end/119442] [14 Regression] Regression in creating SVE predicate

2025-03-31 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119442 ktkachov at gcc dot gnu.org changed: What|Removed |Added Summary|[14/15 Regression] |[14 Regression] Regression

[Bug middle-end/119442] [14/15 Regression] Regression in creating SVE predicate

2025-03-24 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119442 --- Comment #2 from ktkachov at gcc dot gnu.org --- Patch at https://gcc.gnu.org/pipermail/gcc-patches/2025-March/679115.html

[Bug middle-end/119442] [14/15 Regression] Regression in creating SVE predicate

2025-03-24 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119442 ktkachov at gcc dot gnu.org changed: What|Removed |Added Status|UNCONFIRMED |ASSIGNED

[Bug middle-end/119442] New: [14/15 Regression] Regression in creating SVE predicate

2025-03-24 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
Severity: normal Priority: P3 Component: middle-end Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- Target: aarch64 The testcase is nonsense in itself but is heavily reduced from a real

[Bug tree-optimization/119351] [15 Regression] Wrong code in GROMACS for AArch64 generic SVE VLS target

2025-03-20 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119351 --- Comment #5 from ktkachov at gcc dot gnu.org --- (In reply to Tamar Christina from comment #4) > While looking at the codegen it looks like GROMACS has a lot of loops that > get vectorized now and it's showing some inefficien

[Bug middle-end/119384] Extra move in tight loop with SIMD and subregs

2025-03-20 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119384 --- Comment #1 from ktkachov at gcc dot gnu.org --- > We have a workload for aarch64 using the SIMDe translation error Oops, this should say "SIMDe translation layer"

[Bug middle-end/119384] New: Extra move in tight loop with SIMD and subregs

2025-03-20 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
Priority: P3 Component: middle-end Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- Target: aarch64 We have a workload for aarch64 using the SIMDe translation error that results in slower code than

[Bug tree-optimization/119351] [15 Regression] Wrong code in GROMACS for AArch64 generic SVE VLS target

2025-03-18 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119351 --- Comment #1 from ktkachov at gcc dot gnu.org --- > -DCMAKE_C_COMPILER=$COMPILERBIN -DCMAKE_CXX_COMPILER=$COMPILERXXBIN $COMPILERBIN and $COMPILERXXBIN should point to the gcc and g++ executables

[Bug tree-optimization/119351] New: [15 Regression] Wrong code in GROMACS for AArch64 generic SVE VLS target

2025-03-18 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
: wrong-code Severity: normal Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org CC: acoplan at gcc dot gnu.org, tnfchris at gcc dot gnu.org Target Milestone

[Bug tree-optimization/119193] New: Suboptimal packing codegen

2025-03-10 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- Target: aarch64 Example source: #include #define PACK_8_TO_64( a, b, c, d, e, f, g, h )\ (((uint64_t)a&

[Bug rtl-optimization/119046] [15 Regression] Performance drop from not forming lane-wise FMLAs with Eigen library

2025-03-05 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119046 ktkachov at gcc dot gnu.org changed: What|Removed |Added Status|ASSIGNED|RESOLVED

[Bug rtl-optimization/119046] [15 Regression] Performance drop from not forming lane-wise FMLAs with Eigen library

2025-02-27 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119046 ktkachov at gcc dot gnu.org changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |ktkachov at gcc dot

[Bug rtl-optimization/119046] [15 Regression] Performance drop from not forming lane-wise FMLAs with Eigen library

2025-02-27 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119046 --- Comment #2 from ktkachov at gcc dot gnu.org --- (In reply to Tamar Christina from comment #1) > The late-combine pass was supposed to handle these. probably worth a look > into why it's not folding them in. Yeah you're rig

[Bug rtl-optimization/119046] New: [15 Regression] Performance drop from not forming lane-wise FMLAs with Eigen library

2025-02-27 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
Keywords: missed-optimization Severity: normal Priority: P3 Component: rtl-optimization Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- Created attachment 60603 --> https://gcc.gnu.org/bugzi

[Bug tree-optimization/119042] New: Optimize more !struct.x && !struct.y codegen cases

2025-02-27 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
Severity: normal Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- Target: aarch64 Taken from the LLVM report: https://github.com/llvm/llvm-project/issues/12

[Bug target/118976] [12/13/14/15 regression] Correctness Issue: SVE vectorization results in data corruption when cpu has 128bit vectors but compiled with -mcpu=neoverse-v1 (which is only for 256bit v

2025-02-24 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=118976 ktkachov at gcc dot gnu.org changed: What|Removed |Added CC||ktkachov at gcc dot

[Bug target/118151] Relax the SVE PTEST matching conditions for any/none (ne/eq)

2025-02-24 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=118151 ktkachov at gcc dot gnu.org changed: What|Removed |Added CC||ktkachov at gcc dot

[Bug target/118974] New: Use SVE cbranch sequence for Neon modes when TARGET_SVE

2025-02-21 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
Severity: normal Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org CC: tnfchris at gcc dot gnu.org Target Milestone: --- Target: aarch64 For example, the testcase

[Bug target/118952] AArch64 get_fpcr and set_fpcr builtins don't block reordering of operations past them

2025-02-20 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=118952 --- Comment #2 from ktkachov at gcc dot gnu.org --- (In reply to Richard Sandiford from comment #1) > I think this is essentially the same problem as PR34678. Thanks, yeah I don't see PR34678 getting generally resolved any time soon.

[Bug target/118952] New: AArch64 get_fpcr and set_fpcr builtins don't block reordering of operations past them

2025-02-20 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
IRMED Keywords: wrong-code Severity: normal Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- Target: aarch64 The __builtin_aarch64_set_fpc

[Bug target/117978] Optimise 128-bit-predicated SVE loads to Advanced SIMD LDRs

2025-02-17 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117978 --- Comment #4 from ktkachov at gcc dot gnu.org --- (In reply to Richard Sandiford from comment #3) > I think this would be better done in expand rather than gimple. The gimple > representation would be a vector load in a 128-bit type, fo

[Bug tree-optimization/118852] [15 regression] Train run of 502.gcc_r compiled with -Ofast -fprofile-generate -march=x86_64-v3 fails

2025-02-13 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=118852 ktkachov at gcc dot gnu.org changed: What|Removed |Added CC||ktkachov at gcc dot

[Bug tree-optimization/118852] [15 regression] Train run of 502.gcc_r compiled with -Ofast -fprofile-generate -march=x86_64-v3 fails

2025-02-13 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=118852 --- Comment #5 from ktkachov at gcc dot gnu.org --- (In reply to Tamar Christina from comment #4) > (In reply to ktkachov from comment #3) > > FWIW I see this also on aarch64 > > I filed the AArch64 bug weeks ago > https://gc

[Bug middle-end/118490] [15 Regression] ICE Indefinite recursion transforming exp-log with -frounding-math since r15-5116-ge232dc3bb5c3e8

2025-01-27 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=118490 ktkachov at gcc dot gnu.org changed: What|Removed |Added Resolution|--- |FIXED Status

[Bug target/116445] gcc.target/arm/unsigned-extend-2.c on Cortex-M55 and misses possible Cortex-M optimization

2025-01-21 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116445 ktkachov at gcc dot gnu.org changed: What|Removed |Added CC||ktkachov at gcc dot

[Bug middle-end/118490] [15 Regression] ICE Indefinite recursion transforming exp-log with -frounding-math since r15-5116-ge232dc3bb5c3e8

2025-01-16 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=118490 ktkachov at gcc dot gnu.org changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |soumyaa at gcc dot

[Bug middle-end/118490] [15 Regression] ICE Indefinite recursion transforming exp-log with -frounding-math since r15-5116-ge232dc3bb5c3e8

2025-01-15 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=118490 ktkachov at gcc dot gnu.org changed: What|Removed |Added CC||ktkachov at gcc dot

[Bug target/118377] ICE when using C division operator with svint8 and svint16

2025-01-15 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=118377 --- Comment #8 from ktkachov at gcc dot gnu.org --- (In reply to Tejas Belagod from comment #7) > Sorry for the delay in replying. Though variable-length(VLA) SVE vector > types behave as GNU vectors for C/C++ operator semantics, th

[Bug target/118133] Consider alternative ways of writing aarch64-simd-pragma-builtins.def

2024-12-19 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=118133 ktkachov at gcc dot gnu.org changed: What|Removed |Added CC||ktkachov at gcc dot

[Bug target/117978] Optimise 128-bit-predicated SVE loads to Advanced SIMD LDRs

2024-12-10 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117978 ktkachov at gcc dot gnu.org changed: What|Removed |Added CC||rsandifo at gcc dot

[Bug target/117978] New: Optimise 128-bit-predicated SVE loads to Advanced SIMD LDRs

2024-12-09 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
, missed-optimization Severity: normal Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- Target: aarch64 When it is known that the predicate on a zero-predicated

[Bug testsuite/117704] gcc.dg/tree-ssa/pow_fold_1.c FAILs on 32-bit x86

2024-11-20 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117704 ktkachov at gcc dot gnu.org changed: What|Removed |Added CC||ktkachov at gcc dot

[Bug tree-optimization/117557] [15 Regression] gcc_r miscompiles in SPECCPU 2017 with SVE2 since g:3d498cfe022f6e035ff24e0d78ff744da83ebf42

2024-11-13 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117557 ktkachov at gcc dot gnu.org changed: What|Removed |Added Status|UNCONFIRMED |NEW Last reconfirmed

[Bug tree-optimization/117554] [15 Regression] ICE building 538.imagick_r with SVE -msve-vector-bits=128

2024-11-13 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117554 ktkachov at gcc dot gnu.org changed: What|Removed |Added Target Milestone|--- |15.0

[Bug tree-optimization/117554] New: [15 Regression] ICE building 538.imagick_r with SVE -msve-vector-bits=128

2024-11-13 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
: aarch64-sve, ice-on-valid-code Severity: normal Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- Target: aarch64 Building imagick ICEs on aarch64

[Bug tree-optimization/117499] New: [15 Regression] Segfault ICE building 511.povray_r

2024-11-08 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- Target: aarch64 Building the benchmark from SPEC2017 runs into a segfault ICE on aarch64. Reduced

[Bug target/117449] [15 Regression] ICE in gen_reg_rtx on aarch64 via aarch64_emit_opt_vec_rotate

2024-11-05 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117449 ktkachov at gcc dot gnu.org changed: What|Removed |Added Resolution|--- |FIXED Status

[Bug target/117449] [15 Regression] ICE in gen_reg_rtx on aarch64 via aarch64_emit_opt_vec_rotate

2024-11-05 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117449 ktkachov at gcc dot gnu.org changed: What|Removed |Added Last reconfirmed||2024-11-05 Target

[Bug target/117048] Failure to combine into XAR instruction

2024-11-04 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117048 ktkachov at gcc dot gnu.org changed: What|Removed |Added Status|ASSIGNED|RESOLVED

[Bug target/117344] New: Suboptimal use of movprfx in SVE intrinsics code

2024-10-29 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
Severity: normal Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- Target: aarch64 I'm not sure how bad this is in real code but spotted the testcase: #in

[Bug target/106329] No optimization for SVE pfalse predicate

2024-10-22 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106329 ktkachov at gcc dot gnu.org changed: What|Removed |Added Assignee|prathamesh3492 at gcc dot gnu.org |unassigned at gcc

[Bug tree-optimization/117093] Missing detection of REV64 vector permute

2024-10-16 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117093 --- Comment #4 from ktkachov at gcc dot gnu.org --- (In reply to ktkachov from comment #3) > If we remove the casts: > uint32x4_t ror32_neon_tgt_gcc_bad(uint32x4_t r) { > uint32x4_t a = r; > uint32_t t; > t = a[0]; a

[Bug tree-optimization/117093] Missing detection of REV64 vector permute

2024-10-16 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117093 --- Comment #3 from ktkachov at gcc dot gnu.org --- I think it's the VIEW_CONVERT_EXPR that are hurting us (more complete dump before expand): _1 = VIEW_CONVERT_EXPR(r_3(D)); t_4 = BIT_FIELD_REF ; a_5 = VEC_PERM_EXPR <_1, _1, { 1,

[Bug target/117048] Failure to combine into XAR instruction

2024-10-16 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117048 ktkachov at gcc dot gnu.org changed: What|Removed |Added Status|RESOLVED|ASSIGNED

[Bug tree-optimization/117093] New: Missing detection of REV64 vector permute

2024-10-11 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org CC: tnfchris at gcc dot gnu.org Target Milestone: --- Target: aarch64 This testcase is reduced from a hashing code

[Bug target/117048] Failure to combine into XAR instruction

2024-10-11 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117048 ktkachov at gcc dot gnu.org changed: What|Removed |Added Known to work||15.0 Resolution

[Bug tree-optimization/117050] [15 Regression] ice in vect_build_slp_tree_2

2024-10-10 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117050 ktkachov at gcc dot gnu.org changed: What|Removed |Added Last reconfirmed||2024-10-10 Ever

[Bug target/117048] Failure to combine into XAR instruction

2024-10-09 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117048 ktkachov at gcc dot gnu.org changed: What|Removed |Added Last reconfirmed||2024-10-09

[Bug target/117048] Failure to combine into XAR instruction

2024-10-09 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117048 --- Comment #1 from ktkachov at gcc dot gnu.org --- Yeah, there is code in simplify-rtx.cc:3467 to simplify this pattern to a rotate but it doesn't handle vector operands

[Bug target/117048] New: Failure to combine into XAR instruction

2024-10-09 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- Target: aarch64 A testcase derived from a hashing algorithm: #include #include #include static inline uint64x2_t

[Bug target/117045] Incorrect fold of SVE's svwhilele

2024-10-09 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117045 ktkachov at gcc dot gnu.org changed: What|Removed |Added CC||ktkachov at gcc dot

[Bug target/117013] aarch64 should define spaceship4 optab

2024-10-08 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117013 ktkachov at gcc dot gnu.org changed: What|Removed |Added CC||ktkachov at gcc dot

[Bug target/116999] Fold SVE whilelt/le comparisons with max int value to ptrue

2024-10-07 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116999 --- Comment #1 from ktkachov at gcc dot gnu.org --- This is inspired by the LLVM PR https://github.com/llvm/llvm-project/pull/83

[Bug target/116999] New: Fold SVE whilelt/le comparisons with max int value to ptrue

2024-10-07 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
-optimization Severity: normal Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- Target: aarch64 Example testcase: #include #include svbool_t foo_s32_le (int32_t

[Bug target/116934] [15 Regression] ICE building 526.blender_r

2024-10-07 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116934 ktkachov at gcc dot gnu.org changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED

[Bug tree-optimization/116956] [15 Regression] ICE when building PALM with gfortran: in vect_analyze_loop_1, at tree-vect-loop.cc:3510

2024-10-03 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116956 ktkachov at gcc dot gnu.org changed: What|Removed |Added Target Milestone|--- |15.0 Summary

[Bug target/116934] [15 Regression] ICE building 526.blender_r

2024-10-02 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116934 ktkachov at gcc dot gnu.org changed: What|Removed |Added CC||saurabh.jha at arm dot com

[Bug target/116934] [15 Regression] ICE building 526.blender_r

2024-10-02 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116934 ktkachov at gcc dot gnu.org changed: What|Removed |Added Target Milestone|--- |15.0

[Bug target/116934] New: [15 Regression] ICE building 526.blender_r

2024-10-02 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- Target: aarch64 blender from SPEC2017 ICEs with current trunk. The reduced testcase is: int a; float *b; void c() { for

[Bug target/111733] Emit inline SVE FSCALE instruction for ldexp

2024-09-30 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111733 ktkachov at gcc dot gnu.org changed: What|Removed |Added Status|NEW |ASSIGNED --- Comment #3

[Bug tree-optimization/116902] [15 Regression] ICE Another definition in block 43 follows the use

2024-09-30 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116902 ktkachov at gcc dot gnu.org changed: What|Removed |Added Target Milestone|--- |15.0 Summary

[Bug tree-optimization/116902] New: ICE Another definition in block 43 follows the use

2024-09-30 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- The following C++ testcase ICEs on aarch64 with -O3: unsigned a; #include void i(long b, char c[][4], long d[][4

[Bug tree-optimization/116842] [15 Regression] ICE definition in block follows the use

2024-09-25 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116842 --- Comment #1 from ktkachov at gcc dot gnu.org --- (In reply to ktkachov from comment #0) > short a, b, c; > unsigned d(unsigned, int e) { return e; } > void f(bool g, short e[][3][3][3][3], unsigned h[][3][3], char i[][8], >

[Bug tree-optimization/116842] [15 Regression] ICE definition in block follows the use

2024-09-25 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116842 ktkachov at gcc dot gnu.org changed: What|Removed |Added Target Milestone|--- |15.0

[Bug tree-optimization/116842] New: [15 Regression] ICE definition in block follows the use

2024-09-25 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
Severity: normal Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- short a, b, c; unsigned d(unsigned, int e) { return e; } void f(bool g, short e[][3][3][3][3

[Bug target/116840] New: Optimise __builtin_parity for aarch64

2024-09-25 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- Target: aarch64 int foob (unsigned char x) { return __builtin_parityg (x); } int fooh (unsigned short x) { return

[Bug tree-optimization/116826] Optimise log (1.0 / x) into -log (x)

2024-09-24 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116826 --- Comment #3 from ktkachov at gcc dot gnu.org --- (In reply to Andrew Pinski from comment #2) > Note PR 86710 lists the opposite (except without being a CST for the > division). > > Just like PR 86710, this applies for log, log10

[Bug tree-optimization/116831] [15 Regression] ICE with trunk mod vectorising for SVE

2024-09-24 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116831 ktkachov at gcc dot gnu.org changed: What|Removed |Added Target Milestone|--- |15.0 CC

[Bug tree-optimization/116831] New: [15 Regression] ICE with trunk mod vectorising for SVE

2024-09-24 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
: normal Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- Target: aarch64 long a; int b, c; void d(int e[][5], short f[][5][5][5]) { for (short g; g; g += 4

[Bug middle-end/116815] Make better use of overflow flags in codegen of min/max(a, add/sub(a, b))

2024-09-24 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116815 --- Comment #2 from ktkachov at gcc dot gnu.org --- (In reply to Andrew Pinski from comment #1) > The easiest way to fix this is transform (late in gimple): > _1 = a_2(D) + b_3(D); > _5 = MAX_EXPR <_1, a_2(D)>; >

[Bug tree-optimization/116826] New: Optimise log (1.0 / x) into -log (x)

2024-09-24 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- float foo (float a) { return __builtin_logf (1.0f / a); } We can avoid the division by folding it into -__builtin_logf (a

[Bug tree-optimization/116819] [15 Regression] ICE in vect_transform_stmt

2024-09-24 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116819 --- Comment #4 from ktkachov at gcc dot gnu.org --- (In reply to Andrew Pinski from comment #1) > Note this code is undefined: > prephitmp_52 = _44 ? _17 : _16(D); > > This is due to lifetime of the temp being bound for the ca

[Bug tree-optimization/116819] New: [15 Regression] ICE in vect_transform_stmt

2024-09-23 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- Target: aarch64 char a, b; const unsigned long long &c(const unsigned long lon

[Bug tree-optimization/116812] [15 Regression] ICE on valid code at -O2 with "-fno-tree-dce -fno-tree-dse" on x86_64-linux-gnu: verify_flow_info failed

2024-09-23 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116812 ktkachov at gcc dot gnu.org changed: What|Removed |Added CC||ktkachov at gcc dot

[Bug target/116815] New: Make better use of overflow flags in codegen of min/max(a, add/sub(a, b))

2024-09-23 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
: missed-optimization Severity: normal Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- Target: aarch64 The LLVM report https://github.com/llvm/llvm-project

[Bug tree-optimization/116684] [vectorization][x86-64] dot_16x1x16_uint8_int8_int32 could be better optimized

2024-09-11 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116684 ktkachov at gcc dot gnu.org changed: What|Removed |Added Status|UNCONFIRMED |NEW Ever confirmed|0

[Bug tree-optimization/57492] Optimize 2.0**i to ldexp(1.0,i)

2024-09-11 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=57492 ktkachov at gcc dot gnu.org changed: What|Removed |Added Last reconfirmed|2013-05-31 00:00:00 |2024-9-11

[Bug target/116630] New: Implement spaceshipm3 optab for aarch64

2024-09-06 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- Target: aarch64 For the testcase: #include auto cmp4way(double a, double b) { return a <=> b; } auto cmp4wayf

[Bug tree-optimization/116569] [15 Regression] ICE in to_constant, at poly-int.h:592

2024-09-06 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116569 ktkachov at gcc dot gnu.org changed: What|Removed |Added CC||rguenth at gcc dot gnu.org

[Bug tree-optimization/116628] [15 Regression] ICE in vect_analyze_loop_1 on aarch64 with -Ofast in TSVC

2024-09-06 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116628 --- Comment #2 from ktkachov at gcc dot gnu.org --- (In reply to Andrew Pinski from comment #1) > What revision/commit id are you using? Is it before or after r15-3411 ? It's a fairly fresh g:6a1a856ba78589f7f5285b00ecd40ba2bbeef8b0

[Bug tree-optimization/116628] [15 Regression] ICE in vect_analyze_loop_1 on aarch64 with -Ofast in TSVC

2024-09-06 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116628 ktkachov at gcc dot gnu.org changed: What|Removed |Added Known to work||14.2.0 Known to fail

[Bug tree-optimization/116628] New: [15 Regression] ICE in vect_analyze_loop_1 on aarch64 with -Ofast in TSVC

2024-09-06 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
-on-valid-code Severity: normal Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- Target: aarch64 The testcase is reduced from TSVC: typedef

[Bug tree-optimization/116569] [15 Regression] ICE in to_constant, at poly-int.h:592

2024-09-02 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116569 ktkachov at gcc dot gnu.org changed: What|Removed |Added Target Milestone|--- |15.0

[Bug tree-optimization/116569] New: [15 Regression] ICE in to_constant, at poly-int.h:592

2024-09-02 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
Severity: normal Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- Target: aarch64 int a; short b, c, e; long d, f; long g(long h) { if (h) return h

[Bug target/63521] The AArch64 backend doesn't define REG_ALLOC_ORDER.

2024-09-02 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=63521 ktkachov at gcc dot gnu.org changed: What|Removed |Added Resolution|--- |FIXED Status

[Bug target/116564] [12/13/14/15 Regression] aarch64: gcc can't finish when compiling vst2_f64 instrinsic with opt level >= O1

2024-09-02 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116564 ktkachov at gcc dot gnu.org changed: What|Removed |Added Ever confirmed|0 |1 Target

[Bug rtl-optimization/116541] New: [14/15 Regression] Inefficient missing use of reg+reg addressing modes

2024-08-30 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
-optimization Severity: normal Priority: P3 Component: rtl-optimization Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org CC: tnfchris at gcc dot gnu.org Target Milestone: --- Target: aarch64

[Bug target/114063] Use IFN_CHECK_RAW_PTRS/IFN_CHECK_WAR_PTRS for Advanced. SIMD

2024-08-28 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114063 ktkachov at gcc dot gnu.org changed: What|Removed |Added CC||ktkachov at gcc dot

[Bug target/116509] 128bit integer compares can be improved

2024-08-28 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116509 ktkachov at gcc dot gnu.org changed: What|Removed |Added Status|UNCONFIRMED |NEW Ever confirmed|0

[Bug rtl-optimization/116238] [12/13/14/15 Regression] ICE building 526.blender_r on aarch64 SVE after r15-1619-g3b9b8d6cfdf593

2024-08-20 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116238 ktkachov at gcc dot gnu.org changed: What|Removed |Added Summary|[15 Regression] ICE |[12/13/14/15 Regression

[Bug rtl-optimization/116238] [15 Regression] ICE building 526.blender_r on aarch64 SVE after r15-1619-g3b9b8d6cfdf593

2024-08-20 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116238 ktkachov at gcc dot gnu.org changed: What|Removed |Added CC||rsandifo at gcc dot

[Bug target/116391] New: Emit spellcheck suggestions for __arm_rsr and __arm_wsr intrinsics

2024-08-16 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
Severity: enhancement Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- Target: aarch64 GCC 14 added intrinsics to read and write to sysregs, allowing the

[Bug tree-optimization/116367] New: Handle vector shuffles better

2024-08-13 Thread ktkachov at gcc dot gnu.org via Gcc-bugs
: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: ktkachov at gcc dot gnu.org Target Milestone: --- Apologies for the broad summary, feel free to make it more targeted. Testcase: void test(short * restrict x, short * restrict y

  1   2   3   4   5   6   7   8   9   10   >