[Bug ipa/102720] [12 regression] gcc.dg/tree-ssa/ldist-strlen-1.c and ldist-strlen-2.c fail after r12-4324

2021-10-15 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102720 --- Comment #7 from Jan Hubicka --- simplified testcase is: typedef unsigned char uint8_t; typedef __SIZE_TYPE__ size_t; extern void* malloc (size_t); extern void* memset (void*, int, size_t); #define test(T, U)\ __attribute__((noinline

[Bug ipa/102557] [12 Regression] ICE: Segmentation fault signal terminated program cc1plus (indefinite recursion in modref_ref_node::insert_access) since r12-3202-gf5ff3a8ed4ca9173

2021-10-15 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102557 Jan Hubicka changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug ipa/102720] [12 regression] gcc.dg/tree-ssa/ldist-strlen-1.c and ldist-strlen-2.c fail after r12-4324

2021-10-15 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102720 --- Comment #8 from Jan Hubicka --- so it is really pt_solutions_intersect in ref_maybe_used_by_call returning false. We get: (gdb) p *pt1 $6 = {anything = 0, nonlocal = 1, escaped = 1, ipa_escaped = 0, null = 0, vars_contains_nonlocal = 0, vars

[Bug ipa/102720] [12 regression] gcc.dg/tree-ssa/ldist-strlen-1.c and ldist-strlen-2.c fail after r12-4324

2021-10-15 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102720 --- Comment #9 from Jan Hubicka --- OK, with -alias dump we have: int main () { uint8_t * q; void * p; long unsigned int _2; : # PT = null { D.2008 } # ALIGN = 8, MISALIGN = 0 # USE = anything # CLB = anything p_5 = malloc

[Bug ipa/102720] [12 regression] gcc.dg/tree-ssa/ldist-strlen-1.c and ldist-strlen-2.c fail after r12-4324

2021-10-15 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102720 --- Comment #10 from Jan Hubicka --- copied ealias dump rather than alias dump in previous comment. alias dump is int main () { void * p; long unsigned int _1; [local count: 1073741824]: # PT = null { D.2014 } # ALIGN = 8, MISALIGN =

[Bug ipa/102720] [12 regression] gcc.dg/tree-ssa/ldist-strlen-1.c and ldist-strlen-2.c fail after r12-4324

2021-10-15 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102720 --- Comment #11 from Jan Hubicka --- Aha, the problem is in the way I updated computing use/clobber sets. I accidentally disabled code that copies the solution from solver local representation into the final form. As a result we failed to updat

[Bug middle-end/102947] New: SPEC2006 compiler time regression (-Ofast -march=native -flto) between 1932e1169a236849 and 9cfb95f9b92326e8

2021-10-26 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102947 Bug ID: 102947 Summary: SPEC2006 compiler time regression (-Ofast -march=native -flto) between 1932e1169a236849 and 9cfb95f9b92326e8 Product: gcc Version: 12.0

[Bug middle-end/102947] SPEC2006 compiler time regression (-Ofast -march=native -flto) between 1932e1169a236849 and 9cfb95f9b92326e8

2021-10-26 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102947 --- Comment #1 from Jan Hubicka --- It seems enough to look at the WRP benchmark build time. https://lnt.opensuse.org/db_default/v4/SPEC/graph?plot.0=322.270.8&plot.1=307.270.8&plot.2=343.270.8&plot.3=266.270.8&plot.4=395.270.8&plot.5=412.270.8&p

[Bug middle-end/102948] New: 60% build time regression on gamess in range 2fc2e3917f9c8fd94f5d101477971d16c483ef88...c16f21c7cf97ce48967e42d3b5d22ea169a9c2c8

2021-10-26 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102948 Bug ID: 102948 Summary: 60% build time regression on gamess in range 2fc2e3917f9c8fd94f5d101477971d16c483ef88...c16f21c7cf97ce48967e42d3b5d22ea169a9c2c8 Product: gcc

[Bug tree-optimization/101908] [12 regression] cray regression with -O2 -ftree-slp-vectorize compared to -O2

2021-10-28 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101908 Jan Hubicka changed: What|Removed |Added Summary|cray regression with -O2|[12 regression] cray |-

[Bug tree-optimization/101908] [12 regression] cray regression with -O2 -ftree-slp-vectorize compared to -O2

2021-10-28 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101908 --- Comment #6 from Jan Hubicka --- zen https://lnt.opensuse.org/db_default/v4/CPP/graph?plot.0=198.639.0&plot.1=180.639.0&plot.2=201.639.0&plot.3=150.639.0&plot.4=246.639.0&plot.5=256.639.0&plot.6=176.639.0&; kabylake https://lnt.opensuse.org/d

[Bug middle-end/102997] New: 45% calculix regression with LTO+PGO -march=native -Ofast between ce4d1f632ff3f680550d3b186b60176022f41190 and 6fca1761a16c68740f875fc487b98b6bde8e9be7 on Zen

2021-10-29 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102997 Bug ID: 102997 Summary: 45% calculix regression with LTO+PGO -march=native -Ofast between ce4d1f632ff3f680550d3b186b60176022f41190 and 6fca1761a16c68740f875fc487b9

[Bug tree-optimization/102058] [12 regression] 450.soplex regressed on x86_64 with -Ofast -march=generic (by 8-15%)

2021-10-29 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102058 Jan Hubicka changed: What|Removed |Added CC||hubicka at gcc dot gnu.org Sum

[Bug tree-optimization/102943] [12 Regression] VRP threader compile-time hog with 521.wrf_r

2021-10-31 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102943 --- Comment #7 from Jan Hubicka --- this is compile time plot https://lnt.opensuse.org/db_default/v4/SPEC/graph?plot.0=227.270.8 https://lnt.opensuse.org/db_default/v4/SPEC/graph?plot.0=289.270.8 (-O2 and -Ofast with lto) Things have improved but

[Bug tree-optimization/107715] New: TSVC s161 for double runs at zen4 30 times slower when vectorization is enabled

2022-11-16 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107715 Bug ID: 107715 Summary: TSVC s161 for double runs at zen4 30 times slower when vectorization is enabled Product: gcc Version: 13.0 Status: UNCONFIRMED Severity

[Bug tree-optimization/99408] s3251 benchmark of TSVC vectorized by clang runs about 7 times faster compared to gcc

2022-11-16 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=99408 --- Comment #2 from Jan Hubicka --- This also reproduces with zen4 and double. jh@alberti:~/tsvc/bin> cat tt.c typedef double real_t; #define iterations 10 #define LEN_1D 32000 #define LEN_2D 256 real_t a[LEN_1D],b[LEN_1D],c[LEN_1D],d[LEN_1D

[Bug middle-end/107718] New: clang optimizes TSVC s317 a lot better

2022-11-16 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107718 Bug ID: 107718 Summary: clang optimizes TSVC s317 a lot better Product: gcc Version: 13.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component: middle-en

[Bug tree-optimization/99411] s311, s312, s31111, s31111, s3110, vsumr benchmark of TSVC is vectorized by clang better than by gcc

2022-11-16 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=99411 --- Comment #7 from Jan Hubicka --- With znver4, current trunk and clang15 I still see this problem (clang code is about 60% faster) for s311, s312 and s3111. Curiously, s3 and s3110 no longer show a regression.

[Bug tree-optimization/107715] TSVC s161 and s277 for double runs at zen4 30 times slower when vectorization is enabled

2022-11-16 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107715 Jan Hubicka changed: What|Removed |Added Summary|TSVC s161 for double runs |TSVC s161 and s277 for

[Bug middle-end/107719] New: 14% regression on TSVC s3113 on znve4 compared to GCC 7.5

2022-11-16 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107719 Bug ID: 107719 Summary: 14% regression on TSVC s3113 on znve4 compared to GCC 7.5 Product: gcc Version: 13.0 Status: UNCONFIRMED Severity: normal Pri

[Bug ipa/107769] [12/13 Regression] -flto with -Os/-O2/-O3 emitted code with gcc 12.x segfaults via mutated global in .rodata since r12-2887-ga6da2cddcf0e959d

2022-11-21 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107769 Jan Hubicka changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |hubicka at gcc dot gnu.org

[Bug c++/107597] LTO causes static inline variables to get a non-uniqued global symbol

2023-01-04 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107597 --- Comment #7 from Jan Hubicka --- So I guess it is asan being confused by our optimization. We intentionally duplicate the symbol in order to reduce cost of dynamic linking in situations where we know it does not change semantics, but asan loo

[Bug tree-optimization/107467] [12/13 Regression] Miscompilation involing -Os , -flto and -fno-strict-aliasing since r12-656-ga564da506f52be66

2023-01-04 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107467 Jan Hubicka changed: What|Removed |Added Status|NEW |ASSIGNED Assignee|unassigned at

[Bug lto/105727] __builtin_constant_p expansion in LTO

2022-05-25 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105727 --- Comment #6 from Jan Hubicka --- I don't know what clang does, but GCC keeps builtin_constant_p till late optimization and resolves it then. So here we cross module inline (or constant propagate) and then it becomes constant. Outcome of __bu

[Bug tree-optimization/105739] [9/10 Regression] Miscompilation of Linux kernel update.c

2022-05-26 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105739 Jan Hubicka changed: What|Removed |Added Ever confirmed|0 |1 Last reconfirmed|

[Bug tree-optimization/105739] [10 Regression] Miscompilation of Linux kernel update.c

2022-06-09 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105739 --- Comment #8 from Jan Hubicka --- After inlining I see: IPA function summary for rcu_tasks_trace_pertask/5350 inlinable global time: 13.535950 self size: 11

[Bug tree-optimization/105739] [10 Regression] Miscompilation of Linux kernel update.c

2022-06-10 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105739 --- Comment #9 from Jan Hubicka --- Indeed volatile checks seem to be missing across ipa-prop code. Here is a smaller testcase: __attribute__((noinline)) static int test2(int a) { if (__builtin_constant_p (a)) __builtin_a

[Bug tree-optimization/105739] [10 Regression] Miscompilation of Linux kernel update.c

2022-06-10 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105739 --- Comment #10 from Jan Hubicka --- I am testing diff --git a/gcc/ipa-prop.cc b/gcc/ipa-prop.cc index afd9222b5a2..c037668e7d8 100644 --- a/gcc/ipa-prop.cc +++ b/gcc/ipa-prop.cc @@ -1112,6 +1112,10 @@ ipa_load_from_parm_agg (struct ipa_func_bod

[Bug ipa/105917] New: Missed passthru jump function

2022-06-10 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105917 Bug ID: 105917 Summary: Missed passthru jump function Product: gcc Version: unknown Status: UNCONFIRMED Severity: normal Priority: P3 Component: ipa A

[Bug ipa/105917] [10/11/12/13 regression] Missed passthru jump function

2022-06-10 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105917 Jan Hubicka changed: What|Removed |Added Summary|Missed passthru jump|[10/11/12/13 regression]

[Bug c++/106057] New: Missed stmt_can_throw_external check in stmt_kills_ref_p

2022-06-22 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106057 Bug ID: 106057 Summary: Missed stmt_can_throw_external check in stmt_kills_ref_p Product: gcc Version: 13.0 Status: UNCONFIRMED Severity: normal Prio

[Bug c++/106057] Missed stmt_can_throw_external check in stmt_kills_ref_p

2022-06-22 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106057 --- Comment #1 from Jan Hubicka --- C only testcase (also misoptimized in clang) #include <setjmp.h> int b; jmp_buf buf; __attribute__((noinline)) int maybethrow() { if (!b) longjmp (buf,1); return 2; } void test(int *a)

[Bug c++/106057] Missed stmt_can_throw_external check in stmt_kills_ref_p

2022-06-23 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106057 --- Comment #2 from Jan Hubicka --- The second testcase (with longjmp) is invalid since longjmp can clobber an automatic variable, and making the variable static breaks the testcase since we believe that longjmp reads global memory state (it doesn't).
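A minimal C sketch of the first point in the comment above, not taken from the bug report. Under the C standard, a non-volatile automatic variable that is modified between setjmp and longjmp has an indeterminate value after the jump, so a testcase that reads such a variable is invalid. The names demo and jump_back are hypothetical; the volatile qualifier is what makes the result well-defined in this sketch.

```c
#include <setjmp.h>

static jmp_buf buf;

/* Hypothetical helper: jumps back to the setjmp call in demo(). */
static void jump_back(void)
{
    longjmp(buf, 1);
}

/* Hypothetical example function. */
int demo(void)
{
    /* volatile: the value written before longjmp is guaranteed to be
       visible after the jump. Without volatile the value would be
       indeterminate, because the variable is modified between
       setjmp and longjmp. */
    volatile int counter = 0;

    if (setjmp(buf) == 0) {
        counter = 42;   /* modified after setjmp, before longjmp */
        jump_back();    /* does not return here */
    }
    /* Reached only on the second return from setjmp. */
    return counter;
}
```

With the volatile qualifier in place, demo() returns 42. Dropping the qualifier gives a program whose result the compiler is free to change, which is the reason the comment calls the longjmp testcase invalid.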

[Bug middle-end/106075] New: Wrong DSE with -fnon-call-exceptions

2022-06-24 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106075 Bug ID: 106075 Summary: Wrong DSE with -fnon-call-exceptions Product: gcc Version: 13.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component: middle-end

[Bug ipa/106077] New: Invalid IPA-SRA with non-call exceptions

2022-06-24 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106077 Bug ID: 106077 Summary: Invalid IPA-SRA with non-call exceptions Product: gcc Version: 13.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component: ipa

[Bug middle-end/106075] Wrong DSE with -fnon-call-exceptions

2022-06-24 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106075 --- Comment #4 from Jan Hubicka --- PR106077 demonstrates a related problem where ipa-sra concludes it is safe to move a dereference earlier in the code. It uses a dominator test for that.

[Bug middle-end/106075] Wrong DSE with -fnon-call-exceptions

2022-06-24 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106075 --- Comment #5 from Jan Hubicka --- Also note that the longjmp testcase will not get misoptimized since we consider longjmp as using all global memory.

[Bug middle-end/106078] New: Invalid loop invariant motion with non-call-exceptions

2022-06-24 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106078 Bug ID: 106078 Summary: Invalid loop invariant motion with non-call-exceptions Product: gcc Version: 13.0 Status: UNCONFIRMED Severity: normal Priority: P3 Comp

[Bug middle-end/106078] Invalid loop invariant motion with non-call-exceptions

2022-06-24 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106078 --- Comment #1 from Jan Hubicka --- This is a version that does not need -fnon-call-exceptions. If called as test (NULL, 0) it should be indefinitely increasing val rather than segfaulting. Seems clang gets this one right. int array[1]; volatile

[Bug ipa/106077] Invalid IPA-SRA with non-call exceptions

2022-06-24 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106077 --- Comment #1 from Jan Hubicka --- Also note that the dominance check is written the wrong way, so it only passes for the first BB in the function diff --git a/gcc/ipa-sra.cc b/gcc/ipa-sra.cc index 96b020fb2dd..6b2df2f3ff0 100644 --- a/gcc/ipa-sra

[Bug middle-end/106081] New: missed vectorization

2022-06-24 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106081 Bug ID: 106081 Summary: missed vectorization Product: gcc Version: 13.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component: middle-end Assign

[Bug middle-end/106081] missed vectorization

2022-06-24 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106081 --- Comment #1 from Jan Hubicka --- This is an attempt to vectorize by hand, but it seems we do not generate vpmovsxwd for the vector short->double conversion struct pixels { short a __attribute__ ((vector_size(4*2))); } *pixels; struct

[Bug middle-end/106081] missed vectorization

2022-06-27 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106081 --- Comment #4 from Jan Hubicka --- Thanks! It seems that imagemagick has quite a few loops that involve consuming shorts and producing doubles. Also it would be nice to do something about __builtin_convertvector so it does not produce stupid code

[Bug ipa/105682] [12/13 Regression] Both `-Wsuggest-attribute=pure` and `-Wsuggest-attribute=const` on same function since r12-5177-g494bdadf28d0fb35

2022-07-18 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105682 --- Comment #6 from Jan Hubicka --- gcc-12.1.0 (bogus warning: `caller()` has no right to be const; it calls a pure function, and that function even contains inline assembly): I think the conclusion here is correct. callee has pure attribute a

[Bug ipa/105676] [12/13 Regression] Bogus `-Wsuggest-attribute=pure` on function marked `__attribute__((const))` since r12-5437-g09a4ffb72aa2f513

2022-07-18 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105676 --- Comment #3 from Jan Hubicka --- Such code is not that obviously safe. It is possible that getval will get inlined into some calls and not others within a single function. In that case the calling function will read and modify cache variable and

[Bug ipa/101257] [11/12 Regression] Maybe wrong code since IPA mod ref was introduced

2021-06-30 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101257 Jan Hubicka changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |hubicka at gcc dot gnu.org

[Bug ipa/101270] error: inlining failed in call to ‘always_inline’ ‘open.localalias’: function not inlinable with -fPIC -fno-semantic-interposition

2021-06-30 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101270 Jan Hubicka changed: What|Removed |Added Status|UNCONFIRMED |ASSIGNED Assignee|unassigned at

[Bug tree-optimization/92740] induct2 (from polyhedron) regresses 267% with -O2 -ftree-vectorize -ftree-slp-vectorize -fvect-cost-modes=dynamic or cheap compared to -O2

2021-08-14 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92740 Jan Hubicka changed: What|Removed |Added Summary|induct2 (from polyhedron) |induct2 (from polyhedron)

[Bug middle-end/101908] New: cray regression with -O2 -ftree-slp-vectorize compared to -O2

2021-08-14 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101908 Bug ID: 101908 Summary: cray regression with -O2 -ftree-slp-vectorize compared to -O2 Product: gcc Version: 12.0 Status: UNCONFIRMED Severity: normal

[Bug middle-end/101909] New: 73% regression on tfft benchmark for -O2 -ftree-loop-vectorize compared to -O2 on zen hardware

2021-08-14 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101909 Bug ID: 101909 Summary: 73% regression on tfft benchmark for -O2 -ftree-loop-vectorize compared to -O2 on zen hardware Product: gcc Version: 12.0 Status: UNCONFIRMED

[Bug middle-end/101910] New: tsvc regressions for -O2 -ftree-loop-vectorize at zen hardware

2021-08-14 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101910 Bug ID: 101910 Summary: tsvc regressions for -O2 -ftree-loop-vectorize at zen hardware Product: gcc Version: 12.0 Status: UNCONFIRMED Severity: normal

[Bug target/101296] Addition of x86 addsub SLP patterned slowed down 433.milc by 12% on znver2 with -Ofast -flto

2021-08-22 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101296 --- Comment #7 from Jan Hubicka --- "every access" means that we no longer track individual bases+offsets+sizes and everything matching the base/ref alias set will be considered conflicting. I planned to implement smarter merging of accesses so

[Bug ipa/101257] [11/12 Regression] Maybe wrong code since IPA mod ref was introduced

2021-08-22 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101257 Jan Hubicka changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug rtl-optimization/97836] wrong code at -O1 on x86_64-pc-linux-gnu by r11-5029

2021-08-22 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=97836 Jan Hubicka changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug ipa/97565] -flto -ipa-pta ICE: at cgraph_node::get_untransformed_body()

2021-08-23 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=97565 --- Comment #6 from Jan Hubicka --- has_gimple_body_p really cares about the WPA unit (we should probably note that in the comment). Here you seem to have a function that is in the WPA translation unit but lands in a different partition and in that

[Bug tree-optimization/113787] [12/13/14 Regression] Wrong code at -O with ipa-modref on aarch64

2024-02-13 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113787 --- Comment #13 from Jan Hubicka --- So my understanding is that ivopts does something like offset = &base2 - &base1 and then translate val = base2[i] to val = *((base1+i)+offset) Where (base1+i) is then an iv variable. I wonder if we con

[Bug middle-end/113907] [14 regression] ICU miscompiled since on x86 since r14-5109-ga291237b628f41

2024-02-14 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113907 Jan Hubicka changed: What|Removed |Added CC||hubicka at gcc dot gnu.org --- Comment #2

[Bug ipa/113291] [14 Regression] compilation never (?) finishes with recursive always_inline functions at -O and above since r14-2172

2024-02-14 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113291 --- Comment #4 from Jan Hubicka --- There is a cap in want_inline_self_recursive_call_p which gives up on inlining after reaching max recursive inlining depth of 8. Problem is that the tree here is too wide. After early inlining f0 contains 4 ca

[Bug ipa/113291] [14 Regression] compilation never (?) finishes with recursive always_inline functions at -O and above since r14-2172

2024-02-14 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113291 --- Comment #5 from Jan Hubicka --- There is a cap in want_inline_self_recursive_call_p which gives up on inlining after reaching max recursive inlining depth of 8. Problem is that the tree here is too wide. After early inlining f0 contains 4 ca

[Bug middle-end/111054] [14 Regression] ICE: in to_sreal, at profile-count.cc:472 with -O3 -fno-guess-branch-probability since r14-2967

2024-02-14 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111054 Jan Hubicka changed: What|Removed |Added Status|NEW |RESOLVED Resolution|---

[Bug ipa/113291] [14 Regression] compilation never (?) finishes with recursive always_inline functions at -O and above since r14-2172

2024-02-14 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113291 --- Comment #6 from Jan Hubicka --- Created attachment 57427 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=57427&action=edit patch The patch makes compilation finish in reasonable time. I ended up needing to drop DISREGARD_INLINE_

[Bug middle-end/113907] [14 regression] ICU miscompiled since on x86 since r14-5109-ga291237b628f41

2024-02-15 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113907 --- Comment #31 from Jan Hubicka --- Having a testcase is great. I was just playing with crafting one. I am still concerned about value ranges in ipa-prop's jump functions. Let me see if I can modify the testcase to also trigger problem with val

[Bug middle-end/113907] [14 regression] ICU miscompiled since on x86 since r14-5109-ga291237b628f41

2024-02-16 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113907 --- Comment #39 from Jan Hubicka --- This testcase #include <stdio.h> int data[100]; __attribute__((noinline)) int bar (int d, unsigned int d2) { if (d2 > 10) printf ("Bingo\n"); return d + d2; } int test2 (unsigned int i) { if (i > 10) _

[Bug middle-end/113907] [12/13/14 regression] ICU miscompiled since on x86 since r14-5109-ga291237b628f41

2024-02-16 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113907 Jan Hubicka changed: What|Removed |Added Summary|[14 regression] ICU |[12/13/14 regression] ICU

[Bug ipa/111960] [14 Regression] ICE: during GIMPLE pass: rebuild_frequencies: SIGSEGV (Invalid read of size 4) with -fdump-tree-rebuild_frequencies-all

2024-02-16 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=111960 --- Comment #5 from Jan Hubicka --- hmm. cfg.cc:815 for me is: fputs (", maybe hot", outf); which seems quite safe. The problem does not seem to reproduce for me: jh@ryzen3:~/gcc/build/gcc> ./xgcc -B ./ tt.c -O --param=max-inline-r

[Bug ipa/108802] [11/12/13/14 Regression] missed inlining of call via pointer to member function

2024-02-16 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108802 --- Comment #5 from Jan Hubicka --- I don't think we can reasonably expect every caller of lambda function to be early inlined, so we need to extend ipa-prop to understand the obfuscated code. I discussed that with Martin some time ago - I thi

[Bug tree-optimization/114052] [11/12/13/14 Regression] Wrong code at -O2 for well-defined infinite loop

2024-02-22 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114052 --- Comment #5 from Jan Hubicka --- So if I understand it right, you want to determine the property that if the loop header is executed then BB containing undefined behavior at that iteration will be executed, too. modref tracks if function wil

[Bug lto/85432] Wodr can be more verbose for C code

2024-03-03 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85432 Jan Hubicka changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---

[Bug tree-optimization/114207] [12/13/14 Regression] modref gets confused by vecotorized code ` -O3 -fno-tree-forwprop` since r12-5439

2024-03-03 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114207 Jan Hubicka changed: What|Removed |Added Status|NEW |ASSIGNED Assignee|unassigned at

[Bug debug/92387] [11/12/13 Regression] gcc generates wrong debug information at -O1 since r10-1907-ga20f263ba1a76a

2024-03-04 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92387 --- Comment #5 from Jan Hubicka --- The revision is changing inlining decisions, so it would probably be possible to reproduce the problem without that change with the right always_inline and noinline attributes.

[Bug lto/114241] False-positive -Wodr warning when using -flto and -fno-semantic-interposition

2024-03-06 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114241 Jan Hubicka changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |hubicka at gcc dot gnu.org

[Bug ipa/106716] Identical Code Folding (-fipa-icf) confuses between functions with different [[likely]] attributes

2024-03-10 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106716 --- Comment #6 from Jan Hubicka --- The reason why GIMPLE_PREDICT is ignored is that it is never used after ipa-icf and gets removed at the very beginning of late optimizations. GIMPLE_PREDICT is consumed by profile_generate pass which is run

[Bug ipa/113907] [11/12/13/14 regression] ICU miscompiled since on x86 since r14-5109-ga291237b628f41

2024-03-13 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113907 --- Comment #55 from Jan Hubicka --- > Anyway, can we in the spot my patch changed just walk all > source->node->callees > cgraph_edges, for each of them find the corresponding > cgraph_edge in the alias > and for each walk all the jump_functi

[Bug ipa/113907] [11/12/13/14 regression] ICU miscompiled since on x86 since r14-5109-ga291237b628f41

2024-03-14 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113907 --- Comment #58 from Jan Hubicka --- Created attachment 57702 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=57702&action=edit Compare value ranges in jump functions This patch implements the jump function compare, however it is not good

[Bug ipa/113907] [11/12/13/14 regression] ICU miscompiled since on x86 since r14-5109-ga291237b628f41

2024-03-14 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113907 --- Comment #59 from Jan Hubicka --- just to explain what happens in the testcase. There is test and testb. They are almost the same: int testb(void) { struct bar *fp; test2 ((void *)&fp); fp = NULL; (*ptr)++; test3 ((void *)&fp); } the

[Bug testsuite/109596] [14 Regression] Lots of guality testcase fails on x86_64 after r14-162-gcda246f8b421ba

2024-03-19 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109596 --- Comment #4 from Jan Hubicka --- The change makes loop iteration estimates more realistic, but does not introduce any new code that actually changes the IL, so it seems this makes an existing problem more visible. I will try to debug what happ

[Bug testsuite/109596] [14 Regression] Lots of guality testcase fails on x86_64 after r14-162-gcda246f8b421ba

2024-03-19 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109596 --- Comment #6 from Jan Hubicka --- On this testcase trunk does get same dump as gcc13 for pass just before ch2 with ch2 we get: @@ -192,9 +236,8 @@ # DEBUG BEGIN_STMT goto ; [100.00%] - [local count: 954449105]: + [local count: 9544

[Bug testsuite/109596] [14 Regression] Lots of guality testcase fails on x86_64 after r14-162-gcda246f8b421ba

2024-03-19 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109596 Jan Hubicka changed: What|Removed |Added Status|NEW |ASSIGNED Assignee|unassigned at

[Bug gcov-profile/113765] [14 Regression] ICE: autofdo: val-profiler-threads-1.c compilation, error: probability of edge from entry block not initialized

2024-03-26 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113765 --- Comment #6 from Jan Hubicka --- Running auto-fdo without guessing branch probabilities is a somewhat odd idea in general. I suppose we can indeed just avoid setting full_profile flag. Though the optimization passes are not that much tested to

[Bug ipa/109817] internal error in ICF pass on Ada interfaces

2024-04-02 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109817 Jan Hubicka changed: What|Removed |Added CC||hubicka at gcc dot gnu.org --- Comment #5

[Bug middle-end/110757] New: 7% parest regression on zen3 -Ofast -march=native -flto between g:4dbb3af1efe55174 (2023-07-14 00:54) and g:a5088dc3f5ef73c8 (2023-07-17 03:24)

2023-07-20 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110757 Bug ID: 110757 Summary: 7% parest regression on zen3 -Ofast -march=native -flto between g:4dbb3af1efe55174 (2023-07-14 00:54) and g:a5088dc3f5ef73c8 (2023-07-17 03:24) Prod

[Bug target/110758] New: 8% hmmer regression on zen1 and zen3 with -Ofast -march=native -flto between g:8377cf1bf41a0a9d (2023-07-05 01:46) and g:3a61ca1b9256535e (2023-07-06 16:56) and g:d76d19c9bc5e

2023-07-20 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110758 Bug ID: 110758 Summary: 8% hmmer regression on zen1 and zen3 with -Ofast -march=native -flto between g:8377cf1bf41a0a9d (2023-07-05 01:46) and g:3a61ca1b9256535e (2023-07-06

[Bug middle-end/110757] [14 Regression] 7% parest regression on zen3 -Ofast -march=native -flto between g:4dbb3af1efe55174 (2023-07-14 00:54) and g:a5088dc3f5ef73c8 (2023-07-17 03:24)

2023-07-26 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110757 --- Comment #4 from Jan Hubicka --- Most of the profile based regression is gone between g:1c6231c05bdccab3 (2023-07-21 03:06) and g:f33fdf9e7c038639 (2023-07-23 00:17) This should be: commit a31ef26b056d0c4f0a9f08b6eb81456ea257298e Author: Ja

[Bug target/110758] [14 Regression] 8% hmmer regression on zen1/3 with -Ofast -march=native -flto between g:8377cf1bf41a0a9d (2023-07-05 01:46) and g:3a61ca1b9256535e (2023-07-06 16:56); g:d76d19c9bc5

2023-07-26 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110758 Jan Hubicka changed: What|Removed |Added Last reconfirmed||2023-07-26 Status|UNCONFIRMED

[Bug middle-end/110832] New: 14% capacita -O2 regression between g:9fdbd7d6fa5e0a76 (2023-07-26 01:45) and g:ca912a39cccdd990 (2023-07-27 03:44) on zen3 and core

2023-07-27 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110832 Bug ID: 110832 Summary: 14% capacita -O2 regression between g:9fdbd7d6fa5e0a76 (2023-07-26 01:45) and g:ca912a39cccdd990 (2023-07-27 03:44) on zen3 and core Product: gcc

[Bug middle-end/110832] 14% capacita -O2 regression between g:9fdbd7d6fa5e0a76 (2023-07-26 01:45) and g:ca912a39cccdd990 (2023-07-27 03:44) on zen3 and core

2023-07-27 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110832 --- Comment #1 from Jan Hubicka --- This time it seems that there is only one profile change: commit 645c67f80c6258c1f54ec567f604008adbdb8a04 Author: Jan Hubicka Date: Wed Jul 26 08:59:23 2023 +0200 Fix profile_count::to_sreal_scale

[Bug tree-optimization/106293] [13/14 Regression] 456.hmmer at -Ofast -march=native regressed by 19% on zen2 and zen3 in July 2022

2023-07-27 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106293 --- Comment #15 from Jan Hubicka --- if (bb_loop_depth (best_bb) == bb_loop_depth (early_bb) /* If result of comparsion is unknown, prefer EARLY_BB. Thus use !(...>=..) rather than (...<...) */ - && !(best_bb->count * 10

[Bug middle-end/110833] New: gamess regression on Ice Lake with -Ofast -march=native between g:1c6231c05bdccab3 (2023-07-21 03:06) and g:bbc1a102735c72e3 (2023-07-23 04:55)

2023-07-27 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110833 Bug ID: 110833 Summary: gamess regression on Ice Lake with -Ofast -march=native between g:1c6231c05bdccab3 (2023-07-21 03:06) and g:bbc1a102735c72e3 (2023-07-23 04:55) Prod

[Bug tree-optimization/106293] [13/14 Regression] 456.hmmer at -Ofast -march=native regressed by 19% on zen2 and zen3 in July 2022

2023-07-27 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106293 --- Comment #16 from Jan Hubicka --- It is really hard to make loop splitting to do something. It does not like canonicalized invariant variables since loop exit condition should not be NE_EXPR and it does not like when VRP turns LT/GT into NE.

[Bug other/110831] [14 regression] gcc.dg/stack-check-3.c ICEs after r14-2822-g499b8079a6419b

2023-07-27 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110831 Jan Hubicka changed: What|Removed |Added Last reconfirmed||2023-07-28 Status|UNCONFIRMED

[Bug tree-optimization/77689] Missing vectorization lead to huge performance loss

2023-07-28 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=77689 Jan Hubicka changed: What|Removed |Added CC||hubicka at gcc dot gnu.org --- Comment #15

[Bug tree-optimization/77689] Missing vectorization lead to huge performance loss

2023-07-28 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=77689 --- Comment #16 from Jan Hubicka --- I am testing the following that makes loop splitting understand when first iteration is special. diff --git a/gcc/tree-ssa-loop-split.cc b/gcc/tree-ssa-loop-split.cc index 70cd0aaefa7..1fd3ee1d1e5 100644 ---

[Bug tree-optimization/77689] Missing vectorization lead to huge performance loss

2023-07-28 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=77689 --- Comment #17 from Jan Hubicka --- I posted the patch. With it we split the loop, but we don't get really big improvements from that h@ryzen3:~/gcc/build3/gcc> ./xgcc -B ./ -Ofast c.ii -S -fopt-info 2>&1 | grep split ; perf stat ./a.out c.C:15

[Bug target/110758] [14 Regression] 8% hmmer regression on zen1/3 with -Ofast -march=native -flto between g:8377cf1bf41a0a9d (2023-07-05 01:46) and g:3a61ca1b9256535e (2023-07-06 16:56); g:d76d19c9bc5

2023-07-31 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110758 Jan Hubicka changed: What|Removed |Added Resolution|--- |FIXED Status|NEW

[Bug middle-end/26163] [meta-bug] missed optimization in SPEC (2k17, 2k and 2k6 and 95)

2023-07-31 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=26163 Bug 26163 depends on bug 110758, which changed state. Bug 110758 Summary: [14 Regression] 8% hmmer regression on zen1/3 with -Ofast -march=native -flto between g:8377cf1bf41a0a9d (2023-07-05 01:46) and g:3a61ca1b9256535e (2023-07-06 16:56); g:d7

[Bug tree-optimization/106293] [13/14 Regression] 456.hmmer at -Ofast -march=native regressed by 19% on zen2 and zen3 in July 2022

2023-07-31 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106293 --- Comment #21 from Jan Hubicka --- Fixing loop distribution and vectorizer profile update seems to do the trick with profile feedback. Without we are still worse than in July last year on zen2 tester (zen3 and ice lake seems to behave differen

[Bug middle-end/110857] aarch64-linux-gnu profiledbootstrap broken

2023-08-01 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110857 --- Comment #1 from Jan Hubicka --- The sanity check fires since the profile counts involved are not compatible, which should never happen within a single function. Would it be possible to dump them? From debugger one should be able to call p this->

[Bug tree-optimization/106293] [13/14 Regression] 456.hmmer at -Ofast -march=native regressed by 19% on zen2 and zen3 in July 2022

2023-08-01 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106293 --- Comment #23 from Jan Hubicka --- Thanks, I think I will need to work out the remaining vectorizer problems. One issue seems to be interaction with loop distribution. Loop distribution seems to introduce alias checks that are later removed

[Bug tree-optimization/110852] [14 Regression] ICE: in get_predictor_value, at predict.cc:2695 with -O -fno-tree-fre and __builtin_expect()

2023-08-01 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110852 --- Comment #1 from Jan Hubicka --- This is likely g:eab57b825bcc350e9ff44eb2fa739a80199d9bb1 which fixed prediction order and uncovered a latent bug in combining predictions with known probabilities. I will take a look.

[Bug tree-optimization/106293] [13/14 Regression] 456.hmmer at -Ofast -march=native regressed by 19% on zen2 and zen3 in July 2022

2023-08-02 Thread hubicka at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106293 --- Comment #24 from Jan Hubicka --- g:2e93b92c1ec5fbbbe10765c6e059c3c90d564245 fixes the profile update after cancelled distribution. However it does not help hmmer since we actually vectorize that loop iterating 0 times. We need to figure out
