[Bug inline-asm/87733] local register variable not honored with earlyclobber

2020-03-14 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87733 --- Comment #14 from Alexander Monakov --- Just to clarify, the two testcases added in the quoted commit don't try to catch the issue discussed here: that the operand is passed in a wrong register.

[Bug inline-asm/87733] local register variable not honored with earlyclobber

2020-03-14 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87733 --- Comment #21 from Alexander Monakov --- > I could guess the compiler might ignore your inputs/outputs that you specify > if you don't have any % usages for them. Are you seriously suggesting that examples in the GCC manual are invalid and ev

[Bug rtl-optimization/94728] [haifa-sched][restore_pattern] recalculate INSN_TICK for the dependence type of REG_DEP_CONTROL

2020-04-23 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94728 Alexander Monakov changed: What|Removed |Added CC||abel at gcc dot gnu.org Reso

[Bug bootstrap/91972] Bootstrap should use -Wmissing-declarations

2020-05-05 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91972 --- Comment #1 from Alexander Monakov --- Another reason to have -Wmissing-declarations is that otherwise mismatches of unused functions are not caught until it's too late (mismatching definition is assumed to be an overload of the function decla

[Bug bootstrap/91972] Bootstrap should use -Wmissing-declarations

2020-05-05 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91972 --- Comment #4 from Alexander Monakov --- > Why is it missing the static keyword then? (Or alternatively, why isn't it in > an anonymous namespace?) Huh? Without the warning developers may simply forget to put the 'static' keyword. With the war

[Bug c++/95103] Unexpected -Wclobbered in bits/vector.tcc with -O2

2020-05-14 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95103 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug rtl-optimization/95123] [10/11 Regression] Wrong code w/ -O2 -fselective-scheduling2 -funroll-loops --param early-inlining-insns=5 --param loop-invariant-max-bbs-in-loop=3 --param max-jump-thread

2020-05-14 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95123 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug c++/95103] Unexpected -Wclobbered in bits/vector.tcc with -O2

2020-05-14 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95103 --- Comment #5 from Alexander Monakov --- No, this analogy does not work. setjmp both sets up a buffer and receives control, so it corresponds to both try and catch together. A matching "C++" code would look like: > void f3() { > std::vector

[Bug rtl-optimization/95123] [10/11 Regression] Wrong code w/ -O2 -fselective-scheduling2 -funroll-loops --param early-inlining-insns=5 --param loop-invariant-max-bbs-in-loop=3 --param max-jump-thread

2020-05-14 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95123 --- Comment #6 from Alexander Monakov --- Oh, you're probably configuring your compiler with --enable-default-pie. Please paste the entire gcc -v. I can reproduce the miscompile it if I pass -fpie -pie.

[Bug c/95379] Don't warn about the universal zero initializer for a structure with the 'designated_init' attribute.

2020-05-28 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95379 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug target/95435] bad builtin memcpy performance with znver1/znver2 and 32bit

2020-05-30 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95435 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug target/95435] bad builtin memcpy performance with znver1/znver2 and 32bit

2020-06-01 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95435 --- Comment #8 from Alexander Monakov --- There's no tuning tables for memcmp at all, existing structs cover only memset and memcpy. So as far as I see retuning memset/memcpy doesn't need to wait for [1], because there's no infrastructure in plac

[Bug ipa/95558] Invalid IPA optimizations based on weak definition

2020-06-06 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95558 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug other/92396] -ftime-trace support

2020-07-28 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92396 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug c/96420] -Wsign-extensions warnings are generated from system header macros

2020-08-02 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96420 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug tree-optimization/96633] missed optimization?

2020-08-17 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96633 --- Comment #2 from Alexander Monakov --- Martin added me to CC so I assume he wants me to chime in. First of all, I find Nathan's behavior in that gcc@ thread distasteful at best (but if you ask me, such responses are simply more harm than good

[Bug tree-optimization/96672] Missing -Wclobbered diagnostic, or: __attribute__((returns_twice)) does not inhibit constant folding across call site

2020-08-18 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96672 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug middle-end/95189] [9/10 Regression] memcmp being wrongly stripped like strcmp

2020-09-03 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95189 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug libstdc++/96942] std::pmr::monotonic_buffer_resource causes CPU cache misses

2020-09-07 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96942 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug libstdc++/96942] std::pmr::monotonic_buffer_resource causes CPU cache misses

2020-09-08 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96942 --- Comment #9 from Alexander Monakov --- The most pronounced difference for depth=18 seems to be caused by m_b_r over-allocating by 2x: internally it mallocs 2x of the size given to the constructor, and then Linux pre-faults those extra pages, p

[Bug libstdc++/96942] std::pmr::monotonic_buffer_resource causes CPU cache misses

2020-09-08 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96942 --- Comment #14 from Alexander Monakov --- > It adds 11 bytes to the size given to the constructor (for its internal > bookkeeping) and then rounds up to a power of two. What is the purpose of this rounding up?

[Bug libstdc++/96942] std::pmr::monotonic_buffer_resource causes CPU cache misses

2020-09-08 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96942 --- Comment #18 from Alexander Monakov --- Huh? malloc is capable of splitting the tail of the last page for reuse in subsequent small allocations, why not let it do it? It will not be "wasted".

[Bug libstdc++/96942] std::pmr::monotonic_buffer_resource causes CPU cache misses

2020-09-08 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96942 --- Comment #20 from Alexander Monakov --- Round up to 64 bytes (typical cache line size).

[Bug libstdc++/96942] std::pmr::monotonic_buffer_resource causes CPU cache misses

2020-09-08 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96942 --- Comment #23 from Alexander Monakov --- Are you benchmarking with bt_pmr_0thrd (attached in comment #3) with depth=18? On earlier tests there are other effects in play too.

[Bug target/97127] FMA3 code transformation leads to slowdown on Skylake

2020-09-21 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=97127 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug target/97127] FMA3 code transformation leads to slowdown on Skylake

2020-09-21 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=97127 --- Comment #4 from Alexander Monakov --- > More so, gcc variant occupies 2 reservation station entries (2 fused uOps) vs > 4 entries by de-transformed sequence. I don't think this is true for the test at hand? With base+offset memory operand th

[Bug inline-asm/92151] Spurious register copying

2019-10-21 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92151 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug middle-end/92250] valgrind: ira_traverse_loop_tree – Conditional jump or move depends on uninitialised value

2019-10-28 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92250 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug rtl-optimization/87047] [7/8/9 Regression] performance regression because of if-conversion

2019-11-05 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87047 --- Comment #16 from Alexander Monakov --- I'd like to backport this to gcc-9 branch and then close this bug (Richi already indicated that further backports are not desirable). Thoughts?

[Bug rtl-optimization/87047] [7/8/9 Regression] performance regression because of if-conversion

2019-11-06 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87047 Alexander Monakov changed: What|Removed |Added Status|NEW |RESOLVED Resolution|---

[Bug tree-optimization/92283] [10 Regression] 454.calculix miscomparison since r276645 with -O2 -march=znver2

2019-11-08 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92283 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug target/92462] [arm32] -ftree-pre makes a variable to be wrongly hoisted out

2019-11-12 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92462 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug target/92462] [arm32] -ftree-pre makes a variable to be wrongly hoisted out

2019-11-12 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92462 --- Comment #10 from Alexander Monakov --- > atomic_cmpxchg_func tries to cast 'dest' from uint8_t* to int* I made a typo here, I meant uint32_t rather than uint8_t, and there's no aliasing violation here as signedness difference is explicitly O

[Bug rtl-optimization/91161] [9/10 Regression] ICE in begin_move_insn, at sched-ebb.c:175

2019-11-20 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91161 --- Comment #3 from Alexander Monakov --- With -fno-dce, a NOTE_INSN_DELETED_LABEL appears between the last "real" insn in the basic block (a sibcall) and a barrier rtx: (call_insn/u/c 20 19 12 3 (call (mem:QI (symbol_ref:DI ("ni") [flags 0x3]

[Bug c++/92597] std::fma gives nan using -march=sandybridge+ with asm volatile

2019-11-20 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92597 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug c++/92572] Vague linkage does not work reliably when a matching segment is in a dynamically linked libarary on Linux

2019-11-22 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92572 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug tree-optimization/92645] Hand written vector code is 450 times slower when compiled with GCC compared to Clang

2019-11-25 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92645 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug tree-optimization/92768] [8/9/10 Regression] Maybe a wrong code for vector constants

2019-12-03 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92768 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug c++/92855] -fvisibility-inlines-hidden failing to hide out-of-line copies of certain inline member functions

2019-12-08 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92855 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug c++/92855] -fvisibility-inlines-hidden failing to hide out-of-line copies of certain inline member functions

2019-12-09 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92855 Alexander Monakov changed: What|Removed |Added Resolution|INVALID |DUPLICATE --- Comment #6 from Alexan

[Bug c++/47877] -fvisibility-inlines-hidden does not hide member template functions

2019-12-09 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=47877 Alexander Monakov changed: What|Removed |Added CC||thiago at kde dot org --- Comment #4

[Bug c++/47877] -fvisibility-inlines-hidden does not hide member template functions

2019-12-09 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=47877 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug rtl-optimization/92905] New: [10 Regression] Spills float-int union to memory

2019-12-11 Thread amonakov at gcc dot gnu.org
Priority: P3 Component: rtl-optimization Assignee: unassigned at gcc dot gnu.org Reporter: amonakov at gcc dot gnu.org Target Milestone: --- gcc-10 branch regressed for code that needs bitwise operations on floats: float f(float x) { union {float f

[Bug target/92905] [10 Regression] Spills float-int union to memory

2019-12-11 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92905 --- Comment #4 from Alexander Monakov --- Perhaps only xmm0 is problematic, as making xmm0 unused by adding a dummy argument brings back the old spill-free result: float my_copysign(float dummy, float x, float y) { union {float f; unsigned i

[Bug rtl-optimization/92953] New: Undesired if-conversion with overflow builtins

2019-12-16 Thread amonakov at gcc dot gnu.org
: rtl-optimization Assignee: unassigned at gcc dot gnu.org Reporter: amonakov at gcc dot gnu.org Target Milestone: --- Consider: /* Return 0 if a==b, any positive value if a>b, any negative value otherwise. */ int foo(int a, int b) { int c; if (__builtin_sub_overflow(a

[Bug target/92953] Undesired if-conversion with overflow builtins

2019-12-16 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92953 --- Comment #2 from Alexander Monakov --- Well, the aarch64 backend does not implement subv4 pattern in the first place, which would be required for efficient branchy code: foo: subsw0, w0, w1 b.vc.LBB0_2 mvn

[Bug target/66120] __builtin_add/sub_overflow for int32_t emit poor code on ARM

2019-12-16 Thread amonakov at gcc dot gnu.org
||amonakov at gcc dot gnu.org Resolution|--- |FIXED --- Comment #5 from Alexander Monakov --- Looks like the documentation was added in r230651, overflow patterns for arm in r239739, and for arm64 in r262890.

[Bug target/92953] Undesired if-conversion with overflow builtins

2019-12-16 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92953 --- Comment #4 from Alexander Monakov --- At least then GCC should try to use cmovno instead of seto-test-cmove for if-conversion: foo: movl%edi, %eax subl%esi, %eax notl%eax orl $1, %eax s

[Bug c/93031] Wish: When the underlying ISA does not force pointer alignment, option to make GCC not assume it

2019-12-21 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93031 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug target/93039] New: Fails to use SSE bitwise ops for float-as-int manipulations

2019-12-21 Thread amonakov at gcc dot gnu.org
Severity: normal Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: amonakov at gcc dot gnu.org Target Milestone: --- (the non-regression part of PR 92905) libm functions need to manipulate individual bits of float

[Bug target/92905] [10 Regression] Spills float-int union to memory

2019-12-21 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92905 --- Comment #8 from Alexander Monakov --- (In reply to Alexander Monakov from comment #0) > Eventually it would be nicer to use SSE bitwise operations for this, for > example LLVM already generates > f: > orps.LCPI0_0(%rip), %xmm0 Th

[Bug tree-optimization/93055] accumulation loops in stepanov_vector benchmark use more instruction level parpallelism

2019-12-24 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93055 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug tree-optimization/93055] accumulation loops in stepanov_vector benchmark use more instruction level parpallelism

2019-12-24 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93055 --- Comment #2 from Alexander Monakov --- Can you attach preprocessed source and double-check command-line flags? I can't reproduce the problem with lea, and the code does not have explicit prefetch instructions that I get with -O3 -march=bdver1

[Bug tree-optimization/93056] Poor codegen for heapsort in stephanov_vector benchmark

2019-12-24 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93056 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug tree-optimization/93055] accumulation loops in stepanov_vector benchmark use more instruction level parpallelism

2019-12-24 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93055 --- Comment #4 from Alexander Monakov --- The attachment is edited to test insertion_sort, and doesn't call accumulate_vector at all - looks like you attached a wrong file?

[Bug c/93072] [8/9/10 Regression] ICE: gimplifier segfault with undefined nested function

2019-12-25 Thread amonakov at gcc dot gnu.org
Status|UNCONFIRMED |NEW Last reconfirmed||2019-12-25 CC||amonakov at gcc dot gnu.org Summary|ICE: Segmentation fault |[8/9/10 Regression] ICE

[Bug target/93078] Missing fma and round functions auto-vectorization with x86-64 (sse2)

2019-12-27 Thread amonakov at gcc dot gnu.org
Status|UNCONFIRMED |NEW Last reconfirmed||2019-12-27 CC||amonakov at gcc dot gnu.org Component|tree-optimization |target Ever confirmed|0 |1 --- Comment #1

[Bug rtl-optimization/49330] Integer arithmetic on addresses optimised with pointer arithmetic rules

2019-12-30 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=49330 --- Comment #29 from Alexander Monakov --- (In reply to Alexander Cherepanov from comment #28) > I see the same even with pure pointers. I guess RTL doesn't care about such > differences but it means the problem could bite a relatively innocent c

[Bug target/29776] result of ffs/clz/ctz/popcount/parity are already sign-extended

2019-12-31 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=29776 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug rtl-optimization/93165] avoidable 2x penalty on unpredicted overwrite

2020-01-06 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93165 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug target/93039] Fails to use SSE bitwise ops for float-as-int manipulations

2020-01-08 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93039 --- Comment #3 from Alexander Monakov --- > The question is for which CPUs is it actually faster to use SSE? In the context of chains where the source and the destination need to be SSE registers, pretty much all CPUs? Inter-unit moves typically

[Bug target/93039] Fails to use SSE bitwise ops for float-as-int manipulations

2020-01-09 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93039 --- Comment #5 from Alexander Monakov --- Ah, in that sense. The extra load is problematic in cold code where it's likely a TLB miss. For hot code: the load does not depend on any previous computations and so does not increase dependency chains.

[Bug target/93274] target_clones produces symbols with random digits with -fPIC

2020-01-15 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93274 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug testsuite/90565] [10 regression] test cases gcc.dg/uninit-18.c and uninit-pr90394-1-gimple.c broken as of r271460

2020-01-17 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=90565 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug c/93278] huge almost empty array takes huge time to compile and produces huge object file

2020-01-18 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93278 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug target/91838] [8/9 Regression] incorrect use of shr and shrx to shift by 64, missed optimization of vector shift

2020-01-23 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91838 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug target/91824] unnecessary sign-extension after _mm_movemask_epi8 or __builtin_popcount

2020-01-23 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91824 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug rtl-optimization/93402] [8/9/10 Regression] Wrong code when returning padded struct

2020-01-23 Thread amonakov at gcc dot gnu.org
||2020-01-23 CC||amonakov at gcc dot gnu.org Summary|Wrong code when returning |[8/9/10 Regression] Wrong |padded struct |code when returning padded

[Bug middle-end/90348] [8/9/10 Regression] Partition of char arrays is incorrect in some cases

2020-01-23 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=90348 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug middle-end/90348] [8/9/10 Regression] Partition of char arrays is incorrect in some cases

2020-01-23 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=90348 --- Comment #19 from Alexander Monakov --- (In reply to Michael Matz from comment #18) > represent all accesses indirectly via pointers Would that be necessary in presence of a verifier that ensures that all references are dominated by births?

[Bug target/91838] [8/9 Regression] incorrect use of shr and shrx to shift by 64, missed optimization of vector shift

2020-01-23 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91838 --- Comment #5 from Alexander Monakov --- Ah, indeed, it should be explicitly UB, and the documentation should mention that as well as that implicit integer promotion does not happen for vector shifts and other operations.

[Bug tree-optimization/93301] Wrong optimization: instability of uninitialized variables leads to nonsense

2020-01-26 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93301 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug tree-optimization/93444] New: [8/9/10 Regression] unswitching introduces unconditional use of uninitialized variable

2020-01-26 Thread amonakov at gcc dot gnu.org
Keywords: wrong-code Severity: normal Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: amonakov at gcc dot gnu.org CC: ch3root at openwall dot com Target Milestone: --- Splitting out bug

[Bug tree-optimization/93301] Wrong optimization: instability of uninitialized variables leads to nonsense

2020-01-26 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93301 --- Comment #8 from Alexander Monakov --- Pasted that to new PR 93444 (should have done that right away, sorry).

[Bug tree-optimization/93444] [8/9/10 Regression] ssa-loop-im introduces unconditional use of uninitialized variable

2020-01-26 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93444 Alexander Monakov changed: What|Removed |Added Summary|[8/9/10 Regression] |[8/9/10 Regression]

[Bug tree-optimization/93444] [8/9/10 Regression] ssa-loop-im introduces unconditional use of uninitialized variable

2020-01-27 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93444 --- Comment #5 from Alexander Monakov --- The problem is lifting a conditional access. We don't have an example where lifting an invariant from an always-executed block in a loop to its preheader poses a problem. LLVM adopted an approach where h

[Bug tree-optimization/93491] Wrong optimization: const-function moved over control flow leading to crashes

2020-01-30 Thread amonakov at gcc dot gnu.org
||amonakov at gcc dot gnu.org --- Comment #4 from Alexander Monakov --- (In reply to Alexander Cherepanov from comment #2) > > Do you have a testcase were gcc does this optimize without the user adding > > const and still traps? > > No. I'll file a separat

[Bug tree-optimization/93521] 40% slower in O2 than O1 (tree-pre)

2020-01-31 Thread amonakov at gcc dot gnu.org
||amonakov at gcc dot gnu.org Resolution|--- |DUPLICATE --- Comment #1 from Alexander Monakov --- Dup. *** This bug has been marked as a duplicate of bug 93056 ***

[Bug tree-optimization/93056] Poor codegen for heapsort in stephanov_vector benchmark

2020-01-31 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93056 Alexander Monakov changed: What|Removed |Added CC||hehaochen at hotmail dot com --- Com

[Bug c++/92572] Vague linkage does not work reliably when a matching segment is in a dynamically linked libarary on Linux

2020-02-10 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92572 --- Comment #5 from Alexander Monakov --- GCC is emitting static_local as @gnu_unique_object, so it should be unified by the Glibc dynamic linker. You can use 'nm -CD' to check its type after linking for the main executable and the library to mak

[Bug rtl-optimization/88879] [9 Regression] ICE in sel_target_adjust_priority, at sel-sched.c:3332

2020-02-11 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88879 --- Comment #15 from Alexander Monakov --- This should not be reproducible with current HEAD because the assert was simply eliminated. If GCC master definitely fails, can you please provide the exact diagnostic? As for 9.2 this is sadly expected

[Bug tree-optimization/93734] [8/9/10 Regression] Invalid code generated with -O2 -march=haswell -ftree-vectorize

2020-02-13 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93734 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug rtl-optimization/88879] [9 Regression] ICE in sel_target_adjust_priority, at sel-sched.c:3332

2020-02-14 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88879 Alexander Monakov changed: What|Removed |Added Status|NEW |RESOLVED Resolution|---

[Bug rtl-optimization/93743] [9/10 Regression] swapped arguments in atan2l

2020-02-14 Thread amonakov at gcc dot gnu.org
-*-* Status|UNCONFIRMED |NEW Keywords||wrong-code Last reconfirmed||2020-02-14 Component|c |rtl-optimization CC||amonakov at gcc dot gnu.org

[Bug rtl-optimization/93743] [9/10 Regression] swapped arguments in atan2l

2020-02-14 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93743 Alexander Monakov changed: What|Removed |Added CC||uros at gcc dot gnu.org Com

[Bug tree-optimization/93745] Redundant store not eliminated with intermediate instruction

2020-02-14 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93745 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug middle-end/93744] [8/9/10 Regression] Different results between gcc-9 and gcc-7

2020-02-14 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93744 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug tree-optimization/93745] Redundant store not eliminated with intermediate instruction

2020-02-14 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93745 --- Comment #4 from Alexander Monakov --- Placement new is translated to a plain pointer assignment on GIMPLE, so optimizers cannot distinguish programs that had placement new from programs that did not. (in C we need memory from malloc to be re

[Bug gcov-profile/93623] No need to dump gcdas when forking

2020-02-21 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93623 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug c/93848] missing -Warray-bounds warning for array subscript 1 is outside array bounds

2020-02-21 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93848 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug target/93934] Unnecessary fld of uninitialized float stack variable results in ub of valid C++ code

2020-02-26 Thread amonakov at gcc dot gnu.org
||amonakov at gcc dot gnu.org Resolution|--- |INVALID --- Comment #2 from Alexander Monakov --- fcmov can only raise an x87 fpu exception on x87 stack underflow, which cannot happen here. Even if it did raise FE_INVALID for SNaNs, note that GCC

[Bug target/93934] Unnecessary fld of uninitialized float stack variable results in ub of valid C++ code

2020-02-26 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93934 --- Comment #5 from Alexander Monakov --- Ah, indeed. fld won't raise FE_INVALID for 80-bit long double, but here 'result' is stored on the stack in 64-bit format. So: fcmov and 80-bit fldt don't trap, 32-bit flds and 64-bit fldl do. Somehow RT

[Bug target/93934] Unnecessary fld of uninitialized float stack variable results in ub of valid C++ code

2020-02-26 Thread amonakov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93934 --- Comment #8 from Alexander Monakov --- I think regstack is fine as x87 only supports computations in its native 80-bit format and conversions to/from ieee float/double happen only on memory loads/stores. > I suppose a fldt followed by "trunca

[Bug middle-end/56077] [4.6/4.7/4.8 Regression] volatile ignored when function inlined

2013-02-04 Thread amonakov at gcc dot gnu.org
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=56077 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug target/56200] queens benchmark is faster with -O0 than with any other optimization level

2013-02-04 Thread amonakov at gcc dot gnu.org
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=56200 --- Comment #2 from Alexander Monakov 2013-02-04 21:36:38 UTC --- (In reply to comment #1) > What happens if you also use -fno-ivopts ? For me, -fno-ivopts gives a small improvement, but still slower than -O0. I think the slowdown is r

[Bug target/56200] queens benchmark is faster with -O0 than with any other optimization level

2013-02-05 Thread amonakov at gcc dot gnu.org
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=56200 Alexander Monakov changed: What|Removed |Added CC||hjl.tools at gmail dot com,

[Bug sanitizer/56393] SIGSEGV when -fsanitize=address and dynamic lib with global objects

2013-02-21 Thread amonakov at gcc dot gnu.org
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=56393 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug c/56507] GCC -march=native for Core2Duo

2013-03-04 Thread amonakov at gcc dot gnu.org
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=56507 Alexander Monakov changed: What|Removed |Added CC||amonakov at gcc dot gnu.org

[Bug other/39851] gcc -Q --help=target does not list extensions selected by -march=

2013-03-04 Thread amonakov at gcc dot gnu.org
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=39851 Alexander Monakov changed: What|Removed |Added CC||bratsinot at gmail dot com

[Bug tree-optimization/53265] Warn when undefined behavior implies smaller iteration count

2013-03-11 Thread amonakov at gcc dot gnu.org
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=53265 --- Comment #10 from Alexander Monakov 2013-03-11 16:15:36 UTC --- (In reply to comment #8) > Not sure about the warning wording What about (... "iteration %E invokes undefined behavior", max)? > plus no idea how to call the warning o

  1   2   3   4   5   6   7   8   9   10   >