[Bug tree-optimization/79390] 10% performance drop in SciMark2 LU after r242550

2017-02-24 Thread jakub at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79390 --- Comment #11 from Jakub Jelinek --- Which is what I wrote in #c9. Let's add a target hook and do the cap in there, rather than artificially lowering the max-rtl-if-conversion-insns default.

[Bug tree-optimization/79390] 10% performance drop in SciMark2 LU after r242550

2017-02-24 Thread ubizjak at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79390 --- Comment #10 from Uroš Bizjak --- (In reply to Jakub Jelinek from comment #8) > Try --param=max-rtl-if-conversion-insns=2 . > Or try some -mtune that is not: > /* X86_TUNE_ONE_IF_CONV_INSNS: Restrict a number of set insns to be >if-convert

[Bug tree-optimization/79390] 10% performance drop in SciMark2 LU after r242550

2017-02-24 Thread jakub at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79390 --- Comment #9 from Jakub Jelinek --- Perhaps we just want to limit number of cmov converted insns in a bb rather than all noce* attempts, or we should also check how far is the user of the conditional move; on this exact testcase we turn one of

[Bug tree-optimization/79390] 10% performance drop in SciMark2 LU after r242550

2017-02-24 Thread jakub at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79390 --- Comment #8 from Jakub Jelinek --- Try --param=max-rtl-if-conversion-insns=2 . Or try some -mtune that is not: /* X86_TUNE_ONE_IF_CONV_INSNS: Restrict a number of set insns to be if-converted to one. */ DEF_TUNE (X86_TUNE_ONE_IF_CONV_INSN,

[Bug tree-optimization/79390] 10% performance drop in SciMark2 LU after r242550

2017-02-24 Thread ubizjak at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79390 --- Comment #7 from Uroš Bizjak --- The testcase for RTL if-conversion part: --cut here-- extern double A[32]; int foo (void) { double t = A[0]; int jp = 0; int i; for (i = 0; i < 32; i++) { double ab = A[i]; if (ab > t

[Bug tree-optimization/79390] 10% performance drop in SciMark2 LU after r242550

2017-02-24 Thread rguenther at suse dot de
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79390 --- Comment #6 from rguenther at suse dot de --- On Fri, 24 Feb 2017, ubizjak at gmail dot com wrote: > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79390 > > Uroš Bizjak changed: > >What|Removed |Added > -

[Bug tree-optimization/79390] 10% performance drop in SciMark2 LU after r242550

2017-02-24 Thread ubizjak at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79390 Uroš Bizjak changed: What|Removed |Added CC||jakub at gcc dot gnu.org --- Comment #5 fr

[Bug tree-optimization/79390] 10% performance drop in SciMark2 LU after r242550

2017-02-06 Thread rguenth at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79390 --- Comment #4 from Richard Biener --- On trunk I see with -fno-split-paths: .L5: movq(%r14,%rdx,8), %rcx vmovsd (%rcx,%rbx), %xmm0 vandpd %xmm3, %xmm0, %xmm0 vucomisd%xmm1, %xmm0 jbe .L4

[Bug tree-optimization/79390] 10% performance drop in SciMark2 LU after r242550

2017-02-06 Thread krister.walfridsson at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79390 --- Comment #3 from krister.walfridsson at gmail dot com --- Correction: -fno-split-paths does not help the trunk compiler. But it restores the result when using the r242550 compiler...

[Bug tree-optimization/79390] 10% performance drop in SciMark2 LU after r242550

2017-02-06 Thread krister.walfridsson at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79390 --- Comment #2 from krister.walfridsson at gmail dot com --- No, I get the same reduced performance when using -fno-split-paths

[Bug tree-optimization/79390] 10% performance drop in SciMark2 LU after r242550

2017-02-06 Thread rguenth at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79390 Richard Biener changed: What|Removed |Added Keywords||missed-optimization Status|