[Bug middle-end/67014] New: builtin_tolower is inefficient.

2015-07-25 Thread neleai at seznam dot cz
Assignee: unassigned at gcc dot gnu.org Reporter: neleai at seznam dot cz Target Milestone: --- Hi, When one uses __builtin_tolower it could be much slower to tolower. In this example builtin_tolower gets just expanded to call. If one uses tolower instead it gets expanded to

[Bug middle-end/66989] poor performance of builtin_isfinite on x64

2015-07-24 Thread neleai at seznam dot cz
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66989 --- Comment #2 from Ondrej Bilka --- Created attachment 36050 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=36050&action=edit testing script

[Bug middle-end/66989] poor performance of builtin_isfinite on x64

2015-07-24 Thread neleai at seznam dot cz
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66989 --- Comment #1 from Ondrej Bilka --- Created attachment 36049 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=36049&action=edit benchmark

[Bug middle-end/66989] New: poor performance of builtin_isfinite on x64

2015-07-24 Thread neleai at seznam dot cz
: middle-end Assignee: unassigned at gcc dot gnu.org Reporter: neleai at seznam dot cz Target Milestone: --- This is another part of considering floating classification builtin performance. This starts to be more cpu dependent as benchmark show large improvement for core2 but

[Bug target/66986] poor performance of __builtin_isinf on x64

2015-07-24 Thread neleai at seznam dot cz
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66986 Ondrej Bilka changed: What|Removed |Added Attachment #36047|0 |1 is obsolete|

[Bug target/66986] poor performance of __builtin_isinf on x64

2015-07-24 Thread neleai at seznam dot cz
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66986 --- Comment #4 from Ondrej Bilka --- Ok added updated benchmark with adding -mtune=native and tests for core2, haswell and fx10. It stays pretty consistent. don't inline conditional add branched real0m0.698s user0m0.698s sys 0m0.000

[Bug middle-end/66986] poor performance of __builtin_isinf on x64

2015-07-24 Thread neleai at seznam dot cz
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66986 --- Comment #1 from Ondrej Bilka --- Created attachment 36047 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=36047&action=edit testing script

[Bug middle-end/66986] New: poor performance of __builtin_isinf on x64

2015-07-24 Thread neleai at seznam dot cz
: middle-end Assignee: unassigned at gcc dot gnu.org Reporter: neleai at seznam dot cz Target Milestone: --- Created attachment 36046 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=36046&action=edit benchmark. Hi, On x64 floating builtins are considerably slow

[Bug libstdc++/66416] New: Regression: string::find 3.5 times slower than memrchr

2015-06-04 Thread neleai at seznam dot cz
Component: libstdc++ Assignee: unassigned at gcc dot gnu.org Reporter: neleai at seznam dot cz Target Milestone: --- Same problem as with strstr also applies here. As we know length we could compare that to memrchr. Again instead simply calling that an implementation is 3.5 times

[Bug libstdc++/66414] New: Regression: string::find ten times slower than strstr

2015-06-04 Thread neleai at seznam dot cz
Component: libstdc++ Assignee: unassigned at gcc dot gnu.org Reporter: neleai at seznam dot cz Target Milestone: --- Hi, as I seen bug with string::== being slower than using strcmp I decided to check other functions for regressions. Here string::find doesn't simply call opti

[Bug libstdc++/59048] operator== between std::string and const char* slower than strcmp

2015-06-02 Thread neleai at seznam dot cz
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=59048 Ondrej Bilka changed: What|Removed |Added CC||neleai at seznam dot cz --- Comment #13

[Bug middle-end/64247] malloc alignment and -mavx

2014-12-11 Thread neleai at seznam dot cz
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=64247 Ondrej Bilka changed: What|Removed |Added CC||neleai at seznam dot cz --- Comment #7

[Bug c++/60731] [4.7/4.8/4.9 Regression] dynamic library not getting reinitialized on multiple calls to dlopen()

2014-04-23 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=60731 Ondrej Bilka changed: What|Removed |Added CC||neleai at seznam dot cz --- Comment #9

[Bug c/46936] turn __attribute__ ((nonnull (x))) into assert in debug mode

2013-11-05 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=46936 --- Comment #3 from Ondrej Bilka --- > As per http://gcc.gnu.org/onlinedocs/gcc/Function-Attributes.html, the > annotation on the example function there "causes the compiler to check that, > in > calls to my_memcpy, arguments dest and src are non

[Bug c/46936] turn __attribute__ ((nonnull (x))) into assert in debug mode

2013-10-23 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=46936 Ondrej Bilka changed: What|Removed |Added CC||neleai at seznam dot cz --- Comment #1

[Bug target/58112] New: Ineffective addressing mode used in loop.

2013-08-09 Thread neleai at seznam dot cz
Assignee: unassigned at gcc dot gnu.org Reporter: neleai at seznam dot cz Hi, in following testcase gcc -O3 generates following loop: movq%rsi, %r9 subq%rdx, %r9 movq%r9, %rdi movq%r9, %rsi leaq16(%r9), %r8

[Bug target/58112] Ineffective addressing mode used in loop.

2013-08-09 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=58112 --- Comment #1 from Ondrej Bilka --- Created attachment 30628 --> http://gcc.gnu.org/bugzilla/attachment.cgi?id=30628&action=edit testcase

[Bug target/58110] New: Useless GPR push and pop when only xmm registers are used.

2013-08-09 Thread neleai at seznam dot cz
Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: neleai at seznam dot cz Hi,attached code generates extra push/pop rbx pair while there is no gpr register assigned in segment between them. This was generated by head xgcc -O3. A gcc-4.7 has

[Bug target/58110] Useless GPR push and pop when only xmm registers are used.

2013-08-09 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=58110 --- Comment #1 from Ondrej Bilka --- Created attachment 30627 --> http://gcc.gnu.org/bugzilla/attachment.cgi?id=30627&action=edit testcase

[Bug target/29776] result of ffs/clz/ctz/popcount/parity are already sign-extended

2013-07-04 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=29776 --- Comment #15 from Ondrej Bilka --- On Thu, Jul 04, 2013 at 07:46:07PM +, glisse at gcc dot gnu.org wrote: > http://gcc.gnu.org/bugzilla/show_bug.cgi?id=29776 > > --- Comment #14 from Marc Glisse --- > (In reply to Jakub Jelinek from comme

[Bug other/57668] Regression in vectorizing memcpy pattern.

2013-06-21 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=57668 --- Comment #1 from Ondrej Bilka --- Created attachment 30333 --> http://gcc.gnu.org/bugzilla/attachment.cgi?id=30333&action=edit benchmark for memcpy

[Bug other/57668] New: Regression in vectorizing memcpy pattern.

2013-06-21 Thread neleai at seznam dot cz
Assignee: unassigned at gcc dot gnu.org Reporter: neleai at seznam dot cz Hi, When I ran atached benchmark that test how gcc can optimize byte by byte memcpy(attached memcpy_byte.c) I got a regression on nehalem and ivy_bridge architectures. I ran it by commands ./run machine 2

[Bug target/54349] _mm_cvtsi128_si64 unnecessary stores value at stack

2013-04-26 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=54349 --- Comment #4 from Ondrej Bilka 2013-04-27 01:06:45 UTC --- I found that AMD Bulldozer optimization guide states that moves from xmm to GPR register should be done directly:" 10.4 Moving Data Between General-Purpose and XMM/YMM Registe

[Bug target/57056] Missed optimization of finite finite builtin

2013-04-25 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=57056 Ondrej Bilka changed: What|Removed |Added Attachment #29930|0 |1 is obsolete|

[Bug target/57056] New: Missed optimization of finite finite builtin

2013-04-24 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=57056 Bug #: 57056 Summary: Missed optimization of finite finite builtin Classification: Unclassified Product: gcc Version: 4.7.1 Status: UNCONFIRMED Severity: normal

[Bug target/56676] unnecesary splitted load when using avx2

2013-03-21 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=56676 --- Comment #2 from Ondrej Bilka 2013-03-21 14:53:26 UTC --- On Thu, Mar 21, 2013 at 01:30:42PM +, rguenth at gcc dot gnu.org wrote: > > > http://gcc.gnu.org/bugzilla/show_bug.cgi?id=56676 > > > > --- Comment #1 from Richard

[Bug target/56676] New: unnecesary splitted load when using avx2

2013-03-21 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=56676 Bug #: 56676 Summary: unnecesary splitted load when using avx2 Classification: Unclassified Product: gcc Version: 4.7.1 Status: UNCONFIRMED Severity: normal

[Bug other/56631] duplicated sse code in switch

2013-03-16 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=56631 --- Comment #1 from Ondrej Bilka 2013-03-16 11:36:04 UTC --- Created attachment 29678 --> http://gcc.gnu.org/bugzilla/attachment.cgi?id=29678 testcase

[Bug other/56631] New: duplicated sse code in switch

2013-03-16 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=56631 Bug #: 56631 Summary: duplicated sse code in switch Classification: Unclassified Product: gcc Version: unknown Status: UNCONFIRMED Severity: normal Priorit

[Bug other/56338] register spill caused by loading constant

2013-02-14 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=56338 --- Comment #1 from Ondrej Bilka 2013-02-15 07:42:10 UTC --- Created attachment 29461 --> http://gcc.gnu.org/bugzilla/attachment.cgi?id=29461 testcase

[Bug other/56338] New: register spill caused by loading constant

2013-02-14 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=56338 Bug #: 56338 Summary: register spill caused by loading constant Classification: Unclassified Product: gcc Version: unknown Status: UNCONFIRMED Severity: normal

[Bug target/56199] strcpy/strcat builtins for constant strings generates suboptimal code.

2013-02-04 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=56199 Ondrej Bilka changed: What|Removed |Added Status|RESOLVED|UNCONFIRMED Resolution|IN

[Bug target/56199] strcpy/strcat builtins for constant strings generates suboptimal code.

2013-02-04 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=56199 --- Comment #3 from Ondrej Bilka 2013-02-04 15:15:12 UTC --- Created attachment 29349 --> http://gcc.gnu.org/bugzilla/attachment.cgi?id=29349 icatche stressing benchmark

[Bug target/56199] strcpy/strcat builtins generates suboptimal code.

2013-02-04 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=56199 --- Comment #1 from Ondrej Bilka 2013-02-04 08:42:32 UTC --- Created attachment 29344 --> http://gcc.gnu.org/bugzilla/attachment.cgi?id=29344 benchmark

[Bug target/56199] New: strcpy/strcat builtins generates suboptimal code.

2013-02-04 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=56199 Bug #: 56199 Summary: strcpy/strcat builtins generates suboptimal code. Classification: Unclassified Product: gcc Version: unknown Status: UNCONFIRMED Severity: norma

[Bug target/55945] New: alloca aligns aligned pointers

2013-01-11 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=55945 Bug #: 55945 Summary: alloca aligns aligned pointers Classification: Unclassified Product: gcc Version: 4.8.0 Status: UNCONFIRMED Severity: normal Priority

[Bug tree-optimization/55600] excessive size of vectorized code

2012-12-26 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=55600 --- Comment #3 from Ondrej Bilka 2012-12-26 22:05:37 UTC --- Created attachment 29052 --> http://gcc.gnu.org/bugzilla/attachment.cgi?id=29052 benchmark

[Bug tree-optimization/55600] excessive size of vectorized code

2012-12-26 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=55600 --- Comment #2 from Ondrej Bilka 2012-12-26 22:03:59 UTC --- Yes when 128 is replaced by smaller constant. Attached patch gives on my i5 following: size 32 vector real0m0.224s user0m0.220s sys0m0.000s unroll real0

[Bug tree-optimization/55600] New: excessive size of vectorized code

2012-12-04 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=55600 Bug #: 55600 Summary: excessive size of vectorized code Classification: Unclassified Product: gcc Version: 4.8.0 Status: UNCONFIRMED Severity: normal Prior

[Bug c/54491] New: interval membership optimization

2012-09-05 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=54491 Bug #: 54491 Summary: interval membership optimization Classification: Unclassified Product: gcc Version: 4.8.0 Status: UNCONFIRMED Severity: normal Priority: P3

[Bug tree-optimization/54481] missed optimization: unnecessary indirect call

2012-09-05 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=54481 --- Comment #2 from Ondrej Bilka 2012-09-05 09:42:27 UTC --- On Wed, Sep 05, 2012 at 09:30:04AM +, rguenth at gcc dot gnu.org wrote: > http://gcc.gnu.org/bugzilla/show_bug.cgi?id=54481 > > Richard Guenther changed: > >What|

[Bug c/54481] New: missed optimization: unnecessary indirect call

2012-09-04 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=54481 Bug #: 54481 Summary: missed optimization: unnecessary indirect call Classification: Unclassified Product: gcc Version: 4.8.0 Status: UNCONFIRMED Severity: normal Pri

[Bug target/54349] _mm_cvtsi128_si64 unnecessary stores value at stack

2012-08-23 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=54349 Ondrej Bilka changed: What|Removed |Added Status|RESOLVED|UNCONFIRMED Resolution|INVALID

[Bug c/54360] New: missed optimalization: unnecessary indirect call

2012-08-23 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=54360 Bug #: 54360 Summary: missed optimalization: unnecessary indirect call Classification: Unclassified Product: gcc Version: 4.8.0 Status: UNCONFIRMED Severity: normal P

[Bug target/54349] New: _mm_cvtsi128_si64 unnecessary stores value at stack

2012-08-22 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=54349 Bug #: 54349 Summary: _mm_cvtsi128_si64 unnecessary stores value at stack Classification: Unclassified Product: gcc Version: 4.8.0 Status: UNCONFIRMED Severity: normal

[Bug tree-optimization/54116] suboptimal code for tight loops

2012-07-29 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=54116 --- Comment #2 from Ondrej Bilka 2012-07-29 10:30:46 UTC --- On Sun, Jul 29, 2012 at 10:13:41AM +, pinskia at gcc dot gnu.org wrote: > http://gcc.gnu.org/bugzilla/show_bug.cgi?id=54116 > > --- Comment #1 from Andrew Pinski 2012-07-29 > 10:

[Bug tree-optimization/54116] New: suboptimal code for tight loops

2012-07-29 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=54116 Bug #: 54116 Summary: suboptimal code for tight loops Classification: Unclassified Product: gcc Version: 4.7.1 Status: UNCONFIRMED Severity: normal Priority: P3

[Bug target/54115] New: Unnecessary sign extensions for __builtin_ctz et al.

2012-07-29 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=54115 Bug #: 54115 Summary: Unnecessary sign extensions for __builtin_ctz et al. Classification: Unclassified Product: gcc Version: 4.8.0 Status: UNCONFIRMED Severity: normal

[Bug target/53907] New: gcc uses unaligned load when aligned load was requested

2012-07-09 Thread neleai at seznam dot cz
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=53907 Bug #: 53907 Summary: gcc uses unaligned load when aligned load was requested Classification: Unclassified Product: gcc Version: 4.8.0 Status: UNCONFIRMED

[Bug c++/35527] New: function in template can depend on superclass

2008-03-10 Thread neleai at seznam dot cz
ned at gcc dot gnu dot org ReportedBy: neleai at seznam dot cz GCC target triplet: i486-linux-gnu http://gcc.gnu.org/bugzilla/show_bug.cgi?id=35527

[Bug c++/35525] New: function in template can depend on superclass

2008-03-10 Thread neleai at seznam dot cz
ned at gcc dot gnu dot org ReportedBy: neleai at seznam dot cz GCC target triplet: i486-linux-gnu http://gcc.gnu.org/bugzilla/show_bug.cgi?id=35525