[Bug gcov-profile/116743] [12/13/14/15 regression] Commit r12-5817-g3d9e6767939e96 causes ~10% perf regression w AutoFDO

2025-01-16 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116743 --- Comment #26 from Rama Malladi --- Thank you Eugene for the fix/ backports.

[Bug gcov-profile/116743] [12/13/14/15 regression] Commit r12-5817-g3d9e6767939e96 causes ~10% perf regression w AutoFDO

2025-01-10 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116743 --- Comment #23 from Rama Malladi --- Hi Eugene, Hope you are doing well. I am just checking-in on this patch commit to mainline GCC and back-ported. Thanks -Rama

[Bug gcov-profile/116743] [12/13/14/15 regression] Commit r12-5817-g3d9e6767939e96 causes ~10% perf regression w AutoFDO

2024-12-11 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116743 --- Comment #21 from Rama Malladi --- Hi Eugene, We verified that the GCC patch restores the PGO performance gain. The HammerDB workload shows an 11% PGO performance gain with the fix, compared to a 2% gain without the fix. Thanks for the fix.

[Bug gcov-profile/116743] [12/13/14/15 regression] Commit r12-5817-g3d9e6767939e96 causes ~10% perf regression w AutoFDO

2024-12-03 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116743 --- Comment #20 from Rama Malladi --- > I propose the patch below. Rama, can you please check if this resolves your > perf regression? Hi Eugene, Thanks for this investigation and proposed fix. I can give it a try and update in a day or two.

[Bug gcov-profile/116743] [12/13/14/15 regression] Commit r12-5817-g3d9e6767939e96 causes ~10% perf regression w AutoFDO

2024-11-11 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116743 --- Comment #15 from Rama Malladi --- (In reply to Rama Malladi from comment #14) > Thanks Eugene. Were you able to review the repro and propose a fix? Hi Eugene, checking again. It would be great if you can look into it. Thanks.

[Bug gcov-profile/116743] [12/13/14/15 regression] Commit r12-5817-g3d9e6767939e96 causes ~10% perf regression w AutoFDO

2024-10-25 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116743 --- Comment #14 from Rama Malladi --- Thanks Eugene. Were you able to review the repro and propose a fix?

[Bug gcov-profile/116743] [12/13/14/15 regression] Commit r12-5817-g3d9e6767939e96 causes ~10% perf regression w AutoFDO

2024-10-11 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116743 --- Comment #12 from Rama Malladi --- Hi Eugene, checking in again. Can you please review this issue and a fix? Thanks.

[Bug gcov-profile/116743] [12/13/14/15 regression] Commit r12-5817-g3d9e6767939e96 causes ~10% perf regression w AutoFDO

2024-10-04 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116743 --- Comment #11 from Rama Malladi --- Hi Eugene, Could you please review the test-case attached? Thanks.

[Bug gcov-profile/116743] [12/13/14/15 regression] Commit r12-5817-g3d9e6767939e96 causes ~10% perf regression w AutoFDO

2024-09-26 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116743 --- Comment #10 from Rama Malladi --- Created attachment 59207 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=59207&action=edit reproducer for funtion inlining issue - source This is a reproducer to show GCC 12.3.0 inlining issue w AutoFD

[Bug gcov-profile/116743] [12/13/14/15 regression] Commit r12-5817-g3d9e6767939e96 causes ~10% perf regression w AutoFDO

2024-09-26 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116743 --- Comment #9 from Rama Malladi --- Created attachment 59206 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=59206&action=edit reporducer for function inlining issue This is a reproducer to show GCC 12.3.0 inlining issue w AutoFDO due to

[Bug gcov-profile/116743] [12/13/14/15 regression] Commit r12-5817-g3d9e6767939e96 causes ~10% perf regression w AutoFDO

2024-09-26 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116743 --- Comment #8 from Rama Malladi --- Here attached is a compilation unit extracted from `MySQL` repo (https://github.com/mysql/mysql-server/blob/trunk/storage/innobase/handler/ha_innodb.cc) which shows the impact of commit `3d9e6767939e` on func

[Bug gcov-profile/116743] [12/13/14/15 regression] Commit r12-5817-g3d9e6767939e96 causes ~10% perf regression w AutoFDO

2024-09-21 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116743 --- Comment #7 from Rama Malladi --- I haven't been successful to create a reproducer yet. A simple `test.cc` as follows isn't showing this behavior as the compiler inlines these irrespective (at `-O3`). ``` #include #define N 30 void init_arra

[Bug gcov-profile/116743] [12/13/14/15 regression] Commit r12-5817-g3d9e6767939e96 causes ~10% perf regression w AutoFDO

2024-09-19 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116743 --- Comment #6 from Rama Malladi --- I am trying to create a reproducer for this issue. Interim, I wanted to share some stats I got from the MySQL build to highlight this issue w GCC 12.3.0 vs. 11.5.0. Executable Size (B)Baseline

[Bug gcov-profile/116743] [12/13/14/15 regression] Commit `3d9e6767939e` causes ~10% perf regression w AutoFDO

2024-09-17 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116743 --- Comment #5 from Rama Malladi --- (In reply to Eugene Rozenfeld from comment #4) > AutoFDO does work. I made a number of fixes and improvements over the last > several years, both in GCC (including fixing autoprofiledbootstrap) and in > googl

[Bug gcov-profile/116743] [12/13/14/15 regression] Commit `3d9e6767939e` causes ~10% perf regression w AutoFDO

2024-09-17 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116743 --- Comment #2 from Rama Malladi --- The 10% regression was observed on an `aarch64` architecture/ instance. We saw regression on an `x86_64` instance too. Here is the data: GCC version BaselineAutoFDO ---

[Bug gcov-profile/116743] New: Commit `3d9e6767939e` causes ~10% perf regression

2024-09-16 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116743 Bug ID: 116743 Summary: Commit `3d9e6767939e` causes ~10% perf regression Product: gcc Version: 12.3.0 Status: UNCONFIRMED Severity: normal Priority: P3 Compone

[Bug middle-end/26163] [meta-bug] missed optimization in SPEC (2k17, 2k and 2k6 and 95)

2024-09-16 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=26163 Bug 26163 depends on bug 114531, which changed state. Bug 114531 Summary: Feature proposal for an `-finline-functions-aggressive` compiler option https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114531 What|Removed

[Bug ipa/114531] Feature proposal for an `-finline-functions-aggressive` compiler option

2024-09-16 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114531 Rama Malladi changed: What|Removed |Added Resolution|--- |WONTFIX Status|UNCONFIRMED

[Bug ipa/114531] Feature proposal for an `-finline-functions-aggressive` compiler option

2024-09-16 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114531 --- Comment #24 from Rama Malladi --- I am closing this bug report as the feature request has been rejected by the committee after reviewing the same.

[Bug ipa/114531] Feature proposal for an `-finline-functions-aggressive` compiler option

2024-09-02 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114531 --- Comment #22 from Rama Malladi --- Checking in again... Hubicka@ and rsandifo@, Can we action this PR https://gcc.gnu.org/pipermail/gcc-patches/2024-June/655506.html? Accept or Reject? Thanks.

[Bug ipa/114531] Feature proposal for an `-finline-functions-aggressive` compiler option

2024-07-21 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114531 --- Comment #21 from Rama Malladi --- Thank you Hubicka@ and rsandifo@ for reviewing this feature request and supporting it. Could we go ahead and dispose this as accepted or rejected? Accordingly action the PR https://gcc.gnu.org/pipermail/gcc-

[Bug ipa/114531] Feature proposal for an `-finline-functions-aggressive` compiler option

2024-06-27 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114531 --- Comment #19 from Rama Malladi --- Thank you Hubicka@ for the inputs. I see your intent and that we have to revisit the inline parameter tuning. As I and Richard S mentioned, the intent of this feature request or PR is to expose such an optio

[Bug ipa/114531] Feature proposal for an `-finline-functions-aggressive` compiler option

2024-06-25 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114531 --- Comment #16 from Rama Malladi --- I had posted a patch at the URL below for this feature: https://gcc.gnu.org/pipermail/gcc-patches/2024-June/655506.html

[Bug ipa/114531] Feature proposal for an `-finline-functions-aggressive` compiler option

2024-06-25 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114531 --- Comment #15 from Rama Malladi --- Thanks for the comments and for giving us some history/ perspective. I agree with this statement, > Pushing up -O2 limits can make sense, but needs to be done carefully - > in longer term IMO we do not want

[Bug ipa/114531] Feature proposal for an `-finline-functions-aggressive` compiler option

2024-06-25 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114531 --- Comment #13 from Rama Malladi --- (In reply to Jan Hubicka from comment #12) > If this is without LTO, can you also try the LTO numbers? > Inliner behaves sifniciantly different with and without LTO, since LTO > introduces many (and often to

[Bug ipa/114531] Feature proposal for an `-finline-functions-aggressive` compiler option

2024-06-25 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114531 --- Comment #11 from Rama Malladi --- (In reply to Wilco from comment #10) > A 1.1% overall performance gain looks good - is there a significant codesize > hit from this? If so, are there slightly less aggressive settings that still > get most o

[Bug ipa/114531] Feature proposal for an `-finline-functions-aggressive` compiler option

2024-05-31 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114531 --- Comment #9 from Rama Malladi --- I wanted us to review this feature implementation given GCC 15 Stage 1 development has started. Thank you.

[Bug ipa/114531] Feature proposal for an `-finline-functions-aggressive` compiler option

2024-04-08 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114531 Rama Malladi changed: What|Removed |Added CC||rvmallad at amazon dot com --- Comment #

[Bug ipa/114531] Feature proposal for an `-finline-functions-aggressive` compiler option

2024-04-01 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114531 --- Comment #7 from Rama Malladi --- (In reply to Rama Malladi from comment #5) > (In reply to Andrew Pinski from comment #3) > > Also do you have numbers with lto enabled? Or is these without lto? > > > > Does LTO improve the situation for Env

[Bug ipa/114531] Feature proposal for an `-finline-functions-aggressive` compiler option

2024-03-29 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114531 --- Comment #5 from Rama Malladi --- (In reply to Andrew Pinski from comment #3) > Also do you have numbers with lto enabled? Or is these without lto? > > Does LTO improve the situation for Envoy too? These numbers are without lto. I haven't t

[Bug ipa/114531] Feature proposal for an `-finline-functions-aggressive` compiler option

2024-03-29 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114531 --- Comment #4 from Rama Malladi --- (In reply to Andrew Pinski from comment #1) > Maybe we should figure out why the increase of the limits help and add extra > code to get better heuristics rather than just tweaking the limits. > > I know tha

[Bug driver/114531] New: Feature proposal for an `-finline-functions-aggressive` compiler option

2024-03-29 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114531 Bug ID: 114531 Summary: Feature proposal for an `-finline-functions-aggressive` compiler option Product: gcc Version: 14.0 Status: UNCONFIRMED Severity: normal

[Bug sanitizer/97696] ICE since ASAN_MARK does not handle poly_int sized varibales

2024-03-05 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=97696 --- Comment #5 from Rama Malladi --- Thank you Richard for this patch/ fix.

[Bug sanitizer/97696] ICE since ASAN_MARK does not handle poly_int sized varibales

2024-01-29 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=97696 Rama Malladi changed: What|Removed |Added CC||rvmallad at amazon dot com --- Comment #2

[Bug tree-optimization/107409] Perf loss ~5% on 519.lbm_r SPEC cpu2017 benchmark with r10-5090-ga9a4edf0e71bba

2023-03-29 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107409 --- Comment #23 from Rama Malladi --- (In reply to Rama Malladi from comment #22) > I will close this issue as we were unable to reproduce the perf drop going > from gcc-7 to gcc-8 on a Graviton2 based instance. The performance of > 519.lbm_r bu

[Bug tree-optimization/107409] Perf loss ~5% on 519.lbm_r SPEC cpu2017 benchmark with r10-5090-ga9a4edf0e71bba

2023-03-29 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107409 --- Comment #22 from Rama Malladi --- I will close this issue as we were unable to reproduce the perf drop going from gcc-7 to gcc-8 on a Graviton2 based instance. The performance of 519.lbm_r built with gcc-7.4 was same as that with gcc-8.5.

[Bug tree-optimization/107409] Perf loss ~5% on 519.lbm_r SPEC cpu2017 benchmark with r10-5090-ga9a4edf0e71bba

2023-02-24 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107409 --- Comment #21 from Rama Malladi --- I did another triage for perf loss on Graviton 2 processor (neoverse-n1) based instance and found this commit: `a9a4edf0e71bbac9f1b5dcecdcf9250111d16889` to be the reason. As I had indicated in my earlier re

[Bug tree-optimization/107409] Perf loss ~5% on 519.lbm_r SPEC cpu2017 benchmark with r10-5090-ga9a4edf0e71bba

2023-02-20 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107409 --- Comment #20 from Rama Malladi --- @Martin J and @Sebastian P, Let me walk you through the perf data and my triage. First, my triage has been on Graviton 3 (neoverse-v1) processor based instances. Next, I was looking for perf delta going fro

[Bug tree-optimization/107409] Perf loss ~5% on 519.lbm_r SPEC cpu2017 benchmark with r10-5090-ga9a4edf0e71bba

2023-02-02 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107409 --- Comment #19 from Rama Malladi --- Thanks @Sebastian and @Martin J. I will get another bisect between GCC 7-and-8.

[Bug tree-optimization/107409] Perf loss ~5% on 519.lbm_r SPEC cpu2017 benchmark with r10-5090-ga9a4edf0e71bba

2023-01-08 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107409 --- Comment #15 from Rama Malladi --- Hi, Can we review this issue and suggest next steps/ action please? Thanks.

[Bug tree-optimization/107409] Perf loss ~5% on 519.lbm_r SPEC cpu2017 benchmark with r10-5090-ga9a4edf0e71bba

2022-12-12 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107409 --- Comment #14 from Rama Malladi --- (In reply to Martin Liška from comment #13) > Note the mentioned revision is a fix and yes, sometimes these revisions can > end up with a regression as profile estimation is a complex guess. Yes, possibly.

[Bug tree-optimization/107409] Perf loss ~5% on 519.lbm_r SPEC cpu2017 benchmark with r10-5090-ga9a4edf0e71bba

2022-12-09 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107409 --- Comment #12 from Rama Malladi --- I found difference in dumps at various stages of the compilation for the mainline GCC and with update_max_bb_count() commented. Here are the details: Mainline: Commit ID: 63a42ffc0833553fbcb84b50cf0fd2d867b

[Bug tree-optimization/107409] Perf loss ~5% on 519.lbm_r SPEC cpu2017 benchmark with r10-5090-ga9a4edf0e71bba

2022-12-08 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107409 --- Comment #11 from Rama Malladi --- (In reply to Martin Liška from comment #10) > @Honza ? Just checking if this can be fixed/ implemented. Thanks.

[Bug tree-optimization/107413] Perf loss ~14% on 519.lbm_r SPEC cpu2017 benchmark with r8-7132-gb5b33e113434be

2022-12-01 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107413 --- Comment #19 from Rama Malladi --- (In reply to Wilco from comment #17) > (In reply to Rama Malladi from comment #16) > > (In reply to Wilco from comment #15) > > > (In reply to Rama Malladi from comment #14) > > > > This fix also improved pe

[Bug tree-optimization/107413] Perf loss ~14% on 519.lbm_r SPEC cpu2017 benchmark with r8-7132-gb5b33e113434be

2022-12-01 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107413 --- Comment #18 from Rama Malladi --- (In reply to Wilco from comment #17) > (In reply to Rama Malladi from comment #16) > > (In reply to Wilco from comment #15) > > > (In reply to Rama Malladi from comment #14) > > > > This fix also improved pe

[Bug tree-optimization/107409] Perf loss ~5% on 519.lbm_r SPEC cpu2017 benchmark with r10-5090-ga9a4edf0e71bba

2022-11-30 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107409 --- Comment #9 from Rama Malladi --- (In reply to Martin Liška from comment #3) > Can you please share perf-profile before and after the revision? > > Note I can't see it for Altra aarch64 CPU: > https://lnt.opensuse.org/db_default/v4/SPEC/grap

[Bug tree-optimization/107413] Perf loss ~14% on 519.lbm_r SPEC cpu2017 benchmark with r8-7132-gb5b33e113434be

2022-11-29 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107413 --- Comment #16 from Rama Malladi --- (In reply to Wilco from comment #15) > (In reply to Rama Malladi from comment #14) > > This fix also improved performance of 538.imagick_r by 15%. Did you have a > > similar observation? Thank you. > > No,

[Bug tree-optimization/107413] Perf loss ~14% on 519.lbm_r SPEC cpu2017 benchmark with r8-7132-gb5b33e113434be

2022-11-29 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107413 --- Comment #14 from Rama Malladi --- This fix also improved performance of 538.imagick_r by 15%. Did you have a similar observation? Thank you.

[Bug tree-optimization/107413] Perf loss ~14% on 519.lbm_r SPEC cpu2017 benchmark with r8-7132-gb5b33e113434be

2022-11-28 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107413 --- Comment #13 from Rama Malladi --- (In reply to CVS Commits from comment #12) > The master branch has been updated by Wilco Dijkstra : > > https://gcc.gnu.org/g:0c1b0a23f1fe7db6a2e391b7cb78cff90032 > > commit r13-4291-g0c1b0a23f1fe7db6a

[Bug tree-optimization/107413] Perf loss ~14% on 519.lbm_r SPEC cpu2017 benchmark with r8-7132-gb5b33e113434be

2022-11-06 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107413 --- Comment #11 from Rama Malladi --- (In reply to Wilco from comment #10) > I'm seeing about 1.5% gain on Neoverse V1 and 0.5% loss on Neoverse N1. I'll > post a patch that allows per-CPU settings for FMA reassociation, so you'll > get good per

[Bug tree-optimization/107413] Perf loss ~14% on 519.lbm_r SPEC cpu2017 benchmark with r8-7132-gb5b33e113434be

2022-11-02 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107413 --- Comment #9 from Rama Malladi --- (In reply to Rama Malladi from comment #8) > (In reply to Wilco from comment #7) > > The revert results in about 0.5% loss on Neoverse N1, so it looks like the > > reassociation pass is still splitting FMAs i

[Bug tree-optimization/107413] Perf loss ~14% on 519.lbm_r SPEC cpu2017 benchmark with r8-7132-gb5b33e113434be

2022-11-01 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107413 --- Comment #8 from Rama Malladi --- (In reply to Wilco from comment #7) > The revert results in about 0.5% loss on Neoverse N1, so it looks like the > reassociation pass is still splitting FMAs into separate MUL and ADD (which > is bad for narr

[Bug tree-optimization/107413] Perf loss ~14% on 519.lbm_r SPEC cpu2017 benchmark

2022-10-28 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107413 --- Comment #6 from Rama Malladi --- The compilation options were: -Ofast -mcpu=native -flto

[Bug tree-optimization/107413] Perf loss ~14% on 519.lbm_r SPEC cpu2017 benchmark

2022-10-28 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107413 --- Comment #5 from Rama Malladi --- (In reply to Wilco from comment #2) > That's interesting - if the reassociation pass has become a bit smarter in > the last 5 years, we might no longer need this workaround. What is the > effect on the overal

[Bug c++/107433] 510.parest_r, call of overloaded 'back_interpolate' is ambiguous

2022-10-27 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107433 --- Comment #2 from Rama Malladi --- (In reply to Martin Liška from comment #1) > As mentioned slightly here: > https://www.spec.org/cpu2017/Docs/benchmarks/510.parest_r.html > please use -std=c++98 or something < c++17. Thank you. I had it for

[Bug tree-optimization/107409] Perf loss ~5% on 519.lbm_r SPEC cpu2017 benchmark

2022-10-27 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107409 --- Comment #8 from Rama Malladi --- (In reply to Mark Wielaard from comment #7) > The content of attachment 53773 [details] has been deleted for the following > reason: > > https://sourceware.org/pipermail/overseers/2022q4/019048.html Thank y

[Bug tree-optimization/107409] Perf loss ~5% on 519.lbm_r SPEC cpu2017 benchmark

2022-10-27 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107409 --- Comment #6 from Rama Malladi --- (In reply to Martin Liška from comment #5) > Please try writing here: overse...@sourceware.org I have asked for deletion. Thanks

[Bug c/107433] New: 510.parest_r, call of overloaded 'back_interpolate' is ambiguous

2022-10-27 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107433 Bug ID: 107433 Summary: 510.parest_r, call of overloaded 'back_interpolate' is ambiguous Product: gcc Version: 13.0 Status: UNCONFIRMED Severity: normal

[Bug tree-optimization/107409] Perf loss ~5% on 519.lbm_r SPEC cpu2017 benchmark

2022-10-27 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107409 --- Comment #4 from Rama Malladi --- Hi Martin, Thanks for the guidance. Can we delete the attachment from this bug report? Regards, Rama

[Bug tree-optimization/107413] Perf loss ~14% on 519.lbm_r SPEC cpu2017 benchmark

2022-10-26 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107413 --- Comment #3 from Rama Malladi --- I will get the effect of this revert for the overall SPEC FP score. I haven't tried experimenting with fp_reassoc_width values. Will try it and update.

[Bug tree-optimization/107413] Perf loss ~14% on 519.lbm_r SPEC cpu2017 benchmark

2022-10-26 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107413 --- Comment #1 from Rama Malladi --- $ /home/ubuntu/gccfixissue2/bin/gcc -v Using built-in specs. COLLECT_GCC=/home/ubuntu/gccfixissue2/bin/gcc COLLECT_LTO_WRAPPER=/home/ubuntu/gccfixissue2/libexec/gcc/aarch64-unknown-linux-gnu/13.0.0/lto-wrapp

[Bug tree-optimization/107409] Perf loss ~5% on 519.lbm_r SPEC cpu2017 benchmark

2022-10-26 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107409 --- Comment #1 from Rama Malladi --- $ /home/ubuntu/gccfixissue1/bin/gcc -v Using built-in specs. COLLECT_GCC=/home/ubuntu/gccfixissue1/bin/gcc COLLECT_LTO_WRAPPER=/home/ubuntu/gccfixissue1/libexec/gcc/aarch64-unknown-linux-gnu/13.0.0/lto-wrapp

[Bug tree-optimization/107413] New: Perf loss ~14% on 519.lbm_r SPEC cpu2017 benchmark

2022-10-26 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107413 Bug ID: 107413 Summary: Perf loss ~14% on 519.lbm_r SPEC cpu2017 benchmark Product: gcc Version: 13.0 Status: UNCONFIRMED Severity: normal Priority: P3 Componen

[Bug tree-optimization/107409] New: Perf loss ~5% on 519.lbm_r SPEC cpu2017 benchmark

2022-10-26 Thread rvmallad at amazon dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107409 Bug ID: 107409 Summary: Perf loss ~5% on 519.lbm_r SPEC cpu2017 benchmark Product: gcc Version: 13.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component