https://gcc.gnu.org/bugzilla/show_bug.cgi?id=90056
Bug ID: 90056 Summary: 548.exchange2_r regressions on AMD Zen Product: gcc Version: 9.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component: middle-end Assignee: unassigned at gcc dot gnu.org Reporter: jamborm at gcc dot gnu.org CC: hubicka at gcc dot gnu.org, marxin at gcc dot gnu.org Blocks: 26163 Target Milestone: --- Host: x86_64-linux Target: x86_64-linux As of revision 270053, the 548.exchange2_r benchmark from SPEC 2017 INTrate suite suffered a number of smaller regressions on AMD Zen CPUs: - At -O2, it is 4.5% slower than when compiled with GCC 7 - At -Ofast, it is 4.7% slower than when compiled with GCC 8 - At -Ofast -march=native -mutine=native, this difference is 6.9% - At -Ofast and native tuning, it is 6% slower with PGO than without it. According to https://lnt.opensuse.org/db_default/v4/SPEC/spec_report/options the last regression on a different Ryzen CPU is 6.8 and PGO+LTO is 8.2% slower than just native -Ofast. Bisecting does not help much because the performance of the benchmark has varied a lot. For example in September there was no PGO regression but only because the non-PGO executable was equally slow. I only have data from February from an Intel machine, but there I only saw the native -Ofast regression, but it might have gone away meanwhile. Referenced Bugs: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=26163 [Bug 26163] [meta-bug] missed optimization in SPEC (2k17, 2k and 2k6 and 95)