https://gcc.gnu.org/bugzilla/show_bug.cgi?id=70773
--- Comment #22 from PeteVine <tulipawn at gmail dot com> --- > I don't know what exactly "fixed" this That would be nice to know. This I can say for sure: gcc 7.2.1 20171116 still produces slower profiled code on the target system. I've also discovered, compiling and profiling on a binary compatible Cortex A17 system (same flags), produces binaries that don't run any slower on the target system.