https://gcc.gnu.org/bugzilla/show_bug.cgi?id=70773
--- Comment #14 from wilco at gcc dot gnu.org ---
(In reply to PeteVine from comment #11)
> I've just retested gcc7 on both ARM platforms.
>
> AArch64 gets a 3% improvement now, while ARMv7 reproduces the issue, just as
> before. I'm compiling/profiling on a Cortex A5 which could be the main
> reason behind all this, as it doesn't have hard division.

Can you try comparing the .S outputs on both the Cortex-A5 and Cortex-A53
systems using the exact same options, i.e. -marm -mcpu=cortex-a5? Assuming
you're using the same GCC version, you should get identical .S files and the
same .gcda.
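
For what it's worth, the comparison could be scripted roughly as below. This is only a sketch, not something from the report: the helper names `diff_asm` and `same_bytes` and the file names in the usage note are hypothetical, and it assumes the .S and .gcda files from both boards have already been copied to one machine.

```python
import difflib
import filecmp
from pathlib import Path


def diff_asm(path_a, path_b):
    """Return unified-diff lines between two assembly files (empty list if identical)."""
    a = Path(path_a).read_text().splitlines(keepends=True)
    b = Path(path_b).read_text().splitlines(keepends=True)
    return list(difflib.unified_diff(a, b, fromfile=str(path_a), tofile=str(path_b)))


def same_bytes(path_a, path_b):
    """Byte-for-byte comparison, suitable for binary files such as .gcda."""
    # shallow=False forces a content comparison instead of trusting os.stat() data.
    return filecmp.cmp(path_a, path_b, shallow=False)
```

Calling `diff_asm("a5/test.S", "a53/test.S")` (hypothetical paths) returns an empty list when the two compilers produced the same assembly; any returned lines show where the outputs diverge. `same_bytes` does the equivalent check for the profile data.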