https://gcc.gnu.org/bugzilla/show_bug.cgi?id=53659
--- Comment #4 from PeteVine <tulipawn at gmail dot com> --- I've just done the obvious and run the resulting ARMv7 binaries on a Cortex A53 in aarch32 mode and the difference is there (GCC 6.2.1 and 7.0.0) so I can confirm the issue is present to this day. Cortex-A5 vs Cortex-A9 codegen yields a 0.81x performance ratio.