https://gcc.gnu.org/bugzilla/show_bug.cgi?id=53659
--- Comment #3 from PeteVine <tulipawn at gmail dot com> ---
Curiously, up to gcc 6, targeting Cortex-A5 made virtually no difference. In gcc 7, however, generic codegen takes an 8% hit, while -mcpu=cortex-a5 performs roughly the same as before. (That's a separate issue, though, so FWIW.)