https://gcc.gnu.org/bugzilla/show_bug.cgi?id=77468
--- Comment #15 from James Greenhalgh <jgreenhalgh at gcc dot gnu.org> ---
(In reply to Aldy Hernandez from comment #13)
> The aarch64-linux-gnu regression originally reported for -mcpu=cortex-a53
> was caused by:
>
> commit 08993ad1c669cab64baf352f79cd7f8584dd8e0c
> Author: jgreenhalgh <jgreenhalgh@138bc75d-0d04-0410-961f-82ee72b054a4>
> Date: Thu Oct 1 09:33:40 2015 +0000
>
> [Patch 2/2 ARM/AArch64] Add a new Cortex-A53 scheduling model
>
> * config/arm/aarch-common-protos.h
> (aarch_accumulator_forwarding): New.
> (aarch_forward_to_shift_is_not_shifted_reg): Likewise.
> * config/arm/aarch-common.c (aarch_accumulator_forwarding): New.
> (aarch_forward_to_shift_is_not_shifted_reg): Liekwise.
> * config/arm/cortex-a53.md: Rewrite.
>
>
>
> git-svn-id: svn+ssh://gcc.gnu.org/svn/gcc/trunk@228324
> 138bc75d-0d04-0410-961f-82ee72b054a4

Could you dump the assembly of the hot loop before and after that change? I'll
see if we're doing anything particularly offensive with the scheduling. I still
don't have access to the source code for this benchmark, so a reduced testcase
would be appreciated.