https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114860
--- Comment #8 from prathamesh3492 at gcc dot gnu.org ---
Hi Tamar,

Using -falign-loops=5 does indeed bring back the performance. With -falign-loops=5 (which reduces the misalignment to 4), the adrp instruction has the same address (0x4ae784) with and without a2f4be3dae0. So I guess this really is a code-alignment issue?

(Also, in our latest builds the regression has seemingly gone away without any adjustments to code alignment.)

Thanks,
Prathamesh
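
For readers following the alignment arithmetic, here is a minimal sketch of how the offset of an instruction address from an alignment boundary can be checked. The helper name `misalignment` and the list of boundaries are illustrative assumptions; the comment does not say which boundary the "4" is measured against, so the sketch simply tries several power-of-two boundaries for the quoted address 0x4ae784.

```python
def misalignment(address: int, boundary: int) -> int:
    """Return how many bytes `address` lies past the previous multiple of `boundary`.

    Hypothetical helper for illustration; `boundary` must be a power of two.
    """
    if boundary <= 0 or boundary & (boundary - 1):
        raise ValueError("boundary must be a positive power of two")
    # For a power-of-two boundary, the low bits of the address are the offset.
    return address & (boundary - 1)


# Address of the adrp instruction quoted in the comment.
ADRP_ADDRESS = 0x4AE784

# Assumption: these candidate boundaries are examples only; the comment does
# not state which one applies.
for boundary in (8, 16, 32, 64):
    offset = misalignment(ADRP_ADDRESS, boundary)
    print(f"{boundary:>2}-byte boundary: offset {offset}")
```

For this address the offset is 4 for each of the boundaries tried, which is consistent with the misalignment of 4 mentioned above, though it does not by itself show which boundary matters for the affected core.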