https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119291
--- Comment #3 from Andrew Pinski <pinskia at gcc dot gnu.org> --- Works on aarch64 and the gimple level loops the same between x86_64 and aarch64. I suspect this is either a target issue or a rtl optimization issue.