https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82210
--- Comment #7 from Jakub Jelinek <jakub at gcc dot gnu.org> --- The difference with that revision is: addq $2, %rax cmpq %rcx, %rax jne .L3 - leaq (%r12,%rdx), %rdx + andq $-16, %rdx xorl %eax, %eax + leaq (%r12,%rdx), %rdx