https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114544
--- Comment #3 from Hongtao Liu <liuhongt at gcc dot gnu.org> ---
0000000000000000 <__umodti3>:
...
  58:   66 48 0f 6e c7          movq   %rdi,%xmm0
  5d:   66 48 0f 6e d6          movq   %rsi,%xmm2
  62:   66 0f 6c c2             punpcklqdq %xmm2,%xmm0
  66:   0f 29 44 24 f0          movaps %xmm0,-0x10(%rsp)
  6b:   48 8b 44 24 f0          mov    -0x10(%rsp),%rax
  70:   48 8b 54 24 f8          mov    -0x8(%rsp),%rdx
  75:   5b                      pop    %rbx
  76:   c3                      ret

Looks like the misoptimization is also in __umodti3.