------- Comment #19 from bonzini at gnu dot org 2009-02-09 13:37 ------- Also, Dwarak, here the change is not from
addps (%rax, %rsi), %xmm1 to movps (%rax, %rsi), %xmm0 addps %xmm0, %xmm1 but rather from movps %xmm0, %xmm1 addps (%rax, %rsi), %xmm1 to the second snippet above. Does this pessimize on AMD too? I don't think so, it should be 1 uop less, but I'd rather have confirmation. -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=38824