------- Comment #7 from rguenth at gcc dot gnu dot org 2008-08-18 15:22 ------- That is, GCCs inner loop is
.L6: addl $1, %eax addsd %xmm12, %xmm11 cmpl $100000000, %eax addsd %xmm14, %xmm3 addsd %xmm15, %xmm2 addsd %xmm13, %xmm1 jne .L6 which doesn't necessarily look slower than ICCs. -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=31079