[Bug c/31396] New: Inline code performance much worse than out-of-line

2007-03-29 Thread jamagallon at ono dot com
AssignedTo: unassigned at gcc dot gnu dot org
ReportedBy: jamagallon at ono dot com
GCC target triplet: i586-mandriva-linux-gnu
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=31396

[Bug c/31396] Inline code performance much worse than out-of-line

2007-03-29 Thread jamagallon at ono dot com
--- Comment #1 from jamagallon at ono dot com 2007-03-29 23:17 ---
Created an attachment (id=13298) --> (http://gcc.gnu.org/bugzilla/attachment.cgi?id=13298&action=view)
testcase

Simple test case with a loop in main() and a call to a function. Both just calculate the sum

[Bug c/31396] Inline code performance much worse than out-of-line

2007-03-29 Thread jamagallon at ono dot com
--- Comment #2 from jamagallon at ono dot com 2007-03-29 23:18 ---
Created an attachment (id=13299) --> (http://gcc.gnu.org/bugzilla/attachment.cgi?id=13299&action=view)
Makefile for testcase

Makefile to build tst.c.

-- 
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=31396

[Bug c/31396] Inline code performance much worse than out-of-line

2007-03-29 Thread jamagallon at ono dot com
--- Comment #3 from jamagallon at ono dot com 2007-03-29 23:22 ---
Sample assembler for the loops. For the function, out of line:

#APP
#FBGN
#NO_APP
    movl data, %edx
    fldz
    movl $1, %eax
.L2:
    fadds -4(%edx,%eax,4)
    addl $1, %eax
    cmpl $268435457

[Bug middle-end/31396] Inline code performance much worse than out-of-line

2007-03-29 Thread jamagallon at ono dot com
--- Comment #4 from jamagallon at ono dot com 2007-03-29 23:47 ---
Assembler for the Opteron.

out-of-line:
.L2:
    cvtss2sd (%rdx,%rax,4), %xmm0
    incq %rax
    cmpq $268435456, %rax
    addsd %xmm0, %xmm1
    jne .L2

inline:
.L11:
    cvtss2sd (%rdx,%rax,4), %xmm0
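
For readers without access to the attachment, here is a rough sketch of the kind of code that would produce loops like the ones quoted in this thread. It is not the attached tst.c. The function names, the signatures and the use of GCC's noinline and always_inline attributes are assumptions made for illustration. The only things taken from the thread are that floats are summed into a double (cvtss2sd followed by addsd) and that the loop runs over 268435456 elements.

```c
#include <stddef.h>

/* Hypothetical out-of-line variant: the noinline attribute (GCC-specific)
   keeps the loop in its own function body, as in the "out-of-line" listing. */
__attribute__((noinline))
double sum_out_of_line(const float *data, size_t n)
{
    double sum = 0.0;
    for (size_t i = 0; i < n; i++)
        sum += data[i];          /* each float is widened to double, then added */
    return sum;
}

/* Hypothetical inline variant: always_inline (GCC-specific) forces the same
   loop to be expanded into the caller, standing in for a loop written
   directly in main(). */
static inline __attribute__((always_inline))
double sum_inlined(const float *data, size_t n)
{
    double sum = 0.0;
    for (size_t i = 0; i < n; i++)
        sum += data[i];
    return sum;
}
```

Both functions compute the same value, so any timing difference between them would come from the code generated around the loop rather than from the arithmetic. To compare them, one could call each from main() on the same array and time the two calls separately.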