On Sun, 31 Jul 2011, Paolo Bonzini wrote:
On Sat, Jul 30, 2011 at 19:21, Dimitrios Apostolou <ji...@gmx.net> wrote:
Nevertheless I'd appreciate comments on whether any part of this patch is
worth keeping. FWIW I've profiled this on i386 to be about 4 M instr slower
out of ~1.5 G inst. I'll be now checking the profiler to see where exactly
the overhead is.

I suggest -fdump-tree-all too, to check if unrolling is happening and
if not why.

I don't fully understand the output from -fdump-tree-all, but my conclusion based also on profiler output and objdump, is that both unrolling and inlining is happening in both versions. Nevertheless I can see that assembly output is a bit different in the two cases (I can post specific disassembly output if you are interested).

My opinion is that code cleanup is worth the minor overhead, given that there should be no regressions.


Thanks,
Dimitris

Reply via email to