------- Additional Comments From rguenth at tat dot physik dot uni-tuebingen dot de 2004-12-06 14:31 ------- Subject: Re: [4.0 Regression] Inlining limits cause 340% performance regression
On 6 Dec 2004, hubicka at ucw dot cz wrote: > > > the order of inlining decisions affecting this. I would be curious how > > > those results compare to leafify and whether the 0m27s is not caused by > > > missoptimization. > > > > You can check for misoptimization by looking at the final output. > > I.e. the rh,vx,vy and vz sums should be nearly zero, the T sum > > will increase with the number of iterations. > > > > With mainline, -O2 -fpeel-loops -march=pentium4 -ffast-math > > -D__NO_MATH_INLINES (we still need explicit -fpeel-loops for > > unrolling for (i=0;i<3;++i) a[i]=0;), I need 0m17s for -n 10 with > > leafification turned on, with it turned off, runtime increases > > to 0m31s with --param inline-unit-growth=175. > > I compiled with -O3, would be possible for you to measure how much > speedup you get on mainline with -O3 and -O3+lefify? That would > probably allow me relate those numbers somehow. 0m23s for -O3+leafify, 1m54s for -O3, 0m35s for -O3 --param inline-unit-growth=150. Richard. -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=18704