------- Comment #8 from jv244 at cam dot ac dot uk 2009-05-16 12:20 ------- (In reply to comment #7) > Subject: Re: missing > unrolling/scalarization/reassoc/free
so, double good news. First, the unrelated other testcase that speeds up by 30% does this thanks to this patch only. This is a really significant gain since it is a real CP2K run, not a micro-benchmark. Second, the current testcase gets great results, i.e. reproducing the hand-optimized code in efficiency using the unroll parameters and the no-ivopts flag. gfortran -O3 -march=native -ffast-math -funroll-loops -ffree-line-length-200 --param max-completely-peel-times=27 --param max-completely-peeled-insns=2000 -fno-ivopts test.f90 ; ./a.out Sparse: time[s] 0.24001500 New: time[s] 0.22401398 speedup 1.0714287 Glfops 2.8926766 Error: 0.0000000000000000 -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=40168