------- Comment #26 from rguenth at gcc dot gnu dot org 2010-06-28 09:35 ------- (In reply to comment #21) > With the patch in comment #19, the test suite pass with -m32 and -m64, but for > gfortran.dg/transpose_2.f90 which needs an adjustment of the dg-error. > > AFAICT the SUM of the different tests are scalarized (it would be interesting > to have some timings before and after the patch for 465.tonto).
Estimated Estimated Base Base Base Peak Peak Peak Benchmarks Ref. Run Time Ratio Ref. Run Time Ratio -------------- ------ --------- --------- ------ --------- --------- 465.tonto 9840 390 25.2 S 9840 361 27.3 S 465.tonto 9840 391 25.2 S 9840 361 27.3 * 465.tonto 9840 390 25.2 * 9840 362 27.2 S ============================================================================== 465.tonto 9840 390 25.2 * 9840 361 27.3 * base is trunk r161367, peak is trunk r161367 + the patch from comment #19. Flags are -O3 -ffast-math -funroll-loops -march=core2, executed on an iCore7 @ 3.3GHz. Thus your patch improves performance by 7.5% which is very nice and even above what I expected (compared to manual source manipulation of the hottest SUM). -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=43829