------- Comment #26 from rguenth at gcc dot gnu dot org 2010-06-28 09:35 -------
(In reply to comment #21)
> With the patch in comment #19, the test suite pass with -m32 and -m64, but for
> gfortran.dg/transpose_2.f90 which needs an adjustment of the dg-error.
>
> AFAICT the SUM of the different tests are scalarized (it would be interesting
> to have some timings before and after the patch for 465.tonto).

                                  Estimated                       Estimated
                Base     Base       Base        Peak     Peak       Peak
Benchmarks      Ref.   Run Time     Ratio       Ref.   Run Time     Ratio
-------------- ------  ---------  ---------    ------  ---------  ---------
465.tonto        9840        390       25.2 S    9840        361       27.3 S
465.tonto        9840        391       25.2 S    9840        361       27.3 *
465.tonto        9840        390       25.2 *    9840        362       27.2 S
==============================================================================
465.tonto        9840        390       25.2 *    9840        361       27.3 *

Base is trunk r161367, peak is trunk r161367 + the patch from comment #19.
Flags are -O3 -ffast-math -funroll-loops -march=core2, executed on a
Core i7 @ 3.3GHz.
Thus your patch improves performance by about 7.5% (median run time drops
from 390 s to 361 s), which is very nice and even above what I expected
(compared to manual source manipulation of the hottest SUM).
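
As a rough sanity check of that figure, here is a small Python sketch that
recomputes the gain from the median run times in the table. It is not part of
the original measurements; the function and variable names are made up for
illustration. Depending on whether you look at the saved run time or at the
speedup, it gives roughly 7.4% less run time or a speedup of about 1.08x, in
line with the number quoted above.

```python
# Median run times in seconds, copied from the SPEC table above.
base_seconds = 390.0  # trunk r161367
peak_seconds = 361.0  # trunk r161367 + the patch from comment #19


def runtime_reduction(base, peak):
    """Fraction of the base run time that the peak configuration saves."""
    return (base - peak) / base


def speedup(base, peak):
    """How many times faster the peak configuration is than the base."""
    return base / peak


if __name__ == "__main__":
    print(f"run time reduction: {runtime_reduction(base_seconds, peak_seconds):.1%}")
    print(f"speedup:            {speedup(base_seconds, peak_seconds):.3f}x")
```

The two numbers differ slightly because one is relative to the base run time
and the other to the peak run time; either reading supports the conclusion.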
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=43829