------- Comment #6 from jv244 at cam dot ac dot uk  2009-05-16 11:31 -------
(In reply to comment #5)
This looks somewhat different from what I get here.

trunk without patch:

vond...@pcihopt3:/data03/vondele/contract> gfortran -O3 -march=native
-ffast-math -funroll-loops -ffree-line-length-200 --param
max-completely-peel-times=27 --param max-completely-peeled-insns=666 test.f90 ;
./a.out
 Sparse: time[s]   0.70804399
 New: time[s]   0.21201301
     speedup    3.3396254
      Glfops    3.0564163
 Error:   1.11022302462515654E-016

vond...@pcihopt3:/data03/vondele/contract> gfortran -O3 -march=native
-ffast-math -funroll-loops -ffree-line-length-200  test.f90 ; ./a.out
 Sparse: time[s]   0.61203802
 New: time[s]   0.20801300
     speedup    2.9423065
      Glfops    3.1151900
 Error:   1.11022302462515654E-016

trunk with patch:
vond...@pcihopt3:/data03/vondele/contract> gfortran -O3 -march=native
-ffast-math -funroll-loops -ffree-line-length-200 --param
max-completely-peel-times=27 --param max-completely-peeled-insns=666 test.f90 ;
./a.out
 Sparse: time[s]   0.60403699
 New: time[s]   0.21201402
     speedup    2.8490427
      Glfops    3.0564015
 Error:   1.11022302462515654E-016

with 4.4_branch:

vond...@pcihopt3:/data03/vondele/contract> gfortran -O3 -march=native
-ffast-math -funroll-loops -ffree-line-length-200 --param
max-completely-peel-times=27 --param max-completely-peeled-insns=666 test.f90 ;
./a.out
 Sparse: time[s]   0.70404398
 New: time[s]   0.20401198
     speedup    3.4509933
      Glfops    3.1762841
 Error:   1.11022302462515654E-016
vond...@pcihopt3:/data03/vondele/contract> gfortran -O3 -march=native
-ffast-math -funroll-loops -ffree-line-length-200 test.f90 ; ./a.out
 Sparse: time[s]   0.66804200
 New: time[s]   0.20801300
     speedup    3.2115397
      Glfops    3.1151900
 Error:   1.11022302462515654E-016
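
For easier side-by-side comparison, here is a minimal sketch that tabulates the timings quoted above and recomputes each speedup as the Sparse time divided by the New time. That ratio is an inference from the printed numbers, not something read from test.f90. The run labels are shorthand made up for this sketch, and "trunk, patched" assumes the patch was applied on top of trunk.

```python
# Timings copied from the runs quoted above:
# (Sparse time [s], New time [s], speedup as printed by ./a.out).
# The dictionary keys are hypothetical labels introduced for this sketch.
runs = {
    "trunk, no patch, peel params": (0.70804399, 0.21201301, 3.3396254),
    "trunk, no patch, defaults":    (0.61203802, 0.20801300, 2.9423065),
    "trunk, patched, peel params":  (0.60403699, 0.21201402, 2.8490427),
    "4.4 branch, peel params":      (0.70404398, 0.20401198, 3.4509933),
    "4.4 branch, defaults":         (0.66804200, 0.20801300, 3.2115397),
}


def speedup(sparse, new):
    """Assumed definition: ratio of the Sparse time to the New time."""
    return sparse / new


def report(table):
    """Return one formatted line per run with the recomputed speedup."""
    lines = []
    for label, (sparse, new, printed) in table.items():
        lines.append(
            f"{label:30s} sparse={sparse:.3f}s new={new:.3f}s "
            f"speedup={speedup(sparse, new):.4f} (printed {printed:.4f})"
        )
    return lines


if __name__ == "__main__":
    print("\n".join(report(runs)))
```

Read this way, the New kernel stays within 0.204 s to 0.212 s in all five runs, so the variation in speedup comes almost entirely from the Sparse timings.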

-march=native expands to:
-march=k8-sse3 -mcx16 -msahf --param l1-cache-size=64 --param
l1-cache-line-size=64 --param l2-cache-size=1024 -mtune=k8


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=40168
