------- Comment #6 from jv244 at cam dot ac dot uk 2009-05-16 11:31 -------
(In reply to comment #5)

This looks somewhat different from what I get here.

trunk without patch:

vond...@pcihopt3:/data03/vondele/contract> gfortran -O3 -march=native -ffast-math -funroll-loops -ffree-line-length-200 --param max-completely-peel-times=27 --param max-completely-peeled-insns=666 test.f90 ; ./a.out
 Sparse: time[s] 0.70804399
 New: time[s] 0.21201301
 speedup 3.3396254
 Glfops 3.0564163
 Error: 1.11022302462515654E-016

vond...@pcihopt3:/data03/vondele/contract> gfortran -O3 -march=native -ffast-math -funroll-loops -ffree-line-length-200 test.f90 ; ./a.out
 Sparse: time[s] 0.61203802
 New: time[s] 0.20801300
 speedup 2.9423065
 Glfops 3.1151900
 Error: 1.11022302462515654E-016

with patch:

vond...@pcihopt3:/data03/vondele/contract> gfortran -O3 -march=native -ffast-math -funroll-loops -ffree-line-length-200 --param max-completely-peel-times=27 --param max-completely-peeled-insns=666 test.f90 ; ./a.out
 Sparse: time[s] 0.60403699
 New: time[s] 0.21201402
 speedup 2.8490427
 Glfops 3.0564015
 Error: 1.11022302462515654E-016

with 4.4_branch:

vond...@pcihopt3:/data03/vondele/contract> gfortran -O3 -march=native -ffast-math -funroll-loops -ffree-line-length-200 --param max-completely-peel-times=27 --param max-completely-peeled-insns=666 test.f90 ; ./a.out
 Sparse: time[s] 0.70404398
 New: time[s] 0.20401198
 speedup 3.4509933
 Glfops 3.1762841
 Error: 1.11022302462515654E-016

vond...@pcihopt3:/data03/vondele/contract> gfortran -O3 -march=native -ffast-math -funroll-loops -ffree-line-length-200 test.f90 ; ./a.out
 Sparse: time[s] 0.66804200
 New: time[s] 0.20801300
 speedup 3.2115397
 Glfops 3.1151900
 Error: 1.11022302462515654E-016

-march=native goes to :
-march=k8-sse3 -mcx16 -msahf --param l1-cache-size=64 --param l1-cache-line-size=64 --param l2-cache-size=1024 -mtune=k8

--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=40168