------- Comment #13 from dominiq at lps dot ens dot fr  2010-08-24 16:19 -------
With the patch in comment #12, I get:

[macbook] lin/test% gfc -Ofast -funroll-loops test_fpu.f90
[macbook] lin/test% time a.out
  Benchmark running, hopefully as only ACTIVE task
  0.99755959009261719     
Test1 - Gauss 2000 (101x101) inverts  2.0 sec  Err= 0.000000000000006
Test2 - Crout 2000 (101x101) inverts  2.9 sec  Err= 0.000000000000014
Test3 - Crout  2 (1001x1001) inverts  3.4 sec  Err= 0.000000000000043
Test4 - Lapack 2 (1001x1001) inverts  2.6 sec  Err= 0.000000000000250
                             total = 10.9 sec

11.103u 0.098s 0:11.21 99.8%    0+0k 0+0io 0pf+0w

compared to

[macbook] lin/test% gfcp -Ofast -funroll-loops test_fpu.f90
[macbook] lin/test% time a.out
  Benchmark running, hopefully as only ACTIVE task
  0.99755959009261719     
Test1 - Gauss 2000 (101x101) inverts  2.0 sec  Err= 0.000000000000006
Test2 - Crout 2000 (101x101) inverts  2.9 sec  Err= 0.000000000000014
Test3 - Crout  2 (1001x1001) inverts  3.4 sec  Err= 0.000000000000043
Test4 - Lapack 2 (1001x1001) inverts  2.6 sec  Err= 0.000000000000250
                             total = 10.9 sec

11.114u 0.101s 0:11.22 99.9%    0+0k 0+0io 0pf+0w

So it fixes the slowdown. Thanks for the patch.


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=45379
