------- Comment #6 from dominiq at lps dot ens dot fr 2009-07-04 14:02 ------- I have seen this problem also. From a crude profiling, it seems that the slow routines are dgemm as pointed in comment #2 and gauss. This is a regression with respect to 4.4.0 and it has started between June 5 and 6.
On i686-apple-darwin9 with "-ffast-math -funroll-loops -O3" but without specifying -march, the inner loop is unrolled 8 times with 4.4 and only 4 times with trunk with a lot of extra code for the memory access (I am not fluent with the *86 assembly). I have compared the outputs of -ftree-vectorizer-verbose=2, but did not see anything suspicious. -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=40648