------- Comment #2 from burnus at gcc dot gnu dot org 2009-10-06 12:14 ------- (In reply to comment #1) > Try enabling -fwhole-program together with -flto.
I did now - and thanks to sanitizing the tree, there is no failure (contrary to using only -fwhole-program w/o -flto). capacita is now only <1% slower; but mdbx, rnflow, and test_fpu are still 2% to 3% slower. (The overall performance [geometric mean] is still 1% faster.) (Comparison just for fun: Pathscale 3.2.99 is (geometric mean) 9%, ifort 11.1 is 8% faster.) -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=41578