http://gcc.gnu.org/bugzilla/show_bug.cgi?id=60291
--- Comment #7 from Markus Trippelsdorf <trippels at gcc dot gnu.org> --- (In reply to Richard Biener from comment #6) > For reference (in testing) Looks promising: Without LTO: 2:27.39 total With LTO: 35.485 total (60% faster than clang) > throwing that to callgrind now (perf sucks - no backtraces :/) There is "-g" for "perf record" and "-g -G" for "perf report".