------- Comment #5 from amylaar at gcc dot gnu dot org 2008-12-09 21:52 -------
FWIW, the same problem can be seen on the SH, although it doesn't manifest as an actual performance regression from 4.2.1 because the 4.2.1 SH backend is suboptimal: the mov.b / extu.b scheduling is bad (that could be fixed with a peephole2), and the doloop_end pattern that can make use of the dt instruction is not available.
-- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=38440