http://gcc.gnu.org/bugzilla/show_bug.cgi?id=50164
--- Comment #8 from Ilya Enkovich <enkovich.gnu at gmail dot com> 2011-08-30 10:50:44 UTC ---
I attached a fixed reproducer. It is closer to the original test and has higher register pressure than the previous version. It shows the same problem as the first reproducer. Reproduced with GCC 4.7.0 20110828 and options "-O2 -m32 -march=atom". The code becomes faster on both Atom (~10%) and Core (~35%) if I use just "-O2 -m32", i.e. without "-march=atom".