http://gcc.gnu.org/bugzilla/show_bug.cgi?id=50088
--- Comment #10 from H.J. Lu <hjl.tools at gmail dot com> 2011-08-16 14:23:39 UTC ---
The real problem is the store-forwarding issue on Atom:

        addl    $1, 4(%esp)     # 67    *addsi_1/2      [length = 5]
        andl    $15, 4(%esp)    # 68    *andsi_1/1      [length = 5]
        movl    (%eax), %edi    # 70    *movsi_internal/1       [length = 2]
        movzbl  4(%esp), %ecx   # 154   *movqi_internal/3       [length = 5]

That is, we write 32 bits to 4(%esp) and then read 8 bits back from the
same location, which performs very poorly on Atom. We should write and
read the same size.
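
For illustration only, here is a minimal C sketch of the access pattern described above. The function names (step_mixed, step_matched) are hypothetical and are not taken from the test case in this bug. The volatile qualifier is used only to keep the accesses in memory, and the byte reload assumes a little-endian target such as x86. The instructions the compiler actually emits depend on its version and flags, so this shows the source-level shape, not a reproducer.

```c
#include <stdint.h>

/* Mismatched widths: the counter is updated with 32-bit memory
 * operations and then reloaded as a single byte from the same address.
 * This is the "write 32 bits, read 8 bits" shape from the listing.
 * Assumes little-endian, so the byte at the base address is the low byte. */
unsigned step_mixed(volatile uint32_t *counter)
{
    *counter = (*counter + 1) & 15;          /* 32-bit read-modify-write */
    return *(volatile uint8_t *)counter;     /* 8-bit reload */
}

/* Matched widths: the reload is also 32 bits wide, and the value is
 * narrowed in a register afterwards instead of by the load itself. */
unsigned step_matched(volatile uint32_t *counter)
{
    *counter = (*counter + 1) & 15;          /* 32-bit read-modify-write */
    return *counter & 0xffu;                 /* 32-bit reload, then mask */
}
```

Both functions return the same value; they differ only in the width of the load that follows the store, which is the property the comment above says should match.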