http://gcc.gnu.org/bugzilla/show_bug.cgi?id=55619
--- Comment #2 from Andrew Pinski <pinskia at gcc dot gnu.org> 2012-12-08 08:05:54 UTC --- I think it would be better to rewrite this using intrinsics than depending on the register allocator. You have 3 register inputs, 4 register outputs and 5 memory inputs. I also don't think you have the input/outputs correct either. %2 you only read and never write to so why have it as +? \