http://gcc.gnu.org/bugzilla/show_bug.cgi?id=52424
--- Comment #1 from Jiangning Liu <jiangning.liu at arm dot com> 2012-02-29 03:23:46 UTC --- > I've attached a proposed fix. Jiangning, can you please apply this and see if > your performance problem is resolved? Bill, Confirmed, I think your patch works for my big case and I do see the redundant copies are removed from final binary code. Benchmark performance boosts accordingly as well, although there still might be other potential problems. Thanks a lot for your quick patch. And are you going to check-in to trunk soon for 4.7? It would be also better if you can add a test case. -Jiangning