http://gcc.gnu.org/bugzilla/show_bug.cgi?id=31067
--- Comment #42 from Richard Guenther <rguenth at gcc dot gnu.org> 2011-07-25 15:39:54 UTC ---

With gas_dyn changed to use MINLOC (DTEMP, 1) we now inline the intrinsic (but not with MINLOC (DTEMP), even though we know it'll be a single-element array result ...). We completely lack a way to fuse the loops though. Inlining the intrinsic gives a moderate 5% speedup.
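
For illustration only, here is a rough sketch of the two MINLOC forms and of what fusing the loops by hand might look like. This is not the gas_dyn source: the program name, the arrays DX and VEL, the variables ISET, IMIN, IFUSED and TMIN, and the data are all made up for the example; only DTEMP and the two MINLOC calls come from the comment above.

```fortran
program minloc_sketch
  implicit none
  integer, parameter :: n = 5
  real    :: dx(n), vel(n), dtemp(n)   ! dx and vel are hypothetical inputs
  integer :: iset(1)   ! MINLOC without DIM returns a rank-1 array of size 1
  integer :: imin      ! MINLOC with DIM=1 on a rank-1 array returns a scalar
  integer :: i, ifused
  real    :: tmin

  dx  = [0.5, 0.4, 0.3, 0.2, 0.6]
  vel = [1.0, 2.0, 1.0, 4.0, 1.0]

  ! First loop: the array assignment that fills the temporary.
  dtemp = dx / vel

  ! Second loop: the search over the temporary, in both spellings.
  iset = minloc(dtemp)      ! array-valued form
  imin = minloc(dtemp, 1)   ! scalar form, the one reported as inlined

  ! Hand-fused version (an assumption about what fusion would produce):
  ! compute each element and track the minimum in a single pass, so the
  ! temporary array is not needed for the search at all.
  ifused = 1
  tmin = dx(1) / vel(1)
  do i = 2, n
    if (dx(i) / vel(i) < tmin) then   ! strict "<" keeps the first minimum, as MINLOC does
      tmin = dx(i) / vel(i)
      ifused = i
    end if
  end do

  ! All three indices should agree.
  print *, iset(1), imin, ifused
end program minloc_sketch
```

Both MINLOC calls find the same element; they differ only in the shape of the result (a one-element array versus a scalar).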