https://gcc.gnu.org/bugzilla/show_bug.cgi?id=77287
--- Comment #2 from Petr <kobalicek.petr at gmail dot com> --- With '-mtune=intel' the push/pop sequence is gone, but YMM register management remains the same - 24 memory accesses more than clang.
kobalicek.petr at gmail dot com Thu, 18 Aug 2016 04:08:26 -0700
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=77287
--- Comment #2 from Petr <kobalicek.petr at gmail dot com> --- With '-mtune=intel' the push/pop sequence is gone, but YMM register management remains the same - 24 memory accesses more than clang.