https://gcc.gnu.org/bugzilla/show_bug.cgi?id=90202
--- Comment #1 from Marc Glisse <glisse at gcc dot gnu.org> --- IIRC this is a tuning decision made on purpose. If you use just -mavx512f instead of the -march, you get the code you expected.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=90202
--- Comment #1 from Marc Glisse <glisse at gcc dot gnu.org> --- IIRC this is a tuning decision made on purpose. If you use just -mavx512f instead of the -march, you get the code you expected.