https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94406
--- Comment #4 from Martin Jambor <jamborm at gcc dot gnu.org> --- For the record, on AMD Zen2 at least, SPEC 2006 410.bwaves also runs about 12% faster with --param vect-epilogues-nomask=0 (and otherwise with -Ofast -march=native -mtune=native).