https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117562
--- Comment #2 from Hongtao Liu <liuhongt at gcc dot gnu.org> --- My guess there's a lower-tripcount(< 128bit vector) hot loop, avx512_two_epilogues only takes more cmp/jcc instructions but doesn't execute any real vector instructions.