https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114801
--- Comment #31 from Jakub Jelinek <jakub at gcc dot gnu.org> ---
Seems most of the V4BI/V8BImode predicates are in UNSPECs, so I think that, long term, turning those into just using V16BImode might help. Keep using V4BI/V8BImode for cases where we know each element is all 0 bits or all 1 bits, say when some comparison produces the mask, and then just use a SUBREG to turn that into V16BImode.
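
To illustrate the "all 0 or all 1 bits in each element" condition, here is a rough model in plain C. It is not GCC code. It assumes the predicate is 16 bits wide with one bit per byte lane, so a V4BImode element covers 4 predicate bits and a V8BImode element covers 2. The function names are hypothetical.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical model, not GCC code: a 16-bit predicate with one bit per
   byte lane.  bits_per_elt is 4 for a V4BI-style view, 2 for V8BI, and
   1 for V16BI (assumed values; must divide 16).  */

/* Return true if every bits_per_elt-wide group in PRED is all 0s or all 1s,
   i.e. the predicate is representable as one boolean per element.  */
static bool pred_is_canonical(uint16_t pred, unsigned bits_per_elt)
{
    uint16_t group_mask = (uint16_t)((1u << bits_per_elt) - 1u);
    for (unsigned shift = 0; shift < 16; shift += bits_per_elt) {
        uint16_t group = (uint16_t)((pred >> shift) & group_mask);
        if (group != 0 && group != group_mask)
            return false;
    }
    return true;
}

/* Build a predicate from per-element comparison results (bit i of
   ELT_RESULTS is the result for element i).  Each true element sets all
   of its predicate bits, so the result is canonical by construction.  */
static uint16_t pred_from_compare(unsigned elt_results, unsigned bits_per_elt)
{
    uint16_t group_mask = (uint16_t)((1u << bits_per_elt) - 1u);
    unsigned n_elts = 16 / bits_per_elt;
    uint16_t pred = 0;
    for (unsigned i = 0; i < n_elts; i++)
        if ((elt_results >> i) & 1u)
            pred |= (uint16_t)(group_mask << (i * bits_per_elt));
    return pred;
}
```

In this model a comparison-produced mask always passes `pred_is_canonical` for its own element width, while an arbitrary 16-bit value such as `0x0003` does not when viewed with 4 bits per element. Viewed with 1 bit per element (the V16BI-style view) every 16-bit value passes, which is why the wider-element views are only safe for masks known to come from something like a comparison.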