https://gcc.gnu.org/bugzilla/show_bug.cgi?id=118945
--- Comment #7 from Jeffrey A. Law <law at gcc dot gnu.org> --- That's going to be a micro-architectual decision. Some designs aren't sensitive to the number of vsetvls and I would expect that over time that's where high performance designs will land over time. Obviously for your design you'll want to set the knob which says "minimize vsetvls" as opposed to "avoid false dependencies by preferring tail agnostic". That's easily handled by putting the data in the tuning structure for each design.