https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110660
--- Comment #1 from Richard Biener <rguenth at gcc dot gnu.org> ---
The vectorizer itself could do the merging, which means it could also cost things more accurately. Otherwise, think about whether and how such a situation might arise from people using RVV intrinsics: how are those exposed to GIMPLE / RTL, and at which level could that be optimized? Is it possible to write an intrinsic testcase with such an opportunity?