https://gcc.gnu.org/bugzilla/show_bug.cgi?id=61338
--- Comment #12 from Richard Biener <rguenth at gcc dot gnu.org> ---
There's the SLP permute optimization phase, which should ideally catch all these cases, but the permutes for reverse accesses are only introduced during code generation, so that phase never sees them.
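
For illustration, a minimal sketch of the kind of loop in question. This is a hypothetical example, not the testcase attached to this PR, and the function name `scale_reverse` is made up. Both the load and the store walk the arrays backwards, so a vectorizer that handles each reverse access on its own would be expected to reverse the loaded vector and then reverse the result again before storing it. The two permutes cancel, which is the redundancy one would hope a permute optimization removes.

```c
/* Hypothetical example: both accesses use a negative step.
   Assumption: a vectorizer handling each access independently
   loads a vector from the lower address, reverses it, multiplies,
   and reverses again for the store. Since the multiply is
   element-wise, the two reversals are redundant.  */
void scale_reverse(float *restrict dst, const float *restrict src, int n)
{
    for (int i = n - 1; i >= 0; --i)
        dst[i] = src[i] * 2.0f;
}
```

Whether a given GCC version actually emits the two permutes here depends on the target and options; inspecting the vectorizer dump (e.g. with -fdump-tree-vect) is the way to check.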