https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119373
--- Comment #9 from camel-cdr <camel-cdr at protonmail dot com> --- Sorry, I missed that you attached the relevant C code. Here is a side by side with and without -m-rvv-lmul-max=dynamic: https://godbolt.org/z/MToxx813v Using LLVM-MCA as a quick and dirty performance model shows that this reduces the cycles by about 45%.