https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94837
--- Comment #3 from Gabriel Ravier <gabravier at gmail dot com> --- Also, I've tested the code from https://gcc.gnu.org/bugzilla/show_bug.cgi?id=54593 and the optimization in question is no longer in in `-mtune=generic`, only with specific architectures like `-mtune=k8`