https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82004
--- Comment #37 from Jakub Jelinek <jakub at gcc dot gnu.org> --- Created attachment 43771 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=43771&action=edit gcc8-pr82004.patch Untested hack. With this it works even with -flto. Though, the rounding errors because we do 400 multiplications get perhaps way too high even for -Ofast (the benchmark doesn't care that much except for the first iteration, but I think it is too much). So, as an alternative I'll just try to something different.