https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94789
--- Comment #3 from Gabriel Ravier <gabravier at gmail dot com> --- >From what I've seen, this optimisation could be useful on at least these targets : - x86_64 - i686 - aarch64 On other architectures I've looked at, either the optimization can't be done and/or it's useless because those architectures are capable of doing `32 - b` in a single instruction