https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94092
--- Comment #4 from Richard Biener <rguenth at gcc dot gnu.org> ---
With profile feedback we (target or middle-end) can produce specialized RTL expansion doing small copies inline and larger ones out of line.

The idea of GIMPLE-level pattern detection is that even for small sizes the target usually knows how to expand the copy optimally, while the user may have written a byte-copying loop. Of course that requires targets to pay attention.

Note that most compiler optimization involves some heuristics, and clearly heuristics can be off. I wonder if you can obtain better CoreMark results by using link-time optimization. Iff you're only after benchmark numbers...
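For illustration, here is a minimal sketch of the kind of user-written byte-copying loop that GIMPLE-level pattern detection is meant to recognize. The function names `copy_bytes` and `copy_bytes_memcpy` are hypothetical and do not come from the bug report or from CoreMark. The assumption is that, with `-ftree-loop-distribute-patterns` in effect, GCC may replace the loop with a `memcpy` call, which the target then expands as it sees fit; whether that happens depends on the GCC version, the target, and the options used.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical example: a hand-written byte-copying loop.
   The restrict qualifiers promise that the buffers do not overlap,
   which makes it legal for the compiler to treat the loop as a memcpy. */
void copy_bytes(unsigned char *restrict dst,
                const unsigned char *restrict src, size_t n)
{
    for (size_t i = 0; i < n; i++)
        dst[i] = src[i];
}

/* The explicit form the loop above is semantically equivalent to.
   How this call is expanded (inline or as a library call) is left
   to the middle-end and the target. */
void copy_bytes_memcpy(unsigned char *restrict dst,
                       const unsigned char *restrict src, size_t n)
{
    memcpy(dst, src, n);
}
```

Compiling this with `-O2 -S`, once as is and once with `-fno-tree-loop-distribute-patterns`, and comparing the assembly for `copy_bytes` is one way to see whether the pattern was detected for a given target.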