https://gcc.gnu.org/bugzilla/show_bug.cgi?id=67323

--- Comment #9 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
Note that the question here is which is better: ldr/str followed by a few
mults, or ld3/st3 followed by a few shifts/adds.  I think it really depends on
the micro-architecture (at least for aarch32).  In fact, I think ldr/str
followed by a few mults is much better for ThunderX and most likely also for
Cortex-A57 (at least that is how I read the optimization manual).
