http://gcc.gnu.org/bugzilla/show_bug.cgi?id=39423

--- Comment #17 from chrbr at gcc dot gnu.org 2012-07-11 12:35:32 UTC ---
Created attachment 27775
  --> http://gcc.gnu.org/bugzilla/attachment.cgi?id=27775
plus add combine

Here is the patch that I've been running since some time, it also use the same
combine pattern matcher, but the goal of this patch was originally to fix up
chains of multiple mult-add instructions.
Optimizing the cst+reg addressing mode appears as a nice effects. Out of range
indexes seems to be handled as afar as I can see.

This brings a EEMBC telecom speedup of 10%.FFMPEG code size reduced to 30% on a
few objects. 
Validated on whole linux distribution, with only improvements (few regression
only bellow noise).

This patch is only for comments/illustration. Need a few polishing before
proposing. I'm having a look at your implementation to see how they compare and
possibly combined together. Both approaches look interesting.

Reply via email to