------- Comment #48 from paolo dot bonzini at lu dot unisi dot ch 2006-08-08 07:05 ------- Subject: Re: [4.0/4.1 Regression] gcc 4 produces worse x87 code on all platforms than gcc 3
> In x86/x86-64 world one can be almost sure that the load+execute instruction > pair will execute (marginaly to noticeably) faster than move+load-and-execute > instruction pair as the more complex instructions are harder for on-chip > scheduling (they retire later). Yes, so far so good and this part has already been committed. But does a *single* load-and-execute instruction execute faster than the two instructions in a load+execute sequence? -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=27827