Richard Biener wrote:
> With Ooo CPUs speculatively executing the next iterations I very much doubt
> that.

OoO execution is like really dumb loop unrolling: you still have all the dependencies between iterations, all the branches, all the pointer increments etc. Optimizing those away reduces the instruction count, much as vectorization does. Fewer instructions implies faster code.

Wilco
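
To make the point above concrete, here is a minimal sketch in C. It is not taken from the thread; the function names `sum_rolled` and `sum_unrolled4` are made up for illustration, and a compiler may well perform this transformation itself at higher optimization levels. The rolled loop pays for an index increment, a compare and a branch on every element, and each add depends on the previous one. An OoO core can overlap iterations but still has to execute all of those instructions. The unrolled version removes most of them and splits the dependency chain across four accumulators.

```c
#include <stddef.h>

/* Rolled loop: each element costs a load, an add, an index increment,
   a compare and a branch, and every add depends on the previous sum. */
long sum_rolled(const int *a, size_t n)
{
    long sum = 0;
    for (size_t i = 0; i < n; i++)
        sum += a[i];
    return sum;
}

/* Hand-unrolled by 4 (illustrative only): one increment, compare and
   branch per four elements, with four independent accumulators so the
   adds no longer form a single serial chain. */
long sum_unrolled4(const int *a, size_t n)
{
    long s0 = 0, s1 = 0, s2 = 0, s3 = 0;
    size_t i = 0;

    for (; i + 4 <= n; i += 4) {
        s0 += a[i];
        s1 += a[i + 1];
        s2 += a[i + 2];
        s3 += a[i + 3];
    }

    long sum = s0 + s1 + s2 + s3;

    /* Remainder loop for the last n % 4 elements. */
    for (; i < n; i++)
        sum += a[i];
    return sum;
}
```

Both functions return the same result for any input; the difference is only in how many loop-overhead instructions are executed per element.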