https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85216
--- Comment #8 from Richard Biener <rguenth at gcc dot gnu.org> ---
(In reply to Timothy Pearson from comment #4)
> (In reply to Andrew Pinski from comment #3)
> > This is 100% the equivalent code.
> >
> > jmp *(%r15) # opline.199_67->handler
> > Does two things:
> > loads a pointer from %r15 and then jumps to that pointer.
> >
> > In PowerPC, you can only jump indirectly via the CTR or LR registers.
> >
> > ld 9,0(29) # opline.200_67->handler, gotovar.1505_2678
> > mtctr 9 # gotovar.1505_2678, gotovar.1505_2678
> > bctr
> >
> >
> > Most likely what is happening is the indirect branch predictor is not
> > predicting the branch correctly on the powerpc side while it is on the x86
> > side. This is a micro-architecture difference between the two chips and is
> > unrelated to the ISA differences.
>
> I'm forwarding this for analysis to see if there's anything we can do in
> firmware to "fix" the branch predictor. If not, is there a way to prime the
> predictor in this scenario, or is this too specific to be added
> compiler-side?
The usual way is speculative devirtualization, you replace
jmp *(%r15)
with
if (%r15 == constant-address)
jmp constant-address
else
jmp *(%r15)
where the hope is this helps branch prediction.
Other than that - are there very many such indirect branches or is it just
one?