https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80952
--- Comment #2 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
I suspect this is due to using atomic increments for the profile data now. Previously there were many race conditions, and profile arcs would produce incorrect values for threaded code (OpenMP code is such).
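To illustrate the race described above, here is a minimal sketch in C++. It is not GCC's actual profiling code: `counter`, `bump_racy`, `bump_atomic`, and `run` are hypothetical names chosen for this example. The "racy" variant splits the update into a separate load and store, so concurrent threads can lose increments. The atomic variant uses a single read-modify-write.

```cpp
#include <atomic>
#include <cstdint>
#include <thread>
#include <vector>

// Hypothetical stand-in for a single profile counter (not GCC's gcov data).
static std::atomic<std::int64_t> counter{0};

// Mimics a non-atomic increment: the load and the store are separate steps,
// so two threads can read the same value and one increment is lost.
void bump_racy() {
    std::int64_t v = counter.load(std::memory_order_relaxed);
    counter.store(v + 1, std::memory_order_relaxed);
}

// Atomic increment: the read-modify-write happens as one indivisible step.
void bump_atomic() {
    counter.fetch_add(1, std::memory_order_relaxed);
}

// Resets the counter, runs `bump` `iters` times on each of `threads` threads,
// and returns the final counter value.
std::int64_t run(void (*bump)(), int threads, int iters) {
    counter.store(0);
    std::vector<std::thread> pool;
    for (int t = 0; t < threads; ++t) {
        pool.emplace_back([=] {
            for (int i = 0; i < iters; ++i) bump();
        });
    }
    for (auto &th : pool) th.join();
    return counter.load();
}
```

Calling `run(bump_atomic, 4, 100000)` always yields 400000, while `run(bump_racy, 4, 100000)` may yield less on a multi-core machine. The trade-off is that atomic read-modify-write operations are generally more expensive than plain increments, especially when many threads contend for the same counter.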