https://gcc.gnu.org/bugzilla/show_bug.cgi?id=125174

Richard Biener <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Last reconfirmed|                            |2026-05-06
     Ever confirmed|0                           |1
             Status|UNCONFIRMED                 |NEW

--- Comment #2 from Richard Biener <rguenth at gcc dot gnu.org> ---
Also reproduces on a Zen4 machine with -Ofast -march=x86-64-v3 -mtune=znver3
and --param ix86-vect-compare-costs 0 vs. 1.

Most notable perf differences are that we now see

   2.27%         12762  tonto_peak.amd6  libmvec.so.1                   [.] _ZGVbN2v_sin_sse4
   1.36%          7618  tonto_peak.amd6  libmvec.so.1                   [.] _ZGVbN2v_cos_sse4
   0.79%          4483  tonto_peak.amd6  libmvec.so.1                   [.] _ZGVdN4v_sin_avx2

vs.

   0.72%          4072  tonto_base.amd6  libmvec.so.1                   [.] _ZGVdN4v_sin_avx2

The SSE4 variants do not appear in the base profile.  And we have

   1.45%          8157  tonto_peak.amd6  libm.so.6                      [.] __sincos_fma
   1.36%          7700  tonto_base.amd6  libm.so.6                      [.] __sincos_fma
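
For readers unfamiliar with the libmvec symbol names in the profiles above, the following is a minimal sketch of how they can be decoded, assuming the x86-64 vector function ABI naming scheme `_ZGV<isa><mask><lanes><params>_<name>`. The helper `decode_vector_symbol` is hypothetical and not part of GCC, glibc, or perf; the trailing `_sse4` / `_avx2` is left as part of the name, since it appears to be glibc's internal suffix for the selected implementation.

```python
import re

# ISA letters as defined by the x86-64 vector function ABI.
ISA_NAMES = {"b": "SSE", "c": "AVX", "d": "AVX2", "e": "AVX-512"}

# _ZGV <isa> <mask> <lanes> <params> _ <name>
SYMBOL_RE = re.compile(r"_ZGV([bcde])([NM])(\d+)([^_]+)_(.+)")


def decode_vector_symbol(symbol):
    """Split a vector-ABI symbol into its fields (hypothetical helper).

    Returns None for symbols that do not follow the scheme,
    e.g. the scalar libm entry __sincos_fma.
    """
    match = SYMBOL_RE.fullmatch(symbol)
    if match is None:
        return None
    isa, mask, lanes, params, name = match.groups()
    return {
        "isa": ISA_NAMES[isa],
        "masked": mask == "M",   # N = unmasked, M = masked
        "lanes": int(lanes),     # elements processed per call
        "params": params,        # one letter per argument, v = vector
        "name": name,            # scalar function plus any library suffix
    }


for sym in ("_ZGVbN2v_sin_sse4", "_ZGVdN4v_sin_avx2", "__sincos_fma"):
    print(sym, decode_vector_symbol(sym))
```

Read this way, `_ZGVbN2v_sin` is the unmasked SSE variant handling 2 doubles per call, while `_ZGVdN4v_sin` is the unmasked AVX2 variant handling 4, so the additional `_ZGVbN2v_*` entries in the peak profile correspond to calls from 2-lane vectorized code.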
