https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98387
Bug ID: 98387 Summary: GCC >= 6 cannot inline _mm_cmp_ps on SSE targets Product: gcc Version: 10.2.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component: c++ Assignee: unassigned at gcc dot gnu.org Reporter: danielhanchen at gmail dot com Target Milestone: --- https://gcc.godbolt.org/z/493ead GCC since version 6.1 cannot inline _mm_cmp_ps on targets supporting only SSE (Nehalem, Tremont etc). From >= SandyBridge, everything inlines fine. _mm_cmp_ps is called by using it as a function argument (ie auto function). All SSE only machines use a jmp to _mm_cmp_ps, but it should be inlined. O3 ffast-math is also used, and the function is declared inline.