https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101495

            Bug ID: 101495
           Summary: Unnecessary vzeroupper
           Product: gcc
           Version: 11.1.1
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: target
          Assignee: unassigned at gcc dot gnu.org
          Reporter: hjl.tools at gmail dot com
                CC: crazylht at gmail dot com
  Target Milestone: ---
            Target: i386,x86-64

[hjl@gnu-cfl-2 pr101456]$ cat y.c
#include <x86intrin.h>

extern __m256d x;

extern __m256d bar (void);

__m256d
foo (void)
{
  x = _mm256_setzero_pd ();
  return bar ();
}
[hjl@gnu-cfl-2 pr101456]$ gcc -S -O2 -mavx2 y.c
[hjl@gnu-cfl-2 pr101456]$ cat y.s
        .file   "y.c"
        .text
        .p2align 4
        .globl  foo
        .type   foo, @function
foo:
.LFB5667:
        .cfi_startproc
        vxorpd  %xmm0, %xmm0, %xmm0
        vmovapd %ymm0, x(%rip)
        vzeroupper
        jmp     bar
        .cfi_endproc
.LFE5667:
        .size   foo, .-foo
        .ident  "GCC: (GNU) 11.1.1 20210531 (Red Hat 11.1.1-3)"
        .section        .note.GNU-stack,"",@progbits
[hjl@gnu-cfl-2 pr101456]$ 

Since bar returns __m256d, it must be compiled with AVX and there is no
need for vzeroupper.

Reply via email to