https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101495
Bug ID: 101495 Summary: Unnecessary vzeroupper Product: gcc Version: 11.1.1 Status: UNCONFIRMED Severity: normal Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: hjl.tools at gmail dot com CC: crazylht at gmail dot com Target Milestone: --- Target: i386,x86-64 [hjl@gnu-cfl-2 pr101456]$ cat y.c #include <x86intrin.h> extern __m256d x; extern __m256d bar (void); __m256d foo (void) { x = _mm256_setzero_pd (); return bar (); } [hjl@gnu-cfl-2 pr101456]$ gcc -S -O2 -mavx2 y.c [hjl@gnu-cfl-2 pr101456]$ cat y.s .file "y.c" .text .p2align 4 .globl foo .type foo, @function foo: .LFB5667: .cfi_startproc vxorpd %xmm0, %xmm0, %xmm0 vmovapd %ymm0, x(%rip) vzeroupper jmp bar .cfi_endproc .LFE5667: .size foo, .-foo .ident "GCC: (GNU) 11.1.1 20210531 (Red Hat 11.1.1-3)" .section .note.GNU-stack,"",@progbits [hjl@gnu-cfl-2 pr101456]$ Since bar returns __m256d, it must be compiled with AVX and there is no need for vzeroupper.