[Bug target/91722] gcc generates sub-optimal assembly when AVX instructions are used.

2025-01-08 Thread pinskia at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91722 --- Comment #3 from Andrew Pinski --- _9 = __builtin_ia32_vbroadcastss256 (&signbit); _4 = __builtin_ia32_andnps256 (_9, to_3(D)); _6 = __builtin_ia32_andps256 (_9, from_5(D)); _7 = __builtin_ia32_orps256 (_6, _4); [tail call]

[Bug target/91722] gcc generates sub-optimal assembly when AVX instructions are used.

2019-09-11 Thread rguenth at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91722 Richard Biener changed: What|Removed |Added Keywords||missed-optimization Target|

[Bug target/91722] gcc generates sub-optimal assembly when AVX instructions are used.

2019-09-10 Thread hjl.tools at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91722 --- Comment #1 from H.J. Lu --- GCC 10 generates: .globl _Z11copysign_psDv8_fS_ .type _Z11copysign_psDv8_fS_, @function _Z11copysign_psDv8_fS_: .LFB5339: .cfi_startproc vmovaps %ymm0, %ymm2 vbroadcastss