https://gcc.gnu.org/bugzilla/show_bug.cgi?id=47860

Andrew Pinski <pinskia at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
     Ever confirmed|0                           |1
             Status|UNCONFIRMED                 |NEW
   Last reconfirmed|                            |2021-08-16

--- Comment #6 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
Confirmed, ICC is able to vectorize this loop even without AVX (GCC can do the
vectorize the loop currently with AVX).


        movdqa    %xmm0, %xmm11                                 #10.11
        lea       1(%r14), %r15d                                #9.31
        movups    (%rdx,%r15,8), %xmm9                          #9.27
        movups    (%rcx,%r14,8), %xmm10                         #10.24
        cmpltpd   %xmm1, %xmm10                                 #10.24
        pxor      %xmm2, %xmm10                                 #10.24
        movmskpd  %xmm10, %r15d                                 #10.24
        testl     %r15d, %r15d                                  #10.24
        je        ..B1.14       # Prob 50%                      #10.24
                                # LOE rax rdx rcx rbx rsi rdi ebp r8d r9d r10d
r11d r12d r13d r14d xmm0 xmm1 xmm2 xmm3 xmm4 xmm5 xmm6 xmm7 xmm8 xmm9 xmm10
xmm11
..B1.13:                        # Preds ..B1.12
                                # Execution count [1.25e+01]
        pshufd    $8, %xmm10, %xmm11                            #10.24
        movaps    %xmm9, %xmm8                                  #5.21
        pand      %xmm6, %xmm11                                 #10.24
                                # LOE rax rdx rcx rbx rsi rdi ebp r8d r9d r10d
r11d r12d r13d r14d xmm0 xmm1 xmm2 xmm3 xmm4 xmm5 xmm6 xmm7 xmm8 xmm11
..B1.14:                        # Pre

Reply via email to