On Fri, Aug 09, 2019 at 11:25:30AM +0200, Richard Biener wrote:
>   0.65 │1e0:   vpxor  %xmm0,%xmm0,%xmm0
>   0.32 │       vpmaxs -0x10(%rsp),%xmm0,%xmm0
>  40.45 │       vmovd  %xmm0,%eax
>   2.45 │       imul   %r8d,%eax

Shouldn't we hoist the vpxor before the loop?  Is it STV being done too late
that we don't do that anymore?  Couldn't e.g. STV itself detect that and put
the clearing instruction before the loop instead of right before the minmax?

        Jakub

Reply via email to