Re: Ping: [PATCH] x86/AVX512: improve generated code for mask-to-vector-register conversions

2019-07-22 Thread Jeff Law
On 7/18/19 10:07 AM, Jan Beulich wrote: On 27.06.19 at 10:59, wrote: >> Conversion of comparison results to full vectors does, when VPMOVM2* are >> unavailable, not require any intermediate VMOVDQ{A,U}*: Simply use >> embedded masking on VPTERNLOG* right away, which is available with >> AVX51

Ping: [PATCH] x86/AVX512: improve generated code for mask-to-vector-register conversions

2019-07-18 Thread Jan Beulich
>>> On 27.06.19 at 10:59, wrote: > Conversion of comparison results to full vectors does, when VPMOVM2* are > unavailable, not require any intermediate VMOVDQ{A,U}*: Simply use > embedded masking on VPTERNLOG* right away, which is available with > AVX512F (while VPMOVM2{D,Q} are available only wit

[PATCH] x86/AVX512: improve generated code for mask-to-vector-register conversions

2019-06-27 Thread Jan Beulich
Conversion of comparison results to full vectors does, when VPMOVM2* are unavailable, not require any intermediate VMOVDQ{A,U}*: Simply use embedded masking on VPTERNLOG* right away, which is available with AVX512F (while VPMOVM2{D,Q} are available only with AVX512DQ). Note that the chosen immedia