Re: [PATCH v2 2/2] util: add util function buffer_zero_avx512()

2020-02-27 Thread Richard Henderson
On 2/25/20 7:35 PM, Robert Hoo wrote:
> @@ -226,10 +263,16 @@ static void init_accel(unsigned cache)
>          fn = buffer_zero_avx2;
>      }
>  #endif
> +#ifdef CONFIG_AVX512F_OPT
> +    if (cache & CACHE_AVX512F) {
> +        fn = buffer_zero_avx512;
> +        length_to_accel = 256;
> +    }
>
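For readers without the surrounding file at hand, the quoted hunk's dispatch pattern can be sketched roughly as below. This is only an illustration, not the code from the patch: every sketch_* and SKETCH_* name is hypothetical, the feature bit values and the default threshold of 64 are assumptions, and the "accelerated" variants are plain stand-ins so that the example builds on any host. The one detail taken from the hunk is that choosing the AVX512 routine also raises length_to_accel to 256.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical feature bits; the real CACHE_* values are not shown in the thread. */
#define SKETCH_CACHE_AVX2    (1u << 0)
#define SKETCH_CACHE_AVX512F (1u << 1)

typedef bool (*sketch_zero_fn)(const void *buf, size_t len);

/* Portable byte-wise check. */
static bool sketch_zero_scalar(const void *buf, size_t len)
{
    const unsigned char *p = buf;
    for (size_t i = 0; i < len; i++) {
        if (p[i]) {
            return false;
        }
    }
    return true;
}

/* Stand-ins for the SIMD variants, so the sketch stays portable. */
static bool sketch_zero_avx2(const void *buf, size_t len)
{
    return sketch_zero_scalar(buf, len);
}

static bool sketch_zero_avx512(const void *buf, size_t len)
{
    return sketch_zero_scalar(buf, len);
}

static sketch_zero_fn sketch_accel = sketch_zero_scalar;
static size_t sketch_length_to_accel = 64;   /* assumed default threshold */

/* Pick the widest available routine; AVX512 also raises the threshold. */
static void sketch_init_accel(unsigned cache)
{
    sketch_zero_fn fn = sketch_zero_scalar;
    size_t threshold = 64;

    if (cache & SKETCH_CACHE_AVX2) {
        fn = sketch_zero_avx2;
    }
    if (cache & SKETCH_CACHE_AVX512F) {
        fn = sketch_zero_avx512;
        threshold = 256;
    }
    sketch_accel = fn;
    sketch_length_to_accel = threshold;
}

/* Buffers shorter than the threshold skip the accelerated routine. */
static bool sketch_buffer_is_zero(const void *buf, size_t len)
{
    if (len == 0) {
        return true;
    }
    if (len < sketch_length_to_accel) {
        return sketch_zero_scalar(buf, len);
    }
    return sketch_accel(buf, len);
}
```

Keeping the threshold next to the function pointer means the short-buffer cutoff follows whichever routine was selected, which appears to be the intent of setting both in the same branch.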

[PATCH v2 2/2] util: add util function buffer_zero_avx512()

2020-02-25 Thread Robert Hoo
Initialize buffer_accel with buffer_zero_avx512() when Intel AVX512F is available on the host. This function uses Intel AVX512 Foundation instructions, which are faster than the AVX2 implementation (in my unit test, with a 4K buffer, on Cascade Lake SP, ~36% faster, buffer_zero_avx512() vs.
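As a rough idea of what a 512-bit zero check can look like, here is a minimal sketch. It is not the function from the patch: the avx_sketch_* names are hypothetical, the loop is deliberately simple (one 64-byte unaligned load per iteration, with a byte-wise tail), and it assumes GCC or Clang on x86-64 for the target attribute and __builtin_cpu_supports(). On other hosts, or on CPUs without AVX512F, it falls back to the scalar loop.

```c
#include <stdbool.h>
#include <stddef.h>

#if defined(__x86_64__) && (defined(__GNUC__) || defined(__clang__))
#include <immintrin.h>
#define AVX_SKETCH_HAVE_AVX512F 1

/* Check 64 bytes per iteration; stop at the first non-zero block. */
__attribute__((target("avx512f")))
static bool avx_sketch_zero_avx512f(const unsigned char *p, size_t len)
{
    size_t i = 0;

    for (; i + 64 <= len; i += 64) {
        __m512i v = _mm512_loadu_si512((const void *)(p + i));
        /* The mask has a bit set for every 64-bit lane where v & v != 0. */
        if (_mm512_test_epi64_mask(v, v)) {
            return false;
        }
    }
    /* Remaining tail of fewer than 64 bytes. */
    for (; i < len; i++) {
        if (p[i]) {
            return false;
        }
    }
    return true;
}
#endif

/* Portable fallback. */
static bool avx_sketch_zero_scalar(const unsigned char *p, size_t len)
{
    for (size_t i = 0; i < len; i++) {
        if (p[i]) {
            return false;
        }
    }
    return true;
}

/* Use the AVX512F path only when the running CPU reports support for it. */
static bool avx_sketch_is_zero(const void *buf, size_t len)
{
    const unsigned char *p = buf;

#ifdef AVX_SKETCH_HAVE_AVX512F
    if (__builtin_cpu_supports("avx512f")) {
        return avx_sketch_zero_avx512f(p, len);
    }
#endif
    return avx_sketch_zero_scalar(p, len);
}
```

A tuned implementation would likely unroll the loop and handle alignment; the sketch only shows the load-and-test idea.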