Re: [PATCH] Improve AVX512F sse4_1_round* patterns

2016-04-28 Thread Kirill Yukhin
Hi Jakub, On 27 Apr 23:34, Jakub Jelinek wrote: > Hi! > > While AVX512F doesn't contain EVEX encoded vround{ss,sd,ps,pd} instructions, > it contains vrndscale* which performs the same thing if bits [4:7] of the > immediate are zero. > > For _mm*_round_{ps,pd} we actually already emit vrndscale* f

[PATCH] Improve AVX512F sse4_1_round* patterns

2016-04-27 Thread Jakub Jelinek
Hi! While AVX512F doesn't contain EVEX encoded vround{ss,sd,ps,pd} instructions, it contains vrndscale* which performs the same thing if bits [4:7] of the immediate are zero. For _mm*_round_{ps,pd} we actually already emit vrndscale* for -mavx512f instead of vround* unconditionally (because _rnds