Re: [PATCH] Improve QImode extraction without SSE4.1 (PR tree-optimization/91201)

2019-08-02 Thread Uros Bizjak
On Fri, Aug 2, 2019 at 9:54 AM Jakub Jelinek wrote: > > Hi! > > As mentioned in the PR, for SSE4.1 we use pextrb for vec_extractv16qiqi, > but at least for element 0 we store the vector into memory and load the > single byte from there and we can just use movd instead. > > The following patch does

[PATCH] Improve QImode extraction without SSE4.1 (PR tree-optimization/91201)

2019-08-02 Thread Jakub Jelinek
Hi! As mentioned in the PR, for SSE4.1 we use pextrb for vec_extractv16qiqi, but at least for element 0 we store the vector into memory and load the single byte from there and we can just use movd instead. The following patch does that, just skips it for the case when we know we'll go through mem