On Tue, Dec 8, 2015 at 9:37 PM, Jonathan Gray wrote:
> Change the __m128i variables to be volatile so gcc 4.9 won't optimise
> all of them out with -O1 or greater. The _mm_set1_epi32/pinsrd calls
> still get optimised out but now there is at least one SSE4.1 instruction
> generated via _mm_max_ep
Change the __m128i variables to be volatile so gcc 4.9 won't optimise
all of them out with -O1 or greater. The _mm_set1_epi32/pinsrd calls
still get optimised out but now there is at least one SSE4.1 instruction
generated via _mm_max_epu32/pmaxud. When all of the sse4.1 instructions
got optimised