On Sun, 7 Apr 2013, Marc Glisse wrote:
extern __inline __m128i __attribute__((__gnu_inline__, __always_inline__,
__artificial__))
_mm_slli_epi16 (__m128i __A, int __B)
{
- return (__m128i)__builtin_ia32_psllwi128 ((__v8hi)__A, __B);
+ return (__m128i) ((__v8hi)__A << __B);
}
Actually, I believe I have to keep using the builtins for shifts, because the intrinsics have well defined behavior for large __B whereas << and >> don't.
-- Marc Glisse
