On 23/08/14 11:07 AM, Mickaël Raulet wrote:
> For 10bits and 12bits, they should stay sse4 as well because of packusdw. You
> need some instructions to convert it to ssse3 see below
>
>
> static av_always_inline __m128i _MM_PACKUS_EPI32( __m128i a, __m128i b )
> {
> a = _mm_slli_epi32 (a, 16);
> a = _mm_srai_epi32 (a, 16);
> b = _mm_slli_epi32 (b, 16);
> b = _mm_srai_epi32 (b, 16);
> a = _mm_packs_epi32 (a, b);
> return a;
> }
There's a PACK macro in lavfi/x86/yasm-16.asm that does this without intrinsics.
_______________________________________________
ffmpeg-devel mailing list
[email protected]
http://ffmpeg.org/mailman/listinfo/ffmpeg-devel