Hello,
The following patchset add the ff_simple_idct function neon functions for the
aarch64 platform. It's ported from armv7 simple_idct_neon with some
improvements:
* the source idct blocks are now loaded once and kept in v24-v31
* the source idct blocks are no longer overriden in idct_col4_top
* the destination is now written in one pass at the end of
ff_simple_idct{,_put,_add}_neon
It is bitexact with the armv7 neon implementation.
Here are some results (reported by {START,STOP}_TIMER) on an Odroid-C2 (Cortex
A53):
Functions IDCT: simple IDCT: simpleneon
ff_simple_idct_put 9795 units 3170 units
ff_simple_idct_add 10227 units 3302 units
Matthieu
_______________________________________________
ffmpeg-devel mailing list
[email protected]
http://ffmpeg.org/mailman/listinfo/ffmpeg-devel