On Thu, 7 Oct 2021, J. Dekker wrote:

--bench on AWS Graviton:

hevc_sao_edge_8x8_8_c: 516.0
hevc_sao_edge_8x8_8_neon: 81.0

Signed-off-by: J. Dekker <[email protected]>
---
libavcodec/aarch64/hevcdsp_init_aarch64.c |  3 ++
libavcodec/aarch64/hevcdsp_sao_neon.S     | 52 +++++++++++++++++++++++
2 files changed, 55 insertions(+)

Looks ok (with the same comments about folding in 'lsl' into the load at the start), as this is essentially exactly the same but doing two rows at ocne.

// Martin

_______________________________________________
ffmpeg-devel mailing list
[email protected]
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

To unsubscribe, visit link above, or email
[email protected] with subject "unsubscribe".

Reply via email to