On Sat, Aug 31, 2024 at 10:40 PM Michael Niedermayer <[email protected]> wrote: > On Fri, Aug 30, 2024 at 08:56:55PM +0200, Ramiro Polla wrote: > > A55 A76 > > deinterleave_bytes_c: 70342.0 34497.5 > > deinterleave_bytes_neon: 21594.5 ( 3.26x) 5535.2 ( 6.23x) > > deinterleave_bytes_aligned_c: 71340.8 34651.2 > > deinterleave_bytes_aligned_neon: 8616.8 ( 8.28x) 3996.2 ( 8.67x) > > --- > > libswscale/aarch64/rgb2rgb.c | 4 ++ > > libswscale/aarch64/rgb2rgb_neon.S | 59 +++++++++++++++++++++++ > > tests/checkasm/sw_rgb.c | 77 +++++++++++++++++++++++++++++++ > > 3 files changed, 140 insertions(+) > > this breaks fate on x86-64 > > Test checkasm-sw_rgb failed. Look at tests/data/fate/checkasm-sw_rgb.err for > details.
The sse2/avx implementations of deinterleaveBytes use LOOP_NVXX_TO_UV, which checks for alignment on src (and can read unaligned data) but expects dst to be aligned. Should the unaligned versions of these functions be modified to support writing to unaligned data? _______________________________________________ ffmpeg-devel mailing list [email protected] https://ffmpeg.org/mailman/listinfo/ffmpeg-devel To unsubscribe, visit link above, or email [email protected] with subject "unsubscribe".
