On Thu, 25 Feb 2021 at 09:54, Alex Bennée <[email protected]> wrote: > > > Richard Henderson <[email protected]> writes: > > > Always perform one call instead of two for 16-byte operands. > > Use byte loads/stores directly into the vector register file > > instead of extractions and deposits to a 64-bit local variable. > > > > In order to easily receive pointers into the vector register file, > > convert the helper to the gvec out-of-line signature. Move the > > helper into vec_helper.c, where it can make use of H1 and clear_tail. > > > > Signed-off-by: Richard Henderson <[email protected]> > > Reviewed-by: Alex Bennée <[email protected]> > Tested-by: Alex Bennée <[email protected]> > > Looked marginally slower on the (1!) run I did but probably in the noise > and the generated code looks nicer.
Applied to target-arm.next, thanks. -- PMM
