On Fri, 21 Apr 2023 at 21:00, Richard Sandiford <richard.sandif...@arm.com> wrote: > > Prathamesh Kulkarni <prathamesh.kulka...@linaro.org> writes: > > Hi Richard, > > Based on your suggestions in the other thread, the patch uses force_reg > > to avoid creating pseudo if value is already in a register. > > Bootstrap+test passes on aarch64-linux-gnu. > > OK to commit ? > > > > Thanks, > > Prathamesh > > > > [aarch64] Use force_reg instead of copy_to_mode_reg. > > > > Use force_reg instead of copy_to_mode_reg in aarch64_simd_dup_constant > > and aarch64_expand_vector_init to avoid creating pseudo if original value > > is already in a register. > > > > gcc/ChangeLog: > > * config/aarch64/aarch64.cc (aarch64_simd_dup_constant): Use > > force_reg instead of copy_to_mode_reg. > > (aarch64_expand_vector_init): Likewise. > > OK, thanks. Thanks, committed in: https://gcc.gnu.org/git/?p=gcc.git;a=commit;h=e306501ff556647dc31915a63ce95a5496f08f97
Thanks, Prathamesh > > Richard > > > diff --git a/gcc/config/aarch64/aarch64.cc b/gcc/config/aarch64/aarch64.cc > > index 0d7470c05a1..321580d7f6a 100644 > > --- a/gcc/config/aarch64/aarch64.cc > > +++ b/gcc/config/aarch64/aarch64.cc > > @@ -21968,7 +21968,7 @@ aarch64_simd_dup_constant (rtx vals) > > /* We can load this constant by using DUP and a constant in a > > single ARM register. This will be cheaper than a vector > > load. */ > > - x = copy_to_mode_reg (inner_mode, x); > > + x = force_reg (inner_mode, x); > > return gen_vec_duplicate (mode, x); > > } > > > > @@ -22082,7 +22082,7 @@ aarch64_expand_vector_init (rtx target, rtx vals) > > /* Splat a single non-constant element if we can. */ > > if (all_same) > > { > > - rtx x = copy_to_mode_reg (inner_mode, v0); > > + rtx x = force_reg (inner_mode, v0); > > aarch64_emit_move (target, gen_vec_duplicate (mode, x)); > > return; > > } > > @@ -22190,12 +22190,12 @@ aarch64_expand_vector_init (rtx target, rtx vals) > > vector register. For big-endian we want that position to hold > > the last element of VALS. */ > > maxelement = BYTES_BIG_ENDIAN ? n_elts - 1 : 0; > > - rtx x = copy_to_mode_reg (inner_mode, XVECEXP (vals, 0, > > maxelement)); > > + rtx x = force_reg (inner_mode, XVECEXP (vals, 0, maxelement)); > > aarch64_emit_move (target, lowpart_subreg (mode, x, inner_mode)); > > } > > else > > { > > - rtx x = copy_to_mode_reg (inner_mode, XVECEXP (vals, 0, > > maxelement)); > > + rtx x = force_reg (inner_mode, XVECEXP (vals, 0, maxelement)); > > aarch64_emit_move (target, gen_vec_duplicate (mode, x)); > > } > > > > @@ -22205,7 +22205,7 @@ aarch64_expand_vector_init (rtx target, rtx vals) > > rtx x = XVECEXP (vals, 0, i); > > if (matches[i][0] == maxelement) > > continue; > > - x = copy_to_mode_reg (inner_mode, x); > > + x = force_reg (inner_mode, x); > > emit_insn (GEN_FCN (icode) (target, x, GEN_INT (i))); > > } > > return; > > @@ -22249,7 +22249,7 @@ aarch64_expand_vector_init (rtx target, rtx vals) > > rtx x = XVECEXP (vals, 0, i); > > if (CONST_INT_P (x) || CONST_DOUBLE_P (x)) > > continue; > > - x = copy_to_mode_reg (inner_mode, x); > > + x = force_reg (inner_mode, x); > > emit_insn (GEN_FCN (icode) (target, x, GEN_INT (i))); > > } > > }