On Tue, Aug 13, 2019 at 2:33 PM Richard Biener <rguent...@suse.de> wrote: > > > The following splits out the change that makes the live-range split / > mode change copy pseudos vector mode pseudos. I've tried reproducing > the issue I faced with SImode chains but failed likely because we're > never using simple moves in the end while for SImode we can end up > with > > (set (subreg:V4SI (reg:SI 42) 0) (subreg:V4SI (reg:SI 41) 0)) > > which we happily propagate out. So pursuing this patch independently > doesn't make much sense. Still the main change is in > dimode_scalar_chain::make_vector_copies where 'vreg' is now V2DImode > instead of DImode and the change to emit_move_insn (vreg, tmp) > hints at the change done to the above problematic insn. > > The rest of the changes deal with some regs already being in > the appropriate vector mode. > > I realize I might have avoided my original issue by emitting not > the above reg-reg move but > > (set (subreg:V4SI (reg:SI 42) 0) > (vec_merge:V4SI > (vec_duplicate:V4SI (reg:SI 41)) > (const_vec [0 0 0 0]) > (const_1))) > > but I didn't try (the full patch contained a testcase for the > issue). Still it seems odd to use DImode pseudos when all uses > are wrapped in (subreg:V2DI ...).
It looks to me that the above is the way to go. make_vector_copies creates a scalar pseudo, and all other support functions expect a scalar that will be processed with "replace_with_subreg". I think that the safest way is to emit the code above for SImode for the problematic move insn, and > + emit_move_insn (gen_rtx_SUBREG (V2DImode, vreg 0), > + gen_rtx_VEC_MERGE (V2DImode, > + gen_rtx_VEC_DUPLICATE > (V2DImode, > + tmp), > + CONST0_RTX (V2DImode), > + GEN_INT (HOST_WIDE_INT_1U))); for DImode. This way, you won't need other changes (conditional generation of SUBREGs), the above should be enough. Oh, and don't use emit_move_insn, emit insn with gen_rtx_SET (gen_rtx_SUBREG (...)) should do the trick. Uros. > As (unrelated) change I propose to error with fatal_insn_not_found > in case recog doesn't recognize the altered insns which make it > easier to spot errors early. > > Bootstrap / regtest running on x86_64-unknown-linux-gnu ontop of [1/2], OK > for trunk if that succeeds? > > Thanks, > Richard. > > 2019-08-13 Richard Biener <rguent...@suse.de> > > * config/i386/i386-features.c > (dimode_scalar_chain::replace_with_subreg): Elide the subreg if the > reg is already vector. > (dimode_scalar_chain::convert_op): Likewise. > (dimode_scalar_chain::make_vector_copies): Use a vector-mode pseudo as > destination to avoid later passes copy-propagating it out. > (dimode_chain::convert_insn): Convert the source of a > memory op in case it is not of the appropriate scalar mode. > Raise fatal_insn_not_found if the converted insn is not recognized. > > Index: gcc/config/i386/i386-features.c > =================================================================== > --- gcc/config/i386/i386-features.c (revision 274328) > +++ gcc/config/i386/i386-features.c (working copy) > @@ -573,7 +573,8 @@ rtx > dimode_scalar_chain::replace_with_subreg (rtx x, rtx reg, rtx new_reg) > { > if (x == reg) > - return gen_rtx_SUBREG (V2DImode, new_reg, 0); > + return (GET_MODE (new_reg) == V2DImode > + ? new_reg : gen_rtx_SUBREG (V2DImode, new_reg, 0)); > > const char *fmt = GET_RTX_FORMAT (GET_CODE (x)); > int i, j; > @@ -627,7 +628,7 @@ void > dimode_scalar_chain::make_vector_copies (unsigned regno) > { > rtx reg = regno_reg_rtx[regno]; > - rtx vreg = gen_reg_rtx (DImode); > + rtx vreg = gen_reg_rtx (V2DImode); > df_ref ref; > > for (ref = DF_REG_DEF_CHAIN (regno); ref; ref = DF_REF_NEXT_REG (ref)) > @@ -641,7 +642,12 @@ dimode_scalar_chain::make_vector_copies > gen_rtx_SUBREG (SImode, reg, 0)); > emit_move_insn (adjust_address (tmp, SImode, 4), > gen_rtx_SUBREG (SImode, reg, 4)); > - emit_move_insn (vreg, tmp); > + emit_move_insn (vreg, > + gen_rtx_VEC_MERGE (V2DImode, > + gen_rtx_VEC_DUPLICATE > (V2DImode, > + tmp), > + CONST0_RTX (V2DImode), > + GEN_INT (HOST_WIDE_INT_1U))); > } > else if (TARGET_SSE4_1) > { > @@ -841,7 +847,8 @@ dimode_scalar_chain::convert_op (rtx *op > gcc_assert (!DF_REF_CHAIN (ref)); > break; > } > - *op = gen_rtx_SUBREG (V2DImode, *op, 0); > + if (GET_MODE (*op) != V2DImode) > + *op = gen_rtx_SUBREG (V2DImode, *op, 0); > } > else if (CONST_INT_P (*op)) > { > @@ -931,6 +938,8 @@ dimode_scalar_chain::convert_insn (rtx_i > case MEM: > if (!REG_P (dst)) > convert_op (&src, insn); > + else if (GET_MODE (src) != DImode) > + src = gen_rtx_SUBREG (DImode, src, 0); > break; > > case REG: > @@ -977,7 +986,9 @@ dimode_scalar_chain::convert_insn (rtx_i > PATTERN (insn) = def_set; > > INSN_CODE (insn) = -1; > - recog_memoized (insn); > + int patt = recog_memoized (insn); > + if (patt == -1) > + fatal_insn_not_found (insn); > df_insn_rescan (insn); > } >