Hi!
On Fri, Sep 04, 2020 at 05:18:49PM +0800, luoxhu wrote:
> On 2020/9/4 15:23, Richard Biener wrote:
> > On Fri, Sep 4, 2020 at 9:19 AM Richard Biener
> > <[email protected]> wrote:
> >> On Fri, Sep 4, 2020 at 8:38 AM luoxhu <[email protected]> wrote:
> >>> On 2020/9/4 14:16, luoxhu via Gcc-patches wrote:
> >>>> Another problem is v[n&3]=i and vec_insert(v, i, n) are generating with
> >>>> different gimple code:
> >>>>
> >>>> {
> >>>> _1 = n & 3;
> >>>> VIEW_CONVERT_EXPR<int[4]>(v1)[_1] = i;
> >>>> }
> >>>>
> >>>> vs:
> >>>>
> >>>> {
> >>>> __vector signed int v1;
> >>>> __vector signed int D.3192;
> >>>> long unsigned int _1;
> >>>> long unsigned int _2;
> >>>> int * _3;
> >>>>
> >>>> <bb 2> [local count: 1073741824]:
> >>>> D.3192 = v_4(D);
> >>>> _1 = n_7(D) & 3;
> >>>> _2 = _1 * 4;
> >>>> _3 = &D.3192 + _2;
> >>>> *_3 = i_8(D);
> >>>> v1_10 = D.3192;
> >>>> return v1_10;
> >>>> }
Not the semantics of vec_insert aren't exactly that.. It doesn't modify
the vector in place, it returns a copy with the modification. But yes,
it could/should just use this same VIEW_CONVERT_EXPR(...)[...] thing for
that.
> >> I think what the GCC vector extensions produce is generally better
> >> so wherever "code generation" for vec_insert resides it should be
> >> adjusted to produce the same code. Same for vec_extract.
Yup.
> > Guess altivec.h, dispatching to __builtin_vec_insert. Wonder why it wasn't
> >
> > #define vec_insert(a,b,c) (a)[c]=(b)
> >
> > anyway, you obviously have some lowering of the builtin somewhere in
> > rs6000.c
> > and thus can adjust that.
> >
>
> Yes, altivec.h use that style for all vector functions, not sure why.
Probably simply because pretty much everything in there is just calling
builtins, everything new follows suit. It is contagious ;-)
> But this could be adjusted by below patch during front end parsing,
> which could also generate "VIEW_CONVERT_EXPR<int[4]>(D.3192)[_1] = i;"
> in gimple, then both v[n&3]=i and vec_insert(v, i, n) could use optabs
> in expander:
>
>
> diff --git a/gcc/config/rs6000/rs6000-c.c b/gcc/config/rs6000/rs6000-c.c
> index 03b00738a5e..00c65311f76 100644
> --- a/gcc/config/rs6000/rs6000-c.c
> +++ b/gcc/config/rs6000/rs6000-c.c
> /* Build *(((arg1_inner_type*)&(vector type){arg1})+arg2) = arg0. */
> @@ -1654,15 +1656,8 @@ altivec_resolve_overloaded_builtin (location_t loc,
> tree fndecl,
> SET_EXPR_LOCATION (stmt, loc);
> stmt = build1 (COMPOUND_LITERAL_EXPR, arg1_type, stmt);
> }
> -
> - innerptrtype = build_pointer_type (arg1_inner_type);
> -
> - stmt = build_unary_op (loc, ADDR_EXPR, stmt, 0);
> - stmt = convert (innerptrtype, stmt);
> - stmt = build_binary_op (loc, PLUS_EXPR, stmt, arg2, 1);
> - stmt = build_indirect_ref (loc, stmt, RO_NULL);
> - stmt = build2 (MODIFY_EXPR, TREE_TYPE (stmt), stmt,
> - convert (TREE_TYPE (stmt), arg0));
> + stmt = build_array_ref (loc, stmt, arg2);
> + stmt = fold_build2 (MODIFY_EXPR, TREE_TYPE (arg0), stmt, arg0);
> stmt = build2 (COMPOUND_EXPR, arg1_type, stmt, decl);
> return stmt;
> }
You should make a copy of the vector, not modify the original one in
place? (If I read that correctly, heh.) Looks good otherwise.
Segher