On Wed, 13 Apr 2022, Richard Biener wrote:
> On Wed, 13 Apr 2022, Richard Biener wrote:
>
> > On Wed, 13 Apr 2022, Richard Sandiford wrote:
> >
> > > Richard Biener via Gcc-patches <[email protected]> writes:
> > > > The following reverts the original PR105140 fix and instead applies
> > > > the additional fold_convert constraint for VECTOR_TYPE conversions
> > > > to fold_convertible_p as well. I did not try sanitizing
> > > > all of this at this point.
> > > >
> > > > Bootstrapped on x86_64-unknown-linux-gnu, testing in progress.
> > > >
> > > > 2022-04-13 Richard Biener <[email protected]>
> > > >
> > > > PR tree-optimization/105250
> > > > * fold-const.cc (fold_convertible_p): Revert
> > > > r12-7979-geaaf77dd85c333, instead check for size equality
> > > > of the vector types involved.
> > >
> > > This doesn't look right, and I think it'll break SVE. For one
> > > thing, the tree_int_cst_equal check is bound to fail for
> > > variable-length vectors.
> > >
> > > But also, the idea was to allow element-wise conversions between
> > > different vector sizes. For example, you can do a nop/convert
> > > from V4SI to V4DI, which converts 4 SIs to 4 DIs. This is used
> > > a lot for conversions to and from “partial” SVE vectors, where smaller
> > > elements are stored in wider containers.
> >
> > But fold_convertible_p is used as a guard for fold_convert in a lot of
> > places and that will simply ICE when there's a mismatch in size
> > as can be seen in the testcase. Note the code as before the
> > previous fix couldn't really have worked as expected. Is there any
> > testcase that will "break" now?
> >
> > I realize the fold_convertible_p comment says "using a NOP_EXPR" which
> > means it might cover a narrower set of conversions than fold_convert
> > (which will happily use FLOAT_EXPR and friends), but still it should
> > allow fold_convert to build the conversion.
> >
> > The alternative would have been to emit a NOP_EXPR from fold_convert
> > for vector type conversions (with the correct constraints), but then
> > not all targets support those, so we'd need a target support check
> > in fold_convertible_p then?
>
> Btw, fold_convertible_p is currently used in only a few places:
>
> fold-const.cc:fold_convertible_p (const_tree type, const_tree arg)
> ipa-cp.cc: || fold_convertible_p (param_type, value))
> ipa-param-manipulation.cc: if (!fold_convertible_p (TREE_TYPE (origin), arg))
> ipa-prop.cc: if (fold_convertible_p (TREE_TYPE (rhs), v->value))
> tree-inline.cc: if (fold_convertible_p (type, value))
> tree-inline.cc: if (fold_convertible_p (caller_type, var))
>
> all in places that try to deal with type mismatches in IPA (from
> parameters).

Btw, I can't find a tree_int_cst_equal replacement that works for
POLY_INT_CST as well as INTEGER_CST; is there one that I missed?

Richard.

> Richard.
>
> > Richard.
> >
> > > Thanks,
> > > Richard
> > >
> > > >
> > > > * gcc.dg/pr105250.c: New testcase.
> > > > ---
> > > > gcc/fold-const.cc | 7 +++----
> > > > gcc/testsuite/gcc.dg/pr105250.c | 29 +++++++++++++++++++++++++++++
> > > > 2 files changed, 32 insertions(+), 4 deletions(-)
> > > > create mode 100644 gcc/testsuite/gcc.dg/pr105250.c
> > > >
> > > > diff --git a/gcc/fold-const.cc b/gcc/fold-const.cc
> > > > index 7226bc5af01..a57ad0739fb 100644
> > > > --- a/gcc/fold-const.cc
> > > > +++ b/gcc/fold-const.cc
> > > > @@ -2379,13 +2379,12 @@ build_zero_vector (tree type)
> > > > return build_vector_from_val (type, t);
> > > > }
> > > >
> > > > -/* Returns true, if ARG, an operand or a type, is convertible to TYPE
> > > > - using a NOP_EXPR. */
> > > > +/* Returns true, if ARG is convertible to TYPE using a NOP_EXPR. */
> > > >
> > > > bool
> > > > fold_convertible_p (const_tree type, const_tree arg)
> > > > {
> > > > - const_tree orig = TYPE_P (arg) ? arg : TREE_TYPE (arg);
> > > > + const_tree orig = TREE_TYPE (arg);
> > > >
> > > > if (type == orig)
> > > > return true;
> > > > @@ -2417,7 +2416,7 @@ fold_convertible_p (const_tree type, const_tree arg)
> > > > return (VECTOR_TYPE_P (orig)
> > > > && known_eq (TYPE_VECTOR_SUBPARTS (type),
> > > > TYPE_VECTOR_SUBPARTS (orig))
> > > > - && fold_convertible_p (TREE_TYPE (type), TREE_TYPE (orig)));
> > > > + && tree_int_cst_equal (TYPE_SIZE (type), TYPE_SIZE (orig)));
> > > >
> > > > default:
> > > > return false;
> > > > diff --git a/gcc/testsuite/gcc.dg/pr105250.c b/gcc/testsuite/gcc.dg/pr105250.c
> > > > new file mode 100644
> > > > index 00000000000..665dd95d8cb
> > > > --- /dev/null
> > > > +++ b/gcc/testsuite/gcc.dg/pr105250.c
> > > > @@ -0,0 +1,29 @@
> > > > +/* { dg-do compile } */
> > > > +/* { dg-options "-w -Wno-psabi -O2" } */
> > > > +
> > > > +typedef int __attribute__((__vector_size__(4))) T;
> > > > +typedef int __attribute__((__vector_size__(8))) U;
> > > > +typedef int __attribute__((__vector_size__(16))) V;
> > > > +typedef int __attribute__((__vector_size__(32))) W;
> > > > +typedef _Float32 __attribute__((__vector_size__(16))) F;
> > > > +typedef _Float64 __attribute__((__vector_size__(32))) G;
> > > > +void foo();
> > > > +
> > > > +foo(int, int, int, int, U, U, V, V, W, W, int,
> > > > + T, int, U, U, V, V, W, W, T,
> > > > + T, int, U, U, V, V, W, W, T,
> > > > + T, int, W, W, T, T, int, int, int,
> > > > + int, int, int, W, int, int, int, int, int, int,
> > > > + V, W, T, int, int, U, F, int, int, int,
> > > > + int, int, int, G)
> > > > +{
> > > > + foo(0, 0, 0, 0, (U){}, (U){}, (V){}, (V){}, (W){},
> > > > + (W){}, 2, (T){}, 0, 0, 0, 0, (U){}, (U){},
> > > > + (V){}, (V){}, (W){}, (W){}, (T){},
> > > > + (T){}, 0, 0, 0, 0, (U){}, (U){}, (V){},
> > > > + (V){}, (W){}, (W){}, (T){}, (T){}, 0, 0, 0,
> > > > + 0, 0, 0, (T){},
> > > > + (T){}, (W){},
> > > > + (W){}, (T){}, (T){}, 0, 0, 0, 0, 0, 0, (W){},
> > > > + (V){}, (W){}, (T){}, 0, 0, (U){}, (F){});
> > > > +}
> > >
> >
> >
>
>
--
Richard Biener <[email protected]>
SUSE Software Solutions Germany GmbH, Maxfeldstrasse 5, 90409 Nuernberg,
Germany; GF: Ivo Totev; HRB 36809 (AG Nuernberg)