Re: [PATCH] Further improvements for the (T)(P+A)-(T)(P+B) folding (PR sanitizer/81281)

Richard Biener Fri, 15 Dec 2017 03:15:38 -0800

On Fri, 15 Dec 2017, Jakub Jelinek wrote:

> On Fri, Dec 15, 2017 at 10:28:52AM +0100, Richard Biener wrote:
> > > --- gcc/match.pd.jj       2017-12-07 14:00:51.083048186 +0100
> > > +++ gcc/match.pd  2017-12-07 15:17:49.132784931 +0100
> > > @@ -1784,8 +1784,8 @@ DEFINE_INT_AND_FLOAT_ROUND_FN (RINT)
> > >  
> > >    /* (T)(P + A) - (T)P -> (T) A */
> > >    (simplify
> > > -   (minus (convert (plus:c @0 @1))
> > > -    (convert @0))
> > > +   (minus (convert (plus:c @@0 @1))
> > > +    (convert? @0))
> > >     (if (element_precision (type) <= element_precision (TREE_TYPE (@1))
> > >   /* For integer types, if A has a smaller type
> > >      than T the result depends on the possible
> > > @@ -1794,10 +1794,29 @@ DEFINE_INT_AND_FLOAT_ROUND_FN (RINT)
> > >      However, if an overflow in P + A would cause
> > >      undefined behavior, we can assume that there
> > >      is no overflow.  */
> > > - || (INTEGRAL_TYPE_P (TREE_TYPE (@0))
> > > -     && TYPE_OVERFLOW_UNDEFINED (TREE_TYPE (@0))))
> > > + || (INTEGRAL_TYPE_P (TREE_TYPE (@1))
> > > +     && TYPE_OVERFLOW_UNDEFINED (TREE_TYPE (@1))))
> > 
> > Given @1 and @@0 are in the same plus this change isn't technically
> > necessary but it makes it clearer which type we look at (thus ok).
> 
> My understanding is that it is necessary, because @@0 could have different
> type from @0 and TREE_TYPE (@0) is the type of where @0 is used rather
> than @@0.


Using @@0 also guarantees you to get this specific operand when
later refering to it via @0.

> > >      (convert @1)))
> > >    (simplify
> > > +   (plus (convert (plus @1 INTEGER_CST@0)) INTEGER_CST@2)
> > > +   (with { bool overflow;
> > > +    wide_int w = wi::neg (wi::to_wide (@2), &overflow); }
> > > +    (if (wi::to_widest (@0) == widest_int::from (w, TYPE_SIGN (TREE_TYPE 
> > > (@2)))
> > > +  && (!overflow
> > > +      || (INTEGRAL_TYPE_P (TREE_TYPE (@2))
> > > +          && TYPE_UNSIGNED (TREE_TYPE (@2))))
> > > +  && (element_precision (type) <= element_precision (TREE_TYPE (@1))
> > > +      /* For integer types, if A has a smaller type
> > > +         than T the result depends on the possible
> > > +         overflow in P + A.
> > > +         E.g. T=size_t, A=(unsigned)429497295, P>0.
> > > +         However, if an overflow in P + A would cause
> > > +         undefined behavior, we can assume that there
> > > +         is no overflow.  */
> > > +      || (INTEGRAL_TYPE_P (TREE_TYPE (@1))
> > > +          && TYPE_OVERFLOW_UNDEFINED (TREE_TYPE (@1)))))
> > 
> > I think we don't need to worry about definedness of overflow.  All
> 
> We are talking about
> (int) (x + 0x80000000U) + INT_MIN
> I think you're right that we can still optimize that to (int) x.
> 
> > that matters is whether twos complement arithmetic will simplify
> > the expression to (convert @1).  Specifically the possible overflow
> > of the negation of @2 for the case element_precision (type) <= 
> > element_precision (TREE_TYPE (@1)) shouldn't matter, likewise
> > for the widening case (we'd never get the equality).
> > 
> > Don't we want to compare @0 and -@2 in the type of @2?  Like
> > for (unsigned int)(unsigned-long-x + 0x100000005) + -5U which
> > we should be able to simplify?  For the widening case that would
> > work as well as far as I can see?
> 
> So, we can have several cases, the narrowing one, e.g.:
> (int)(unsigned-long-long-x + 0x100000005ULL) + -5
> (unsigned)(long-long-x + 0x100000005LL) + -5U
> (int)(unsigned-long-long-x + 0x1fffffffbULL) + 5
> (unsigned)(long-long-x + 0x1fffffffbLL) + 5U
> same precision:
> (int)(unsigned-x + 5U) + -5
> (unsigned)(int-x + 5) + -5U
> (int)(unsigned-x + -5U) + 5
> (unsigned)(int-x + -5) + 5
> and widening ones:
> (long long)(int-x + 5) + -5LL
> (unsigned long long)(int-x + 5) + -5ULL
> (long long)(int-x + -5) + 5LL
> (unsigned long long)(int-x + -5) + 5ULL
> You mean we should effectively (though on wide_int/widest_int)
> fold_unary (MINUS_EXPR, TREE_TYPE (@2), fold_convert (TREE_TYPE (@2), @0))
> and compare that to @2?

I think so.

> > If you can split out this new pattern the rest is ok with honoring
> > the comment below.
> 
> Ok (will need to comment out the corresponding testcase, done below).
> 
> > > -  || (INTEGRAL_TYPE_P (TREE_TYPE (@0))
> > > -      && TYPE_OVERFLOW_UNDEFINED (TREE_TYPE (@0))))
> > > +    (if (((element_precision (type) <= element_precision (TREE_TYPE 
> > > (@1)))
> > > +   == (element_precision (type) <= element_precision (TREE_TYPE (@1))))
> 
> Yeah, this @1 above should have been @2.  Thanks for catching this.
> 
> So for now like this?

Yes.

Thanks,
Richard.

> 2017-12-15  Jakub Jelinek  <ja...@redhat.com>
> 
>       PR sanitizer/81281
>       * match.pd ((T)(P + A) - (T)P -> (T) A): Use @@0 instead of @0 and
>       convert? on @0 instead of convert.  Check type of @1, not @0.
>       ((T)P - (T)(P + A) -> -(T) A): Use @@0 instead of @0 and
>       convert? on @0 instead of convert.  Check type of @1, not @0.
>       ((T)(P + A) - (T)(P + B) -> (T)A - (T)B): Use @@0 instead of @0,
>       only optimize if either both @1 and @2 types are narrower
>       precision, or both are wider or equal precision, and in the former
>       case only if both have undefined overflow.
> 
>       * gcc.dg/pr81281-3.c: New test.
> 
> --- gcc/match.pd.jj   2017-12-07 18:04:54.580750329 +0100
> +++ gcc/match.pd      2017-12-15 11:52:22.582118364 +0100
> @@ -1784,8 +1784,8 @@ DEFINE_INT_AND_FLOAT_ROUND_FN (RINT)
>  
>    /* (T)(P + A) - (T)P -> (T) A */
>    (simplify
> -   (minus (convert (plus:c @0 @1))
> -    (convert @0))
> +   (minus (convert (plus:c @@0 @1))
> +    (convert? @0))
>     (if (element_precision (type) <= element_precision (TREE_TYPE (@1))
>       /* For integer types, if A has a smaller type
>          than T the result depends on the possible
> @@ -1794,8 +1794,8 @@ DEFINE_INT_AND_FLOAT_ROUND_FN (RINT)
>          However, if an overflow in P + A would cause
>          undefined behavior, we can assume that there
>          is no overflow.  */
> -     || (INTEGRAL_TYPE_P (TREE_TYPE (@0))
> -         && TYPE_OVERFLOW_UNDEFINED (TREE_TYPE (@0))))
> +     || (INTEGRAL_TYPE_P (TREE_TYPE (@1))
> +         && TYPE_OVERFLOW_UNDEFINED (TREE_TYPE (@1))))
>      (convert @1)))
>    (simplify
>     (minus (convert (pointer_plus @@0 @1))
> @@ -1818,8 +1818,8 @@ DEFINE_INT_AND_FLOAT_ROUND_FN (RINT)
>  
>    /* (T)P - (T)(P + A) -> -(T) A */
>    (simplify
> -   (minus (convert @0)
> -    (convert (plus:c @0 @1)))
> +   (minus (convert? @0)
> +    (convert (plus:c @@0 @1)))
>     (if (INTEGRAL_TYPE_P (type)
>       && TYPE_OVERFLOW_UNDEFINED (type)
>       && element_precision (type) <= element_precision (TREE_TYPE (@1)))
> @@ -1833,8 +1833,8 @@ DEFINE_INT_AND_FLOAT_ROUND_FN (RINT)
>           However, if an overflow in P + A would cause
>           undefined behavior, we can assume that there
>           is no overflow.  */
> -      || (INTEGRAL_TYPE_P (TREE_TYPE (@0))
> -          && TYPE_OVERFLOW_UNDEFINED (TREE_TYPE (@0))))
> +      || (INTEGRAL_TYPE_P (TREE_TYPE (@1))
> +          && TYPE_OVERFLOW_UNDEFINED (TREE_TYPE (@1))))
>       (negate (convert @1)))))
>    (simplify
>     (minus (convert @0)
> @@ -1862,23 +1862,28 @@ DEFINE_INT_AND_FLOAT_ROUND_FN (RINT)
>  
>    /* (T)(P + A) - (T)(P + B) -> (T)A - (T)B */
>    (simplify
> -   (minus (convert (plus:c @0 @1))
> +   (minus (convert (plus:c @@0 @1))
>      (convert (plus:c @0 @2)))
>     (if (INTEGRAL_TYPE_P (type)
>       && TYPE_OVERFLOW_UNDEFINED (type)
> -     && element_precision (type) <= element_precision (TREE_TYPE (@1)))
> +     && element_precision (type) <= element_precision (TREE_TYPE (@1))
> +     && element_precision (type) <= element_precision (TREE_TYPE (@2)))
>      (with { tree utype = unsigned_type_for (type); }
>       (convert (minus (convert:utype @1) (convert:utype @2))))
> -    (if (element_precision (type) <= element_precision (TREE_TYPE (@1))
> -      /* For integer types, if A has a smaller type
> -         than T the result depends on the possible
> -         overflow in P + A.
> -         E.g. T=size_t, A=(unsigned)429497295, P>0.
> -         However, if an overflow in P + A would cause
> -         undefined behavior, we can assume that there
> -         is no overflow.  */
> -      || (INTEGRAL_TYPE_P (TREE_TYPE (@0))
> -          && TYPE_OVERFLOW_UNDEFINED (TREE_TYPE (@0))))
> +    (if (((element_precision (type) <= element_precision (TREE_TYPE (@1)))
> +       == (element_precision (type) <= element_precision (TREE_TYPE (@2))))
> +      && (element_precision (type) <= element_precision (TREE_TYPE (@1))
> +          /* For integer types, if A has a smaller type
> +             than T the result depends on the possible
> +             overflow in P + A.
> +             E.g. T=size_t, A=(unsigned)429497295, P>0.
> +             However, if an overflow in P + A would cause
> +             undefined behavior, we can assume that there
> +             is no overflow.  */
> +          || (INTEGRAL_TYPE_P (TREE_TYPE (@1))
> +              && INTEGRAL_TYPE_P (TREE_TYPE (@2))
> +              && TYPE_OVERFLOW_UNDEFINED (TREE_TYPE (@1))
> +              && TYPE_OVERFLOW_UNDEFINED (TREE_TYPE (@2)))))
>       (minus (convert @1) (convert @2)))))
>    (simplify
>     (minus (convert (pointer_plus @@0 @1))
> --- gcc/testsuite/gcc.dg/pr81281-3.c.jj       2017-12-15 11:51:38.294654394 
> +0100
> +++ gcc/testsuite/gcc.dg/pr81281-3.c  2017-12-15 11:56:08.097388860 +0100
> @@ -0,0 +1,105 @@
> +/* PR sanitizer/81281 */
> +/* { dg-do compile } */
> +/* { dg-options "-O2 -fdump-tree-optimized" } */
> +/* { dg-final { scan-tree-dump-not "\[+=-] \?123\[ ;]" "optimized" } } */
> +
> +#ifdef __SIZEOF_INT128__
> +__int128
> +f1 (int a, long long b)
> +{
> +  __int128 f = 123 + a;
> +  __int128 g = 123 + b;
> +  return f - g;
> +}
> +#endif
> +
> +signed char
> +f2 (int a, long long b)
> +{
> +  signed char f = 123 + a;
> +  signed char g = 123 + b;
> +  return f - g;
> +}
> +
> +signed char
> +f3 (unsigned int a, unsigned long long b)
> +{
> +  signed char f = 123 + a;
> +  signed char g = 123 + b;
> +  return f - g;
> +}
> +
> +unsigned char
> +f4 (unsigned int a, unsigned long long b)
> +{
> +  unsigned char f = 123 + a;
> +  unsigned char g = 123 + b;
> +  return f - g;
> +}
> +
> +/* This isn't optimized yet.  */
> +#if 0
> +long long
> +f5 (int a)
> +{
> +  long long f = 123 + a;
> +  long long g = 123;
> +  return f - g;
> +}
> +#endif
> +
> +signed char
> +f6 (long long a)
> +{
> +  signed char f = 123 + a;
> +  signed char g = 123;
> +  return f - g;
> +}
> +
> +signed char
> +f7 (unsigned int a)
> +{
> +  signed char f = 123 + a;
> +  signed char g = 123;
> +  return f - g;
> +}
> +
> +unsigned char
> +f8 (unsigned long int a)
> +{
> +  unsigned char f = 123 + a;
> +  unsigned char g = 123;
> +  return f - g;
> +}
> +
> +long long
> +f9 (int a)
> +{
> +  long long f = 123;
> +  long long g = 123 + a;
> +  return f - g;
> +}
> +
> +signed char
> +f10 (long long a)
> +{
> +  signed char f = 123;
> +  signed char g = 123 + a;
> +  return f - g;
> +}
> +
> +signed char
> +f11 (unsigned int a)
> +{
> +  signed char f = 123;
> +  signed char g = 123 + a;
> +  return f - g;
> +}
> +
> +unsigned char
> +f12 (unsigned long int a)
> +{
> +  unsigned char f = 123;
> +  unsigned char g = 123 + a;
> +  return f - g;
> +}
> 
> 
>       Jakub
> 
> 

-- 
Richard Biener <rguent...@suse.de>
SUSE LINUX GmbH, GF: Felix Imendoerffer, Jane Smithard, Graham Norton, HRB 
21284 (AG Nuernberg)

Re: [PATCH] Further improvements for the (T)(P+A)-(T)(P+B) folding (PR sanitizer/81281)

Reply via email to