On Thu, Oct 27, 2011 at 9:54 PM, Martin Jambor <mjam...@suse.cz> wrote:
> Hi,
>
> On Thu, Oct 27, 2011 at 11:06:02AM +0200, Richard Guenther wrote:
>> On Thu, Oct 27, 2011 at 1:22 AM, Martin Jambor <mjam...@suse.cz> wrote:
>> > Hi,
>> >
>> > I've been asked by Maxim Kuvyrkov to revive the following patch which
>> > has not made it to 4.6.  Currently, when type based devirtualization
>> > detects a potential type change, it simply gives up on gathering any
>> > information on the object in question.  This patch adds an attempt to
>> > actually detect the new type after the change.
>> >
>> > Maxim claimed this (and another patch I'll post tomorrow) noticeably
>> > improved performance of some real code.  I can only offer a rather
>> > artificial example in the attachment.  When the constructors are
>> > inlined but the function multiply_matrices is not, this patch makes
>> > the produced executable run for only 7 seconds instead of about 20 on
>> > my 4 year old i686 desktop (with -Ofast).
>> >
>> > Anyway, the patch passes bootstrap and testsuite on x86_64-linux.
>> > What do you think, is it a good idea for trunk now?
>> >
>> > Thanks,
>> >
>> > Martin
>> >
>> >
>> > 2011-10-21  Martin Jambor  <mjam...@suse.cz>
>> >
>> >        * ipa-prop.c (type_change_info): New fields object, 
>> > known_current_type
>> >        and multiple_types_encountered.
>> >        (extr_type_from_vtbl_ptr_store): New function.
>> >        (check_stmt_for_type_change): Use it, set 
>> > multiple_types_encountered if
>> >        the result is different from the previous one.
>> >        (detect_type_change): Renamed to detect_type_change_1. New parameter
>> >        comp_type.  Set up new fields in tci, build known type jump
>> >        functions if the new type can be identified.
>> >        (detect_type_change): New function.
>> >        * tree.h (DECL_CONTEXT): Comment new use.
>> >
>> >        * testsuite/g++.dg/ipa/devirt-c-1.C: Add dump scans.
>> >        * testsuite/g++.dg/ipa/devirt-c-2.C: Likewise.
>> >        * testsuite/g++.dg/ipa/devirt-c-7.C: New test.
>> >
>> >
>> > Index: src/gcc/ipa-prop.c
>> > ===================================================================
>> > --- src.orig/gcc/ipa-prop.c
>> > +++ src/gcc/ipa-prop.c
>> > @@ -271,8 +271,17 @@ ipa_print_all_jump_functions (FILE *f)
>> >
>> >  struct type_change_info
>> >  {
>> > +  /* The declaration or SSA_NAME pointer of the base that we are checking 
>> > for
>> > +     type change.  */
>> > +  tree object;
>> > +  /* If we actually can tell the type that the object has changed to, it 
>> > is
>> > +     stored in this field.  Otherwise it remains NULL_TREE.  */
>> > +  tree known_current_type;
>> >   /* Set to true if dynamic type change has been detected.  */
>> >   bool type_maybe_changed;
>> > +  /* Set to true if multiple types have been encountered.  
>> > known_current_type
>> > +     must be disregarded in that case.  */
>> > +  bool multiple_types_encountered;
>> >  };
>> >
>> >  /* Return true if STMT can modify a virtual method table pointer.
>> > @@ -338,6 +347,49 @@ stmt_may_be_vtbl_ptr_store (gimple stmt)
>> >   return true;
>> >  }
>> >
>> > +/* If STMT can be proved to be an assignment to the virtual method table
>> > +   pointer of ANALYZED_OBJ and the type associated with the new table
>> > +   identified, return the type.  Otherwise return NULL_TREE.  */
>> > +
>> > +static tree
>> > +extr_type_from_vtbl_ptr_store (gimple stmt, tree analyzed_obj)
>> > +{
>> > +  tree lhs, t, obj;
>> > +
>> > +  if (!is_gimple_assign (stmt))
>>
>> gimple_assign_single_p (stmt)
>
> OK.
>
>>
>> > +    return NULL_TREE;
>> > +
>> > +  lhs = gimple_assign_lhs (stmt);
>> > +
>> > +  if (TREE_CODE (lhs) != COMPONENT_REF)
>> > +    return NULL_TREE;
>> > +  obj = lhs;
>> > +
>> > +  if (!DECL_VIRTUAL_P (TREE_OPERAND (lhs, 1)))
>> > +    return NULL_TREE;
>> > +
>> > +  do
>> > +    {
>> > +      obj = TREE_OPERAND (obj, 0);
>> > +    }
>> > +  while (TREE_CODE (obj) == COMPONENT_REF);
>>
>> You do not allow other components than component-refs (thus, for
>> example an ARRAY_REF - that is for a reason?).  Please add
>> a comment why.  Otherwise this whole sequence would look like
>> it should be replaceable by get_base_address (obj).
>>
>
> I guess I might have been overly conservative here, ARRAY_REFs are
> fine.  get_base_address only digs into MEM_REFs if they are based on
> an ADDR_EXPR while I do so always.  But I can check that either both
> obj and analyzed_obj are a MEM_REF of the same SSA_NAME or they are
> the same thing (i.e. the same decl)... which even feels a bit cleaner,
> so I did that.

Well, as you are looking for a must-change-type pattern I think you cannot
simply ignore offsets.  Consider

T a[10];

new (T') (&a[9]);
a[8]->foo();

where the must-type-change on a[9] is _not_ changing the type of a[8]!

Similar cases might happen with

class Compound { T a; T b; };

no?

Please think about the difference must vs. may-type-change for these
cases.  I'm not convinced that the must-type-change code is correct.

Richard.


>
>> > +  if (TREE_CODE (obj) == MEM_REF)
>> > +    obj = TREE_OPERAND (obj, 0);
>> > +  if (TREE_CODE (obj) == ADDR_EXPR)
>> > +    obj = TREE_OPERAND (obj, 0);
>> > +  if (obj != analyzed_obj)
>> > +    return NULL_TREE;
>> > +
>> > +  t = gimple_assign_rhs1 (stmt);
>> > +  if (TREE_CODE (t) != ADDR_EXPR)
>> > +    return NULL_TREE;
>> > +  t = get_base_address (TREE_OPERAND (t, 0));
>>
>> Do we actually ever store sth else than the plain address of a
>> virtual table (thus, &decl)?  If so, what difference makes the offset
>> into the virtual table we get in this way?  I ask, because ...
>>
>> > +  if (!t || TREE_CODE (t) != VAR_DECL || !DECL_VIRTUAL_P (t))
>> > +    return NULL_TREE;
>> > +
>> > +  return DECL_CONTEXT (t);
>>
>> ... DECL_CONTEXT (t) is of course independent of any offset.
>>
>> Thus I think you should drop the call to get_base_address () and
>> instead just look at TREE_OPERAND (t, 0).
>
> Even the simplest cases of assignments have basically a form like
> this:
>
>  this_1(D)->_vptr.A = &MEM[(void *)&_ZTV1A + 8B];
>
> so I need to extract the actual VAR_DECL somehow.  But of course I can
> do that by looking through the MEM_REF and ADDR_EXPR by hand, if
> preferable.
>
>>
>> > +}
>> > +
>> >  /* Callback of walk_aliased_vdefs and a helper function for
>> >    detect_type_change to check whether a particular statement may modify
>> >    the virtual table pointer, and if possible also determine the new type 
>> > of
>> > @@ -352,6 +404,12 @@ check_stmt_for_type_change (ao_ref *ao A
>> >
>> >   if (stmt_may_be_vtbl_ptr_store (stmt))
>> >     {
>> > +      tree type;
>> > +      type = extr_type_from_vtbl_ptr_store (stmt, tci->object);
>> > +      if (tci->type_maybe_changed
>> > +         && type != tci->known_current_type)
>> > +       tci->multiple_types_encountered = true;
>> > +      tci->known_current_type = type;
>> >       tci->type_maybe_changed = true;
>> >       return true;
>> >     }
>> > @@ -359,19 +417,19 @@ check_stmt_for_type_change (ao_ref *ao A
>> >     return false;
>> >  }
>> >
>> > -/* Detect whether the dynamic type of ARG has changed (before callsite 
>> > CALL) by
>> > -   looking for assignments to its virtual table pointer.  If it is, 
>> > return true
>> > -   and fill in the jump function JFUNC with relevant type information or 
>> > set it
>> > -   to unknown.  ARG is the object itself (not a pointer to it, unless
>> > -   dereferenced).  BASE is the base of the memory access as returned by
>> > -   get_ref_base_and_extent, as is the offset.  */
>> > +
>> > +
>> > +/* Like detect_type_change but with extra argument COMP_TYPE which will 
>> > become
>> > +   the component type part of new JFUNC of dynamic type change is 
>> > detected and
>> > +   the new base type is identified.  */
>> >
>> >  static bool
>> > -detect_type_change (tree arg, tree base, gimple call,
>> > -                   struct ipa_jump_func *jfunc, HOST_WIDE_INT offset)
>> > +detect_type_change_1 (tree arg, tree base, tree comp_type, gimple call,
>> > +                     struct ipa_jump_func *jfunc, HOST_WIDE_INT offset)
>> >  {
>> >   struct type_change_info tci;
>> >   ao_ref ao;
>> > +  tree t;
>> >
>> >   gcc_checking_assert (DECL_P (arg)
>> >                       || TREE_CODE (arg) == MEM_REF
>> > @@ -381,8 +439,6 @@ detect_type_change (tree arg, tree base,
>> >   if (!flag_devirtualize || !gimple_vuse (call))
>> >     return false;
>> >
>> > -  tci.type_maybe_changed = false;
>> > -
>> >   ao.ref = arg;
>> >   ao.base = base;
>> >   ao.offset = offset;
>> > @@ -391,15 +447,51 @@ detect_type_change (tree arg, tree base,
>> >   ao.ref_alias_set = -1;
>> >   ao.base_alias_set = -1;
>> >
>> > +  t = arg;
>> > +  while (handled_component_p (t))
>> > +    t = TREE_OPERAND (t, 0);
>>
>> Here you are using handled_component_p, thus allow ARRAY_REFs ...
>> but you are not trying to distinguish a[1] from a[2] anywhere, so this
>> looks bogus and instead should look like the case above (with a comment
>> as to why looking at ARRAY_REFs would be bogus).
>
> I changed both to get_base_address as described above.
>
> This is the new version of the patch which I'm currently bootstrapping
> and testing on an x86_64-linux.  OK for trunk if it passes?  Thanks.
>
> Martin
>
>
> 2011-10-27  Martin Jambor  <mjam...@suse.cz>
>
>        * ipa-prop.c (type_change_info): New fields object, known_current_type
>        and multiple_types_encountered.
>        (extr_type_from_vtbl_ptr_store): New function.
>        (check_stmt_for_type_change): Use it, set multiple_types_encountered if
>        the result is different from the previous one.
>        (detect_type_change): Renamed to detect_type_change_1. New parameter
>        comp_type.  Set up new fields in tci, build known type jump
>        functions if the new type can be identified.
>        (detect_type_change): New function.
>        * tree.h (DECL_CONTEXT): Comment new use.
>
>        * testsuite/g++.dg/ipa/devirt-c-1.C: Add dump scans.
>        * testsuite/g++.dg/ipa/devirt-c-2.C: Likewise.
>        * testsuite/g++.dg/ipa/devirt-c-7.C: New test.
>
>
> Index: src/gcc/ipa-prop.c
> ===================================================================
> --- src.orig/gcc/ipa-prop.c
> +++ src/gcc/ipa-prop.c
> @@ -271,8 +271,17 @@ ipa_print_all_jump_functions (FILE *f)
>
>  struct type_change_info
>  {
> +  /* The declaration or SSA_NAME pointer of the base that we are checking for
> +     type change.  */
> +  tree object;
> +  /* If we actually can tell the type that the object has changed to, it is
> +     stored in this field.  Otherwise it remains NULL_TREE.  */
> +  tree known_current_type;
>   /* Set to true if dynamic type change has been detected.  */
>   bool type_maybe_changed;
> +  /* Set to true if multiple types have been encountered.  known_current_type
> +     must be disregarded in that case.  */
> +  bool multiple_types_encountered;
>  };
>
>  /* Return true if STMT can modify a virtual method table pointer.
> @@ -338,6 +347,44 @@ stmt_may_be_vtbl_ptr_store (gimple stmt)
>   return true;
>  }
>
> +/* If STMT can be proved to be an assignment to the virtual method table
> +   pointer of ANALYZED_OBJ and the type associated with the new table
> +   identified, return the type.  Otherwise return NULL_TREE.  */
> +
> +static tree
> +extr_type_from_vtbl_ptr_store (gimple stmt, tree analyzed_obj)
> +{
> +  tree lhs, t, obj;
> +
> +  if (!gimple_assign_single_p (stmt))
> +    return NULL_TREE;
> +
> +  lhs = gimple_assign_lhs (stmt);
> +
> +  if (TREE_CODE (lhs) != COMPONENT_REF
> +      || !DECL_VIRTUAL_P (TREE_OPERAND (lhs, 1)))
> +    return NULL_TREE;
> +
> +  obj = get_base_address (lhs);
> +  if (TREE_CODE (obj) == MEM_REF)
> +    {
> +      if (TREE_CODE (analyzed_obj) != MEM_REF
> +         || TREE_OPERAND (analyzed_obj, 0) != TREE_OPERAND (obj, 0))
> +       return NULL_TREE;
> +    }
> +  else if (analyzed_obj != obj)
> +    return NULL_TREE;
> +
> +  t = gimple_assign_rhs1 (stmt);
> +  if (TREE_CODE (t) != ADDR_EXPR)
> +    return NULL_TREE;
> +  t = get_base_address (TREE_OPERAND (t, 0));
> +  if (!t || TREE_CODE (t) != VAR_DECL || !DECL_VIRTUAL_P (t))
> +    return NULL_TREE;
> +
> +  return DECL_CONTEXT (t);
> +}
> +
>  /* Callback of walk_aliased_vdefs and a helper function for
>    detect_type_change to check whether a particular statement may modify
>    the virtual table pointer, and if possible also determine the new type of
> @@ -352,6 +399,12 @@ check_stmt_for_type_change (ao_ref *ao A
>
>   if (stmt_may_be_vtbl_ptr_store (stmt))
>     {
> +      tree type;
> +      type = extr_type_from_vtbl_ptr_store (stmt, tci->object);
> +      if (tci->type_maybe_changed
> +         && type != tci->known_current_type)
> +       tci->multiple_types_encountered = true;
> +      tci->known_current_type = type;
>       tci->type_maybe_changed = true;
>       return true;
>     }
> @@ -359,16 +412,15 @@ check_stmt_for_type_change (ao_ref *ao A
>     return false;
>  }
>
> -/* Detect whether the dynamic type of ARG has changed (before callsite CALL) 
> by
> -   looking for assignments to its virtual table pointer.  If it is, return 
> true
> -   and fill in the jump function JFUNC with relevant type information or set 
> it
> -   to unknown.  ARG is the object itself (not a pointer to it, unless
> -   dereferenced).  BASE is the base of the memory access as returned by
> -   get_ref_base_and_extent, as is the offset.  */
> +
> +
> +/* Like detect_type_change but with extra argument COMP_TYPE which will 
> become
> +   the component type part of new JFUNC of dynamic type change is detected 
> and
> +   the new base type is identified.  */
>
>  static bool
> -detect_type_change (tree arg, tree base, gimple call,
> -                   struct ipa_jump_func *jfunc, HOST_WIDE_INT offset)
> +detect_type_change_1 (tree arg, tree base, tree comp_type, gimple call,
> +                     struct ipa_jump_func *jfunc, HOST_WIDE_INT offset)
>  {
>   struct type_change_info tci;
>   ao_ref ao;
> @@ -381,8 +433,6 @@ detect_type_change (tree arg, tree base,
>   if (!flag_devirtualize || !gimple_vuse (call))
>     return false;
>
> -  tci.type_maybe_changed = false;
> -
>   ao.ref = arg;
>   ao.base = base;
>   ao.offset = offset;
> @@ -391,15 +441,44 @@ detect_type_change (tree arg, tree base,
>   ao.ref_alias_set = -1;
>   ao.base_alias_set = -1;
>
> +  tci.object = get_base_address (arg);
> +  tci.known_current_type = NULL_TREE;
> +  tci.type_maybe_changed = false;
> +  tci.multiple_types_encountered = false;
> +
>   walk_aliased_vdefs (&ao, gimple_vuse (call), check_stmt_for_type_change,
>                      &tci, NULL);
>   if (!tci.type_maybe_changed)
>     return false;
>
> -  jfunc->type = IPA_JF_UNKNOWN;
> +  if (!tci.known_current_type
> +      || tci.multiple_types_encountered
> +      || offset != 0)
> +    jfunc->type = IPA_JF_UNKNOWN;
> +  else
> +    {
> +      jfunc->type = IPA_JF_KNOWN_TYPE;
> +      jfunc->value.known_type.base_type = tci.known_current_type;
> +      jfunc->value.known_type.component_type = comp_type;
> +    }
> +
>   return true;
>  }
>
> +/* Detect whether the dynamic type of ARG has changed (before callsite CALL) 
> by
> +   looking for assignments to its virtual table pointer.  If it is, return 
> true
> +   and fill in the jump function JFUNC with relevant type information or set 
> it
> +   to unknown.  ARG is the object itself (not a pointer to it, unless
> +   dereferenced).  BASE is the base of the memory access as returned by
> +   get_ref_base_and_extent, as is the offset.  */
> +
> +static bool
> +detect_type_change (tree arg, tree base, gimple call,
> +                   struct ipa_jump_func *jfunc, HOST_WIDE_INT offset)
> +{
> +  return detect_type_change_1 (arg, base, TREE_TYPE (arg), call, jfunc, 
> offset);
> +}
> +
>  /* Like detect_type_change but ARG is supposed to be a non-dereferenced 
> pointer
>    SSA name (its dereference will become the base and the offset is assumed to
>    be zero).  */
> @@ -407,16 +486,19 @@ detect_type_change (tree arg, tree base,
>  static bool
>  detect_type_change_ssa (tree arg, gimple call, struct ipa_jump_func *jfunc)
>  {
> +  tree comp_type;
> +
>   gcc_checking_assert (TREE_CODE (arg) == SSA_NAME);
>   if (!flag_devirtualize
>       || !POINTER_TYPE_P (TREE_TYPE (arg))
>       || TREE_CODE (TREE_TYPE (TREE_TYPE (arg))) != RECORD_TYPE)
>     return false;
>
> +  comp_type = TREE_TYPE (TREE_TYPE (arg));
>   arg = build2 (MEM_REF, ptr_type_node, arg,
> -                build_int_cst (ptr_type_node, 0));
> +               build_int_cst (ptr_type_node, 0));
>
> -  return detect_type_change (arg, arg, call, jfunc, 0);
> +  return detect_type_change_1 (arg, arg, comp_type, call, jfunc, 0);
>  }
>
>  /* Callback of walk_aliased_vdefs.  Flags that it has been invoked to the
> Index: src/gcc/testsuite/g++.dg/ipa/devirt-c-1.C
> ===================================================================
> --- src.orig/gcc/testsuite/g++.dg/ipa/devirt-c-1.C
> +++ src/gcc/testsuite/g++.dg/ipa/devirt-c-1.C
> @@ -1,7 +1,7 @@
>  /* Verify that ipa-cp correctly detects the dynamic type of an object
>    under construction when doing devirtualization.  */
>  /* { dg-do run } */
> -/* { dg-options "-O3 -fno-early-inlining -fno-inline"  } */
> +/* { dg-options "-O3 -fno-early-inlining -fno-inline -fdump-ipa-cp 
> -fdump-tree-optimized"  } */
>
>  extern "C" void abort (void);
>
> @@ -69,3 +69,8 @@ int main (int argc, char *argv[])
>     bah ();
>   return 0;
>  }
> +
> +/* { dg-final { scan-ipa-dump "Discovered a virtual call to a known 
> target.*A::foo"  "cp"  } } */
> +/* { dg-final { scan-tree-dump-times "OBJ_TYPE_REF" 0 "optimized"} } */
> +/* { dg-final { cleanup-ipa-dump "cp" } } */
> +/* { dg-final { cleanup-tree-dump "optimized" } } */
> Index: src/gcc/testsuite/g++.dg/ipa/devirt-c-2.C
> ===================================================================
> --- src.orig/gcc/testsuite/g++.dg/ipa/devirt-c-2.C
> +++ src/gcc/testsuite/g++.dg/ipa/devirt-c-2.C
> @@ -1,7 +1,7 @@
>  /* Verify that ipa-cp correctly detects the dynamic type of an object
>    under construction when doing devirtualization.  */
>  /* { dg-do run } */
> -/* { dg-options "-O3 -fno-early-inlining -fno-inline"  } */
> +/* { dg-options "-O3 -fno-early-inlining -fno-inline -fdump-ipa-cp 
> -fdump-tree-optimized"  } */
>
>  extern "C" void abort (void);
>
> @@ -77,3 +77,8 @@ int main (int argc, char *argv[])
>     bah ();
>   return 0;
>  }
> +
> +/* { dg-final { scan-ipa-dump "Discovered a virtual call to a known 
> target.*A::foo"  "cp"  } } */
> +/* { dg-final { scan-tree-dump-times "OBJ_TYPE_REF" 0 "optimized"} } */
> +/* { dg-final { cleanup-ipa-dump "cp" } } */
> +/* { dg-final { cleanup-tree-dump "optimized" } } */
> Index: src/gcc/tree.h
> ===================================================================
> --- src.orig/gcc/tree.h
> +++ src/gcc/tree.h
> @@ -2678,7 +2678,9 @@ struct function;
>     nodes, this points to either the FUNCTION_DECL for the containing
>     function, the RECORD_TYPE or UNION_TYPE for the containing type, or
>     NULL_TREE or a TRANSLATION_UNIT_DECL if the given decl has "file
> -    scope".  */
> +    scope".  In particular, for VAR_DECLs which are virtual table pointers
> +    (they have DECL_VIRTUAL set), we use DECL_CONTEXT to determine the type
> +    they belong to.  */
>  #define DECL_CONTEXT(NODE) (DECL_MINIMAL_CHECK (NODE)->decl_minimal.context)
>  #define DECL_FIELD_CONTEXT(NODE) \
>   (FIELD_DECL_CHECK (NODE)->decl_minimal.context)
> Index: src/gcc/testsuite/g++.dg/ipa/devirt-c-7.C
> ===================================================================
> --- /dev/null
> +++ src/gcc/testsuite/g++.dg/ipa/devirt-c-7.C
> @@ -0,0 +1,87 @@
> +/* Verify that ipa-cp will not get confused by placement new constructing an
> +   object within another one when looking for dynamic type change .  */
> +/* { dg-do run } */
> +/* { dg-options "-O3 -Wno-attributes"  } */
> +
> +extern "C" void abort (void);
> +namespace std {
> +  typedef __SIZE_TYPE__ size_t;
> +}
> +inline void* __attribute__ ((always_inline))
> +operator new(std::size_t, void* __p) throw()
> +{
> +  return __p;
> +}
> +
> +class A
> +{
> +public:
> +  char data[256];
> +  A();
> +  virtual int foo (int i);
> +};
> +
> +class B : public A
> +{
> +public:
> +  virtual int foo (int i);
> +};
> +
> +class C
> +{
> +public:
> +  C();
> +  virtual double foo (double i);
> +};
> +
> +int A::foo (int i)
> +{
> +  return i + 1;
> +}
> +
> +int B::foo (int i)
> +{
> +  return i + 2;
> +}
> +
> +double C::foo (double i)
> +{
> +  return i + 3.5;
> +}
> +
> +static int __attribute__ ((noinline)) middleman (class A *obj, int i)
> +{
> +  return obj->foo (i);
> +}
> +
> +int __attribute__ ((noinline,noclone)) get_input(void)
> +{
> +  return 1;
> +}
> +
> +__attribute__ ((always_inline)) C::C ()
> +{
> +}
> +
> +A::A ()
> +{
> +}
> +
> +static  __attribute__ ((noinline)) void bah ()
> +{
> +  class B b;
> +
> +  C *c = new ((void *) &b.data) C;
> +
> +  if (middleman (&b, get_input ()) != 3)
> +    abort ();
> +}
> +
> +int main (int argc, char *argv[])
> +{
> +  int i;
> +
> +  for (i = 0; i < 10; i++)
> +    bah ();
> +  return 0;
> +}
>

Reply via email to