Re: [PATCH 2/2] Split edge when edge locus and dest don't match

Richard Biener via Gcc-patches Tue, 11 Oct 2022 04:27:52 -0700

On Tue, Oct 11, 2022 at 12:57 PM Jørgen Kvalsvik
<jorgen.kvalsvik@woven-planet.global> wrote:
>
> On 07/10/2022 13:45, Jørgen Kvalsvik wrote:
> > On 07/10/2022 08:53, Richard Biener wrote:
> >> On Thu, Oct 6, 2022 at 4:28 PM Jørgen Kvalsvik
> >> <jorgen.kvalsvik@woven-planet.global> wrote:
> >>>
> >>> On 06/10/2022 10:12, Richard Biener wrote:
> >>>> On Wed, Oct 5, 2022 at 2:49 PM Martin Liška <mli...@suse.cz> wrote:
> >>>>>
> >>>>> On 10/5/22 14:04, Jørgen Kvalsvik via Gcc-patches wrote:
> >>>>>> Edges with locus are candidates for splitting so that the edge with
> >>>>>> locus is the only edge out of a basic block, except when the locuses
> >>>>>> match. The test checks the last (non-debug) statement in a basic block,
> >>>>>> but this causes an unnecessary split when the edge locus go to a block
> >>>>>> which coincidentally has an unrelated label. Comparing the first
> >>>>>> statement of the destination block is the same check but does not get
> >>>>>> tripped up by labels.
> >>>>>>
> >>>>>> An example with source/edge/dest locus when an edge is split:
> >>>>>>
> >>>>>>       1  int fn (int a, int b, int c) {
> >>>>>>       2        int x = 0;
> >>>>>>       3        if (a && b) {
> >>>>>>       4                x = a;
> >>>>>>       5        } else {
> >>>>>>       6  a_:
> >>>>>>       7            x = (a - b);
> >>>>>>       8        }
> >>>>>>       9
> >>>>>>      10        return x;
> >>>>>>      11  }
> >>>>>>
> >>>>>> block  file  line   col  stmt
> >>>>>>
> >>>>>> src     t.c     3    10  if (a_3(D) != 0)
> >>>>>> edge    t.c     6     1
> >>>>>> dest    t.c     6     1  a_:
> >>>>>>
> >>>>>> src     t.c     3    13  if (b_4(D) != 0)
> >>>>>> edge    t.c     6     1
> >>>>>> dst     t.c     6     1  a_:
> >>>>>>
> >>>>>> With label removed:
> >>>>>>
> >>>>>>       1  int fn (int a, int b, int c) {
> >>>>>>       2        int x = 0;
> >>>>>>       3        if (a && b) {
> >>>>>>       4                x = a;
> >>>>>>       5        } else {
> >>>>>>       6  // a_: <- label removed
> >>>>>>       7            x = (a - b);
> >>>>>>       8        }
> >>>>>>       9
> >>>>>>      10        return x;
> >>>>>>      11
> >>>>>>
> >>>>>> block  file  line   col  stmt
> >>>>>>
> >>>>>> src     t.c     3    10  if (a_3(D) != 0)
> >>>>>> edge  (null)    0     0
> >>>>>> dest    t.c     6     1  a_:
> >>>>>>
> >>>>>> src     t.c     3    13  if (b_4(D) != 0)
> >>>>>> edge  (null)    0     0
> >>>>>> dst     t.c     6     1  a_:
> >>>>>>
> >>>>>> and this is extract from gcov-4b.c which *should* split:
> >>>>>>
> >>>>>>     205  int
> >>>>>>     206  test_switch (int i, int j)
> >>>>>>     207  {
> >>>>>>     208    int result = 0;
> >>>>>>     209
> >>>>>>     210    switch (i)                            /* branch(80 25) */
> >>>>>>     211                                          /* branch(end) */
> >>>>>>     212      {
> >>>>>>     213        case 1:
> >>>>>>     214          result = do_something (2);
> >>>>>>     215          break;
> >>>>>>     216        case 2:
> >>>>>>     217          result = do_something (1024);
> >>>>>>     218          break;
> >>>>>>     219        case 3:
> >>>>>>     220        case 4:
> >>>>>>     221          if (j == 2)                     /* branch(67) */
> >>>>>>     222                                          /* branch(end) */
> >>>>>>     223            return do_something (4);
> >>>>>>     224          result = do_something (8);
> >>>>>>     225          break;
> >>>>>>     226        default:
> >>>>>>     227          result = do_something (32);
> >>>>>>     228          switch_m++;
> >>>>>>     229          break;
> >>>>>>     230      }
> >>>>>>     231    return result;
> >>>>>>     231  }
> >>>>>>
> >>>>>> block  file  line   col  stmt
> >>>>>>
> >>>>>> src    4b.c   214    18  result_18 = do_something (2);
> >>>>>> edge   4b.c   215     9
> >>>>>> dst    4b.c   231    10  _22 = result_3;
> >>>>>>
> >>>>>> src    4b.c   217    18  result_16 = do_something (1024);
> >>>>>> edge   4b.c   218     9
> >>>>>> dst    4b.c   231    10  _22 = result_3;
> >>>>>>
> >>>>>> src    4b.c   224    18  result_12 = do_something (8);
> >>>>>> edge   4b.c   225     9
> >>>>>> dst    4b.c   231    10  _22 = result_3;
> >>>>>>
> >>>>>> Note that the behaviour of comparison is preserved for the (switch) 
> >>>>>> edge
> >>>>>> splitting case. The former case now fails the check (even though
> >>>>>> e->goto_locus is no longer a reserved location) because the dest 
> >>>>>> matches
> >>>>>> the e->locus.
> >>>>>
> >>>>> It's fine, please install it.
> >>>>> I verified tramp3d coverage output is the same as before the patch.
> >>>>>
> >>>>> Martin
> >>>>>
> >>>>>>
> >>>>>> gcc/ChangeLog:
> >>>>>>
> >>>>>>         * profile.cc (branch_prob): Compare edge locus to dest, not 
> >>>>>> src.
> >>>>>> ---
> >>>>>>  gcc/profile.cc | 18 +++++++++---------
> >>>>>>  1 file changed, 9 insertions(+), 9 deletions(-)
> >>>>>>
> >>>>>> diff --git a/gcc/profile.cc b/gcc/profile.cc
> >>>>>> index 96121d60711..c13a79a84c2 100644
> >>>>>> --- a/gcc/profile.cc
> >>>>>> +++ b/gcc/profile.cc
> >>>>>> @@ -1208,17 +1208,17 @@ branch_prob (bool thunk)
> >>>>>>         FOR_EACH_EDGE (e, ei, bb->succs)
> >>>>>>           {
> >>>>>>             gimple_stmt_iterator gsi;
> >>>>>> -           gimple *last = NULL;
> >>>>>> +           gimple *dest = NULL;
> >>>>>>
> >>>>>>             /* It may happen that there are compiler generated 
> >>>>>> statements
> >>>>>>                without a locus at all.  Go through the basic block 
> >>>>>> from the
> >>>>>>                last to the first statement looking for a locus.  */
> >>>>
> >>>> The comment no longer matches the code.>
> >>>>>> -           for (gsi = gsi_last_nondebug_bb (bb);
> >>>>>> +           for (gsi = gsi_start_nondebug_bb (bb);
> >>>>
> >>>> ^^^ and you are looking at the branch block stmts (bb), not the 
> >>>> destination
> >>>> block stmts (e->dest)
> >>>>
> >>>>>>                  !gsi_end_p (gsi);
> >>>>>> -                gsi_prev_nondebug (&gsi))
> >>>>>> +                gsi_next_nondebug (&gsi))
> >>>>>>               {
> >>>>>> -               last = gsi_stmt (gsi);
> >>>>>> -               if (!RESERVED_LOCATION_P (gimple_location (last)))
> >>>>>> +               dest = gsi_stmt (gsi);
> >>>>>> +               if (!RESERVED_LOCATION_P (gimple_location (dest)))
> >>>>>>                   break;
> >>>>>>               }
> >>>>>>
> >>>>>> @@ -1227,14 +1227,14 @@ branch_prob (bool thunk)
> >>>>>>                Don't do that when the locuses match, so
> >>>>>>                if (blah) goto something;
> >>>>>>                is not computed twice.  */
> >>>>>> -           if (last
> >>>>>> -               && gimple_has_location (last)
> >>>>>> +           if (dest
> >>>>>> +               && gimple_has_location (dest)
> >>>>>>                 && !RESERVED_LOCATION_P (e->goto_locus)
> >>>>>>                 && !single_succ_p (bb)
> >>>>>>                 && (LOCATION_FILE (e->goto_locus)
> >>>>>> -                   != LOCATION_FILE (gimple_location (last))
> >>>>>> +                   != LOCATION_FILE (gimple_location (dest))
> >>>>>>                     || (LOCATION_LINE (e->goto_locus)
> >>>>>> -                       != LOCATION_LINE (gimple_location (last)))))
> >>>>>> +                       != LOCATION_LINE (gimple_location (dest)))))
> >>>>
> >>>> this heuristic needs to be in sync with how we insert edge counters
> >>>> which seems to be hidden in the MST compute (and/or edge insertion).
> >>>> I don't see how it's a win to disregard 'last' and only consider 'dest' 
> >>>> here.
> >>>>
> >>>> I think the patch is wrong.  Please revert if you applied it already.
> >>>
> >>> I haven't applied it yet, so unless someone beat me to it there's 
> >>> fortunately
> >>> nothing to revert.
> >>>
> >>>> I don't see how it's a win to disregard 'last' and only consider 'dest' 
> >>>> here.
> >>>
> >>> It might not be other than that it unbreaks my condition profiling by not
> >>> splitting condition edges and apparently doesn't cause a regression (one 
> >>> caught
> >>> by tests anyway). That being said the heuristic may very well be wrong 
> >>> (as is
> >>> the implementation since it considers bb and not e->dest, although I'm 
> >>> sure I
> >>> tested it with e->dest at some point).
> >>>
> >>> I guess the most important question is if the if (a && b) {} {label:} 
> >>> edges
> >>> should be split in the first place. As mentioned in the patch set, the 
> >>> only
> >>> difference in the test suite happens on break in switches. I'll tinker a 
> >>> bit
> >>> more to see if I can figure out what's going on or if the heuristic can
> >>> otherwise be improved.
> >>>
> >>> Then, when does a block with a goto_locus edge have multiple successors? 
> >>> From my
> >>> previous testing it doesn't seem like it's the conditions make a 
> >>> goto_locus, but
> >>> it's more than just plain gotos right? When would it then have multiple
> >>> successors? Switches and exception handling? If that's the case then 
> >>> maybe the
> >>> heuristic can be improved by simply checking the edge type.
> >>
> >> The goto_locus of an edge is usually either the locus of the control stmt 
> >> or the
> >> locus of the stmt the control transfers to.  The most important case is for
> >> 'goto' stmts themselves since those are elided and become edges (thus
> >> 'goto_locus').
> >>
> >> My understanding as of why we split edges at all is that we want to 
> >> instrument
> >> different locations with different counters and since we cannot have 
> >> counters on
> >> edges itself but have to either insert the counter on the source or
> >> the destination
> >> we in some cases have to split the edge to create an insert location
> >> to not falsely
> >> account.  instrument_edges seems to simply use gsi_insert_on_edge which
> >> inserts with the gimple_find_edge_insert_loc logic which doesn't look
> >> at goto_locus
> >> at all.  So the existing heuristic must be fragile as well.
> >>
> >> BUT - as you say, the plain goto shouldn't be subject to edge 
> >> instrumentation.
> >> The interesting case is probably computed goto (which has multiple 
> >> successors)
> >> and from what I can see a branch where we take the goto_locus from the
> >> COND_EXPRs then/else goto stmt which in theory could have different 
> >> locations.
> >>
> >> I don't fully understand your requirement of not splitting edges -
> >> I'll just note that
> >> the place you are patching is not the only place where edges are split (but
> >> the insert location computation only sees those splits).
> >>
> >> Richard.
> >
> > Ok, I think I understand. To move forward I propose this additional test 
> > case,
> > if anything to catch regressions. Naturally, it fails when the split does 
> > not
> > happen because the 'first' label gets incremented twice (I'll probably 
> > rename it
> > pre application, assuming it gets approved) not once.
> >
> > This also means I need to change strategy for condition coverage - either, I
> > must snapshot these splits and make a mapping table for the "original"
> > identities or maybe run the analysis before this splitting happens.
> >
> >
> > diff --git a/gcc/testsuite/gcc.misc-tests/gcov-4.c
> > b/gcc/testsuite/gcc.misc-tests/gcov-4.c
> > index 498d299b66b..b1dc29b573a 100644
> > --- a/gcc/testsuite/gcc.misc-tests/gcov-4.c
> > +++ b/gcc/testsuite/gcc.misc-tests/gcov-4.c
> > @@ -110,6 +110,29 @@ lab2:
> >    return 8;                          /* count(1) */
> >  }
> >
> > +int
> > +test_goto3 (int i, int j)
> > +{
> > +    if (j) goto first;               /* count(1) */
> > +
> > +top:
> > +    if (i)                   /* count(1) */
> > +      {
> > +     i = do_something (i);
> > +      }
> > +    else
> > +      {
> > +first:                               /* count(1) */
> > +     j = do_something (j);   /* count(2) */
> > +     if (j)                  /* count(2) */
> > +       {
> > +         j = 0;              /* count(1) */
> > +         goto top;           /* count(1) */
> > +       }
> > +      }
> > +    return 16;
> > +}
> > +
> >  void
> >  call_goto ()
> >  {
> > @@ -117,6 +140,7 @@ call_goto ()
> >    goto_val += test_goto1 (1);
> >    goto_val += test_goto2 (3);
> >    goto_val += test_goto2 (30);
> > +  goto_val += test_goto3 (0, 1);
> >  }
> >
> >  /* Check nested if-then-else statements. */
> > @@ -260,7 +284,7 @@ main()
> >    call_unref ();
> >    if ((for_val1 != 12)
> >        || (for_val2 != 87)
> > -      || (goto_val != 15)
> > +      || (goto_val != 31)
> >        || (ifelse_val1 != 31)
> >        || (ifelse_val2 != 23)
> >        || (ifelse_val3 != 246)
> >
> > What do you think?
> >
> > Thanks,
> > Jørgen
>
> Hello,
>
> After tinkering a bit more I figured out a patch I could do to merge these
> splits in the condition coverage code, rather than relying on the splits not
> happening. I still think the tests are a good addition, but there's no longer 
> a
> reason for me to change the splitting heuristics.
>
> Should I prepare a separate patch set for the tests?


Yes, enhancing test coverage is always good and should be done
separately.

Thanks a lot,
Richard.

>
> Thanks,
> Jørgen

Re: [PATCH 2/2] Split edge when edge locus and dest don't match

Reply via email to