https://gcc.gnu.org/bugzilla/show_bug.cgi?id=97077

Richard Biener <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Keywords|                            |missed-optimization
     Ever confirmed|0                           |1
             Status|UNCONFIRMED                 |NEW
   Last reconfirmed|                            |2020-09-17

--- Comment #1 from Richard Biener <rguenth at gcc dot gnu.org> ---
This is because the second loop has a load from {0,1,2,3,4} in its body and
thus
appears larger to unroll (we don't estimate those loads to go away - a missed
optimization).

  static const int C.0[5] = {0, 1, 2, 3, 4};
...
  <bb 4> [local count: 894749065]:
  # __for_begin_19 = PHI <__for_begin_10(5), &C.0(7)>
  # prephitmp_3 = PHI <pretmp_15(5), 0(7)>
  # ivtmp_14 = PHI <ivtmp_8(5), 5(7)>
  foo (prephitmp_3);
  __for_begin_10 = __for_begin_19 + 4;
  ivtmp_8 = ivtmp_14 - 1;
  if (ivtmp_8 == 0)
    goto <bb 6>; [20.00%]
  else
    goto <bb 5>; [80.00%]

  <bb 5> [local count: 715756304]:
  pretmp_15 = MEM[(const int *)__for_begin_19 + 4B];
  goto <bb 4>; [100.00%]


Estimating sizes for loop 2
 BB: 4, after_exit: 0
  size:   2 foo (prephitmp_3);
  size:   1 __for_begin_10 = __for_begin_19 + 4;
  size:   1 ivtmp_8 = ivtmp_14 - 1;
   Induction variable computation will be folded away.
  size:   2 if (ivtmp_8 == 0)
   Exit condition will be eliminated in peeled copies.
   Exit condition will be eliminated in last copy.
   Constant conditional.
 BB: 5, after_exit: 1
  size:   1 pretmp_15 = MEM[(const int *)__for_begin_19 + 4B];
size: 7-3, last_iteration: 6-3
  Loop size: 7
  Estimated size after unrolling: 12
Not unrolling loop 2: size would grow.
Not unrolling loop 2: contains call and code would grow.

at some point I had patches to improve this but they had negative ripple-down
effects so I reverted them.

Reply via email to