On Sun, May 5, 2019 at 2:02 PM Kewen.Lin <li...@linux.ibm.com> wrote: > > on 2019/5/5 下午12:04, Bin.Cheng wrote: > > On Sun, May 5, 2019 at 11:23 AM Kewen.Lin <li...@linux.ibm.com> wrote: > >>>> + /* Some compare iv_use is probably useless once the doloop > >>>> optimization > >>>> + performs. */ > >>>> + if (tailor_cmp_p) > >>>> + tailor_cmp_uses (data); > >>> Function tailor_cmp_uses sets iv_use->zero_cost_p under some > >>> conditions. Once the flag is set, though the iv_use participates cost > >>> computation in determine_group_iv_cost_cond, the result cost is > >>> hard-wired to ZERO (which means cost computation for such iv_use is > >>> waste of time). > >> > >> Yes, it can be improved by some early check and return. > >> But it's still helpful to make it call with may_eliminate_iv. > >> gcc.dg/no-strict-overflow-6.c is one example, with may_eliminate_iv > >> consideration it exposes more opportunities for downstream optimization. > > Hmm, I wasn't suggesting early check and return, on the contrary, we > > can better integrate doloop/cost stuff in the overall model. See more > > in following comments. > > Sorry, I didn't claim it clearly, the previous comment is to claim the > call with may_eliminate_iv is not completely "waste of time", and early > return can make it save time. :) > > And yes, it's not an issue any more with your proposed idea. > > >> > >>> Also iv_use rewriting process is skipped for related > >>> ivs preserved explicitly by preserve_ivs_for_use. > >>> Note IVOPTs already adds candidate for original ivs. So why not > >>> detecting doloop iv_use, adjust its cost with the corresponding > >>> original iv candidate, then let the general cost model make the > >>> decision? I believe this better fits existing infrastructure and > >>> requires less change, for example, preserve_ivs_for_use won't be > >>> necessary. > >> I agree adjusting the cost of original iv candidate of the iv_use > >> requires less change, but on one hand, I thought to remove interest > >> cmp iv use or make it zero cost is close to the fact. Since it's > >> eliminated eventually in doloop optimization, it should not > >> considered in cost modeling. This way looks more exact. > > Whether doloop transformation should be considered or be bypassed in > > cost model isn't a problem. Actually we can bind doloop iv_cand to > > cmp iv_use in order to force the transformation. My concern is the > > patch specially handles doloop by setting the special flag, then > > checking it. We generally avoid such special-case handling since it > > hurts long time maintenance. The pass was very unstable in the pass > > because of such issues. > > > > OK, I understand your concerns now. Thanks for explanation! > > >> One the other hand, I assumed your suggestion is to increase the > >> cost for the pair (original iv cand, cmp iv use), the increase cost > >> seems to be a heuristic value? It seems it's possible to sacrifice > > Decrease the cost so that the iv_cand is preferred? The comment > > wasn't made on top of implementing doloop in ivopts. Anyway, you can > > still bypass cost model by binding the "correct" iv_cand to cmp > > iv_use. > > > > To decrease the cost isn't workable for this case, it make the original > iv cand is preferred more and over the other iv cand for memory iv use, > then the desirable memory based iv cand won't be chosen. > If increase the cost, one basic iv cand is chosen for cmp use, memory > based iv cand is chosen for memory use, instead of original iv for both. Sorry for the mistake, I meant to decrease use cost of whatever "correct" iv_cand for cmp iv_use that could enable doloop optimization, it doesn't has to the original iv_cand.
> > Could you help to explain the "binding" more? Does it mean cost modeling > decision making can simply bypass the cmp iv_use (we have artificially > assigned one "correct" cand iv to it) and also doesn't count the "correct" > iv cand cost into total iv cost? Is my understanding correct? For example, if the heuristic finds out the "correct" doloop iv_cand, we can force choosing that iv_cand for cmp iv_use and bypass the candidate choosing algorithm: struct iv_group { //... struct iv_cand *bind_cand; }; then set this bind_cand directly in struct iv_ca::cand_for_group. As a result, iv_use rewriting code takes care of everything, no special handling (such as preserve_ivs_for_use) is needed. Whether letting cost model decide the "correct" iv_cand or bind it by yourself depends on how good your heuristic check is. It's your call. :) > > >>> tuning; 2) the doloop decision can still be canceled by cost model if > >>> it's really not beneficial. With current patch, it can't be undo once > >>> the decision is made (at very early stage in IVOPTs.). > >> > >> I can't really follow this. If it's predicted to be transformed to doloop, > >> I think it should not be undoed any more, since it's useless to consider > >> this cmp iv use. Whatever IVOPTS does, the comp at loop closing should not > >> be changed (although possible to use other iv), right? Do I miss > >> something? > > As mentioned, the previous comment wasn't made on top of implementing > > doloop in ivopts. That would be nice but a different story. > > Before we can do that, it'd better be conservative and only makes > > (doloop) decision in ivopts when you are sure. As you mentioned, it's > > hard to do the same checks at gimple as RTL, right? In this case, > > making it a (conservative) heuristic capturing certain beneficial > > cases sounds better than capturing all cases but fail in later RTL > > passes. > > > > Yes, I agree we should be conservative. But it's hard to determine which is > better in practice, even for capturing all cases, we are still trying our best > to be more conservative, excluding any suspicious factor which is possible to > make it fail in later RTL checking, one example is that the patch won't > predict > it can be doloop once finding switch statement. It depends on how much "bad" > cases we don't catch and how serious its impact is and whether easy to > improve. Sure, I don't know ppc so have all the trust in your decision here. Thanks for your patience. Thanks, bin