> Hi, > here are some of changes of LTO/IPA done in GCC10. There is also > recursive cloning and some other stuff I will add incrementally as well > as some data on overall compile time/memory use improvements as we > reported in past years. I am still running tests and fixing bugs in this > area. > > Honza Ping...
Honza > > diff --git a/htdocs/gcc-10/changes.html b/htdocs/gcc-10/changes.html > index aca76825..0f0fce18 100644 > --- a/htdocs/gcc-10/changes.html > +++ b/htdocs/gcc-10/changes.html > @@ -50,12 +50,46 @@ a work-in-progress.</p> > <!-- .................................................................. --> > <h2 id="general">General Improvements</h2> > > +<p>The following GCC command line options have been introduced or > improved.</p> > +<ul> > + <li><a > href="https://gcc.gnu.org/onlinedocs/gcc-10.1.0/gcc/Optimize-Options.html#index-fprofile-partial-training"><code>-fprofile-partial-training</code></a> > + can now be used to inform compiler that code paths not covered by the > + train run should not be optimized for size.</li> > +</ul> > <p>The following built-in functions have been introduced.</p> > <ul> > <li><code>__builtin_roundeven</code> for the corresponding function from > ISO/IEC TS 18661. > </li> > </ul> > +<p>A large number of improvements to code generation have been made, > including > + but not limited to the following.</p> > +<ul> > + <li>Inter-procedural optimization improvements: > + <ul> > + <li>Inter-procedural scalar replacement for aggregates (IPA-SRA) pass > was re-implemented to work at link-time. > + </li> > + <li><a > href="https://gcc.gnu.org/onlinedocs/gcc-10.1.0/gcc/Optimize-Options.html#index-finline-functions"><code>-finline-functions</code></a> > + is now enabled at <code>-O2</code> and was retuned for better code size > + versus runtime performance tradeofs. Inliner heuristics was also > + significantly sped up to avoid negativive impact to <code>-flto > + -O2</code> compile times. > + </li> > + <li>Inliner heuristics and function clonning can now use value-range > + information to predict effectivity of individual > transformations.</li> > + <li>Selected <code>--param</code> values can now be specified at > + translation unit granuality. This includes all parameters controlling > + inliner.</li> > + <li>During link-time optimization the C++ One Definition Rule is used to > + increase precision of type based alias analysis.</li> > + </ul> > + </li> > + <li>Profile driven optimization improvements: > + <ul> > + <li>Profile maintenance during compilation was improved and hot/cold > code partitioning improved.</li> > + </ul> > + </li> > +</ul> > > <!-- .................................................................. --> > <h2 id="languages">New Languages and Language specific improvements</h2>