On Thu, 19 Dec 2019 at 12:42, Richard Earnshaw (lists) <richard.earns...@arm.com> wrote: > > On 19/12/2019 12:35, Jonathan Wakely wrote: > > On Thu, 19 Dec 2019 at 12:33, Richard Earnshaw (lists) > > <richard.earns...@arm.com> wrote: > >> > >> On 19/12/2019 12:23, Jonathan Wakely wrote: > >>> On Thu, 19 Dec 2019 at 11:50, Richard Earnshaw (lists) > >>> <richard.earns...@arm.com> wrote: > >>>> > >>>> On 19/12/2019 09:27, Jonathan Wakely wrote: > >>>>> On Thu, 19 Dec 2019 at 00:02, Joseph Myers <jos...@codesourcery.com> > >>>>> wrote: > >>>>>> > >>>>>> On Wed, 18 Dec 2019, Joseph Myers wrote: > >>>>>> > >>>>>>> On Mon, 18 Nov 2019, Richard Earnshaw (lists) wrote: > >>>>>>> > >>>>>>>> I've attached a sample from the start of the fixed list - the full > >>>>>>>> list is far > >>>>>>>> too big to post to give a flavour of how the script currently works. > >>>>>>>> Note > >>>>>>>> that annotations of the form [checkme: ....] in the summary are for > >>>>>>>> diagnostic > >>>>>>>> purposes. These are where heuristics suggest that there's a higher > >>>>>>>> than > >>>>>>>> normal chance that the PR number is incorrect and that manual > >>>>>>>> auditing is > >>>>>>>> recommended. Such annotations would not be appropriate in the final > >>>>>>>> conversion. > >>>>>>> > >>>>>>> Concretely, here is the current list of 664 checkme: annotations where > >>>>>>> something was suspicious about the PR number (either component > >>>>>>> mismatch or > >>>>>>> resolved as INVALID). Would some people like to volunteer to pick up > >>>>>>> sections of this list and, for their section, produce a list of SVN > >>>>>>> revisions (at the end of the checkme line) for which the PR number > >>>>>>> appears > >>>>>>> to be correct, and a list of mappings from SVN revision to correct PR > >>>>>>> number when the PR number appears to be wrong? For any that don't get > >>>>>>> reviewed like that we can easily make the script, for the final > >>>>>>> conversion, decline to add a new summary line for any commit where > >>>>>>> the PR > >>>>>>> number is suspicious. > >>>>>> > >>>>>> Here's a slightly shorter version with 644 checkme: annotations, after > >>>>>> adding a few more component aliases to the script (e.g., no longer > >>>>>> considering it suspicious if the log message says PR g++/something and > >>>>>> that PR is in the component that's actually called c++). > >>>>> > >>>>> Line 18: c++ SVN r116634, looks suspicious, but PR number is correct. > >>>>> Line 326: lto SVN r196613, PR number is correct > >>>>> Line 411: libstdc++ SVN r219147, PR number is correct > >>>>> > >>>>> > >>>>> How do you want the mapping from SVN revision to correct PR to be > >>>>> expressed? > >>>>> > >>>>> Line 19: the correct PR for fortran SVN r120056 is fortran/30238 (not > >>>>> 39238) > >>>>> Line 608: lto SVN r268728 should be PR 87089 (not 87809) > >>>>> Line 616: lto SVN r269799 should be PR 87089 (not 87809) > >>>>> > >>>> > >>>> Best of all would be a pull request on > >>>> https://gitlab.com/esr/gcc-conversion/tree/master to update bugdb.py > >>>> directly. > >>>> > >>>> Second best would be something like > >>>> > >>>> whitelist: > >>>> > >>>> "<svn-revnumber>", "<svn-revnumber>", > >>>> > >>>> etc, where svn-revnumber is the revision number in svn as reported in > >>>> the checkme above but without the leading 'r' > >>>> > >>>> and > >>>> > >>>> Change: > >>>> > >>>> "<svn-revnumber>": {"PR": "<correct-bugid>"}, > >>>> .... > >>>> > >>>> where svn-revnumber is as before, and <correct-bugid> is the the PR > >>>> number that should have been used. > >>>> > >>>> The above can then be pasted quickly into the script to update it. > >>>> > >>>> R. > >>> > >>> Thanks. I'm working through the first 100 lines in the file then. > >>> > >>> If nobody else starts, I'll take the next 100, and so on. > >>> > >> > >> Great, thanks. > >> > >> It might be useful if someone could start from the other end. The later > >> numbers will be most recent and the ones which are more likely to be > >> relevant to users today. > > > > And less likely to refer to egcs bugs and/or egcs patches from 1997 > > which are harder to find in our mailing lists archives! > > > > Since nobody else has volunteered yet, maybe I should just start at > > the end instead. All I've managed so far is to decide that line 1 is > > too hard to track down and should get a {"summary":"..."} fixup > > instead. > > > > So I'll start with the LAST 100 lines. > > > > Another approach might be to do a quick triage and cull out (whitelist) > the ones that are "obviously correct". You can often tell by reading > the summary itself that it really does correspond to the commit. Then > we'd be left with a shorter list where things really do need further > digging. In the worst case we can just do as Joseph suggests and > implement a policy ignore for those in the final conversion (I already > do that for PR numbers <1000 since there was more than one gnats DB at > that time and the PR numbers just do not line up reliably enough). > > R.
It might be reasonable to assume rtl-optimization and tree-optimization are aliases, and not treat it as suspicious if those two appear mixed up. And anything where bugzilla has component debug or lto and the commit is tree-optimization is probably OK too (that seems to be the case for several so far). We might want to change the component in bugzilla for these: 92324 from c to tree-optimization 91763 from go to lto 91772 from lto to debug 91137 from rtl-optimization to tree-optimization 91445 from c++ to tree-optimization 90577 from middle-end to fortran 90716 from debug to tree-optimization 90273 from debug to tree-optimization 90194 from debug to middle-end