alexfh added inline comments.
================
Comment at: clang/lib/Serialization/ASTReader.cpp:6343
"Invalid data, missing pragma diagnostic states");
- SourceLocation Loc = ReadSourceLocation(F, Record[Idx++]);
- auto IDAndOffset = SourceMgr.getDecomposedLoc(Loc);
- assert(IDAndOffset.first.isValid() && "invalid FileID for transition");
- assert(IDAndOffset.second == 0 && "not a start location for a FileID");
+ FileID FID = ReadFileID(F, Record, Idx);
+ assert(FID.isValid() && "invalid FileID for transition");
----------------
dexonsmith wrote:
> alexfh wrote:
> > alexfh wrote:
> > > jansvoboda11 wrote:
> > > > alexfh wrote:
> > > > > dexonsmith wrote:
> > > > > > eaeltsin wrote:
> > > > > > > This doesn't work as before, likely because ReadFileID doesn't do
> > > > > > > TranslateSourceLocation.
> > > > > > >
> > > > > > > Our tests fail.
> > > > > > >
> > > > > > > I tried calling TranslateSourceLocation here and the tests passed:
> > > > > > > ```
> > > > > > > SourceLocation Loc = Diag.SourceMgr->getComposedLoc(FID, 0);
> > > > > > > SourceLocation Loc2 = TranslateSourceLocation(F, Loc);
> > > > > > > auto IDAndOffset = SourceMgr.getDecomposedLoc(Loc2);
> > > > > > >
> > > > > > > // Note that we don't need to set up Parent/ParentOffset
> > > > > > > here, because
> > > > > > > // we won't be changing the diagnostic state within
> > > > > > > imported FileIDs
> > > > > > > // (other than perhaps appending to the main source file,
> > > > > > > which has no
> > > > > > > // parent).
> > > > > > > auto &F = Diag.DiagStatesByLoc.Files[IDAndOffset.first];
> > > > > > > ```
> > > > > > >
> > > > > > > Sorry I don't know the codebase, so this fix is definitely ugly
> > > > > > > :) But it shows the problem.
> > > > > > >
> > > > > > I don't think that's the issue, since `ReadFileID()` calls
> > > > > > `TranslateFileID`, which should seems like it should be equivalent.
> > > > > >
> > > > > > However, I notice that the post-increment for `Idx` got dropped!
> > > > > > Can you try replacing the line of code with the following and see
> > > > > > if that fixes your tests (without any extra TranslateSourceLocation
> > > > > > logic)?
> > > > > > ```
> > > > > > lang=c++
> > > > > > FileID FID = ReadFileID(F, Record, Idx++);
> > > > > > ```
> > > > > >
> > > > > > If so, maybe you can contribute that fix with a reduced testcase? I
> > > > > > suggest adding me, @vsapsai, @Bigcheese, and @jansvoboda11 as
> > > > > > reviewers.
> > > > > >
> > > > > > @alexfh, maybe you can check if this fixes your tests as well?
> > > > > >
> > > > > > (If this is the issue, it's a bit surprising we don't have existing
> > > > > > tests covering this case... and embarrassing I missed it when
> > > > > > reviewing initially!)
> > > > > I've noticed the dropped `Idx` post-increment as well, but I went a
> > > > > step further and looked at the `ReadFileID` implementation, which is
> > > > > actually doing a post-increment itself, and accepts `Idx` by
> > > > > reference:
> > > > > ```
> > > > > FileID ReadFileID(ModuleFile &F, const RecordDataImpl &Record,
> > > > > unsigned &Idx) const {
> > > > > return TranslateFileID(F, FileID::get(Record[Idx++]));
> > > > > }
> > > > > ```
> > > > >
> > > > > Thus, it seems to be correct. But what @eaeltsin has found is
> > > > > actually a problem for us. I'm currently trying to make an isolated
> > > > > test case, but it's quite tricky (as header modules are =\). It may
> > > > > be the case that our build setup relies on something clang doesn't
> > > > > explicitly promises, but the fact is that the behavior (as observed
> > > > > by our build setup) has changed. I'll try to revert the commit for
> > > > > now to get us unblocked and provide a test case as soon as I can.
> > > > Thanks for helping out @dexonsmith, we did have the week off.
> > > >
> > > > @eaeltsin @alexfhDone, are you able to provide the failing test case?
> > > > I'm happy to look into it and re-land this with a fix.
> > > I've spent multiple hours trying to extract an observable test case. It
> > > turned out to be too hairy of a yaq to shave: for each compilation a
> > > separate sandboxed environment is created with a separate symlink tree
> > > with just the inputs necessary for that action. Some of the inputs are
> > > prebuilt module files (e.g. for libc++) that are version-locked with the
> > > compiler. So far @jgorbe and I could reduce this to four compilation
> > > steps with their own symlink trees with inputs. While I could figure out
> > > some of the factors that affect reproducibility (for example, symlinks
> > > are important, since making a deep copy of the input directories makes
> > > the issue disappear), it will take a few more hours of concentrated yak
> > > shaving to bring this to a shareable state. I'm not sure I have much more
> > > time to sink into investigating this.
> > >
> > > It seems like examining code based on @eaeltsin's finding may be a more
> > > fruitful path to synthesizing a regression test. Could you try following
> > > that path?
> > One more observation: `-fmodules-embed-all-files` and
> > `-Wno-mismatched-tags` compiler options turned out to be important.
> Maybe @eaeltsin can help, but I don't see any reason to think that testcase
> will be easier. Typically we don't revert without a testcase or at least some
> way to understand the problem and make progress.
>
> (Maybe @jansvoboda11 has ideas for extra instrumentation in the compiler to
> better understand what's going on with your setup?)
I've managed to get rid of the precompiled module files and now I have
something much more observable. It will take some more time to brush it up
though.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D137213/new/
https://reviews.llvm.org/D137213
_______________________________________________
cfe-commits mailing list
[email protected]
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits