JonChesterfield added a comment.

Oh, right. Nvptx is still lowering to machine code per-tu. We don't want the 
devicertl linking as machine code, so it has to go in per-tu. Or we could link 
nvptx as IR instead, and send that plus amdgpu down the same code path. 
Probably makes applications faster and compiles slower. Which sort of brings us 
to this patch with a different default.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D117048/new/

https://reviews.llvm.org/D117048

_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

Reply via email to