yaxunl added a comment.

One of my concerns is that every kernel is duplicated, which may double the code object size. Should we mark the clone always_inline and have the kernel call its clone, so that the function body is not emitted twice? Or does LLVM already have a pass that does this?

Another concern is that the duplicated non-kernel functions have the actual kernel ABI. I am not sure whether that can cause any issues.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D120566/new/

https://reviews.llvm.org/D120566
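
For illustration, the always_inline idea raised in this comment can be sketched in plain host C++. This is not the code generated by the patch: scale_body, scale_kernel, and scale_twice are hypothetical names, and an ordinary function stands in for the kernel entry point. The only thing it shows is the shape of the proposal, where the body is written once in the clone and the kernel is a thin wrapper around it.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the callable clone: the single source-level copy
// of the body. always_inline is a GCC/Clang attribute that requests inlining
// at every call site.
__attribute__((always_inline)) inline void scale_body(float *data,
                                                      std::size_t n,
                                                      float factor) {
    for (std::size_t i = 0; i < n; ++i)
        data[i] *= factor;
}

// Hypothetical stand-in for the kernel entry point: it only forwards to the
// clone, so the body is not written out a second time.
void scale_kernel(float *data, std::size_t n, float factor) {
    scale_body(data, n, factor);
}

// Another caller reaches the body through the clone rather than through the
// kernel entry point, so it never depends on the kernel calling convention.
void scale_twice(float *data, std::size_t n, float factor) {
    scale_body(data, n, factor);
    scale_body(data, n, factor);
}
```

Whether this actually reduces code object size is an open question: always_inline copies the body into every caller, so the saving depends on how many call sites exist besides the kernel wrapper.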