jhuber6 wrote:

> Faux "generic" IR sounds like a problematic concept, do you have an example?

It's what `libc` and the ROCm DeviceLibs do, compile without `-mcpu` and don't 
use any target specific attributes or intrinsics, then link it into a TU later 
when the target is known. It's find in principle if you hold it right, but the 
wavefrontsize is the one sticking issue, hence why Matt would suggest having 
two builds of `libc`, one for `amdgcn-amd-amdhsa-wave32` and 
`amdgcn-amd-amdhsa-wave64` or something.


https://github.com/llvm/llvm-project/pull/114481
_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

Reply via email to