yaxunl added a comment.

In https://reviews.llvm.org/D49083#1157568, @tra wrote:

> > HIP generates one fat binary for all devices after linking. However, for 
> > each compilation
> >  unit a ctor function is emitted which register the same fat binary. 
> >  Measures need to be taken to make sure the fat binary is only registered 
> > once.
>
> Are you saying that for HIP there's only one fatbin file with GPU code for 
> the complete host executable, even if it consists of multiple HIP TUs?


By 'TU' do you mean 'target unit'?

For HIP there is only one fatbin file with GPU code for the complete host 
executable even if there are mulitple GPU sub-targets. Device code for 
different sub-targets are bundled together by clang-offload-bundler as one 
fatbin. Runtime will extract device code for different sub-targets.


https://reviews.llvm.org/D49083



_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

Reply via email to