lucas-rami wrote:

> IIUC that is because the flat workgroup size. Waves per EU must yield to the 
> value computed from flat workgroup size, and if it is absent, we must assume 
> it can be 1024.

AFAIU the intent of the existing implementation is that the default minimum 
waves/EU is set so that all the waves of a workgroup of maximum size can fit 
concurrently on a single CU. I am not proposing we change that. What I would 
like is for the "amdgpu-waves-per-eu" attribute to be able to lower that 
minimum at the user's request, in cases where higher occupancy is not expected 
to be beneficial.
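
To make the default concrete, here is a rough sketch of how I read it. The 
helper names, the wave size of 64, and the 4 EUs per CU are assumptions for 
illustration only, not the actual LLVM functions or the values for any 
particular target.

```cpp
// Hypothetical helpers for illustration; not the actual LLVM API.
constexpr unsigned divideCeil(unsigned Num, unsigned Den) {
  return (Num + Den - 1) / Den;
}

// Minimum waves/EU such that every wave of one workgroup of the given flat
// size can be resident on a single CU at the same time.
constexpr unsigned defaultMinWavesPerEU(unsigned FlatWorkGroupSize,
                                        unsigned WavefrontSize,
                                        unsigned EUsPerCU) {
  unsigned WavesPerWorkGroup = divideCeil(FlatWorkGroupSize, WavefrontSize);
  return divideCeil(WavesPerWorkGroup, EUsPerCU);
}

// Sketch of the requested behavior (an assumption about the semantics):
// an explicit user request wins even when it is below the default minimum.
// RequestedMin == 0 stands for "attribute not present".
constexpr unsigned effectiveMinWavesPerEU(unsigned DefaultMin,
                                          unsigned RequestedMin) {
  return RequestedMin != 0 ? RequestedMin : DefaultMin;
}
```

Under those assumed numbers, a missing flat workgroup size (so 1024) gives 
16 waves per workgroup and a default minimum of 4 waves/EU. A user asking for 
1 would get 1 rather than being clamped back up to 4.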

https://github.com/llvm/llvm-project/pull/138284
_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits
