================
@@ -1665,6 +1665,13 @@ Value *CodeGenFunction::EmitAMDGPUBuiltinExpr(unsigned BuiltinID,
     if (AppendFalseForOpselArg)
       Args.push_back(Builder.getFalse());
 
+    if (BuiltinID == AMDGPU::BI__builtin_amdgcn_wmma_i32_16x16x64_iu8) {
+      if (Args.size() == 7)
----------------
0xzre wrote:

Forcing an 8th argument immediately would break downstream code, as @arsenm
pointed out here:
https://github.com/llvm/llvm-project/pull/171069#pullrequestreview-3552446843

Maybe what we need is to make clamp required later in a transition plan 🤔
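
To make the compatibility concern concrete, here is a rough standalone sketch of the "accept 7, fill in the 8th" behaviour. It is not the actual Clang code: `Operand`, `normalizeClampArg`, and the two constants are hypothetical names, plain ints stand in for the `llvm::Value *` operands, and the default of "clamp off" is an assumption about what the legacy form should mean.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the builtin's lowered operands; in Clang these
// would be llvm::Value* entries rather than ints.
using Operand = int;

constexpr std::size_t kArgsWithoutClamp = 7; // legacy call shape
constexpr std::size_t kArgsWithClamp = 8;    // call shape with explicit clamp

// Appends a default clamp operand (0, meaning "false") when the caller used
// the legacy 7-operand form. Returns true if the list ends up with the full
// 8-operand shape, false for any other operand count.
bool normalizeClampArg(std::vector<Operand> &Args) {
  if (Args.size() == kArgsWithoutClamp)
    Args.push_back(0); // assumed default: clamp off
  return Args.size() == kArgsWithClamp;
}
```

With something like this, existing 7-argument callers keep compiling, and a later step of the transition could start diagnosing the short form before it is rejected outright.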

https://github.com/llvm/llvm-project/pull/171069
_______________________________________________
cfe-commits mailing list
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits