[TVM Discuss] [Questions] How CUDA kernel is launched in TVM stack

Wei Sun via TVM Discuss Thu, 02 Apr 2020 02:28:29 -0700

Hi:


I am investigating the capability of TVM primitives (CUDA backend). I take 
CUTLASS as a baseline of highly-optimized CUDA library. 

I think most of optimization techniques used in CUTLASS like tiling, shared_mem 
management are supported by TVM primitives. 

Streaming is also an important optimization technique I think, but I did not 
find this property in TVM (python frond-end ). So I am wondering how can we use 
streaming in TVM stack. I think streaming is an important property for CUDA 
backend.





---
[Visit 
Topic](https://discuss.tvm.ai/t/how-cuda-kernel-is-launched-in-tvm-stack/6167/8)
 to respond.

You are receiving this because you enabled mailing list mode.

To unsubscribe from these emails, [click 
here](https://discuss.tvm.ai/email/unsubscribe/1550681d02fe08f5b7844ada341c66ef0c546bcd874566ef84175c1ca2ceced4).

[TVM Discuss] [Questions] How CUDA kernel is launched in TVM stack

Reply via email to