The answer is we use CUDA driver API to launch kernels from C++ code. 
```kernel<<<griddim,blockdim>>>(a,b,c)``` is not the only way to launch kernel 
and it requires compiling with NVCC.


See 
https://github.com/apache/incubator-tvm/blob/e0122c0ea68043372220e4e02b81692c34832227/src/runtime/cuda/cuda_module.cc#L189

There is a longer explanation of "a life of vector add" from python definition 
to cuda kernel launch here

 https://docs.tvm.ai/dev/codebase_walkthrough.html#vector-add-example





---
[Visit 
Topic](https://discuss.tvm.ai/t/how-cuda-kernel-is-launched-in-tvm-stack/6167/3)
 to respond.

You are receiving this because you enabled mailing list mode.

To unsubscribe from these emails, [click 
here](https://discuss.tvm.ai/email/unsubscribe/a4dbe57d33e2287327c4f6ef317b4688979aa3a2c4dd0965d391072a571f3fe3).

Reply via email to