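The answer is that we use the CUDA driver API to launch kernels from C++ code. The `kernel<<<griddim, blockdim>>>(a, b, c)` syntax is not the only way to launch a kernel, and it requires compiling with NVCC. Driver API calls, by contrast, are plain C functions, so the host code can be built with an ordinary C++ compiler.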
See the launch code in `cuda_module.cc`: https://github.com/apache/incubator-tvm/blob/e0122c0ea68043372220e4e02b81692c34832227/src/runtime/cuda/cuda_module.cc#L189

There is also a longer explanation, "a life of vector add", that follows the path from the Python definition to the CUDA kernel launch: https://docs.tvm.ai/dev/codebase_walkthrough.html#vector-add-example