Assuming you're looking for the low-level code, you can find the CUDA cast generator in `tvm/src/target/source/codegen_cuda`, in the `VisitExpr(const CastNode* op, std::ostream& os)` function. However, you probably want to do the casting ahead of time in Relay rather than on the device. If you use a pass like the one I posted above, the operations in the graph are converted before they're compiled.
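To make the "convert the graph before compiling" idea concrete, here is a rough toy model in plain Python. It is not TVM's API: `Var`, `Cast`, `Call`, and `downcast` are hypothetical names made up for illustration, and a real Relay pass would walk Relay expressions instead. The sketch only shows the shape of the rewrite, where float32 inputs get wrapped in a cast node up front so the code generator never has to decide about them.

```python
from dataclasses import dataclass
from typing import Tuple, Union


# Hypothetical mini-IR, standing in for a real graph representation.
@dataclass(frozen=True)
class Var:
    name: str
    dtype: str


@dataclass(frozen=True)
class Cast:
    value: "Expr"
    dtype: str


@dataclass(frozen=True)
class Call:
    op: str
    args: Tuple["Expr", ...]


Expr = Union[Var, Cast, Call]


def downcast(expr: Expr, dtype: str = "float16") -> Expr:
    """Return a rewritten copy of expr with every float32 Var wrapped in a Cast."""
    if isinstance(expr, Var):
        # Only float32 leaves are converted; other dtypes are left alone.
        return Cast(expr, dtype) if expr.dtype == "float32" else expr
    if isinstance(expr, Cast):
        # Keep an existing cast's target dtype, but rewrite what it wraps.
        return Cast(downcast(expr.value, dtype), expr.dtype)
    if isinstance(expr, Call):
        # Rebuild the call with rewritten arguments.
        return Call(expr.op, tuple(downcast(a, dtype) for a in expr.args))
    raise TypeError(f"unsupported node: {type(expr).__name__}")


# Build a small graph, then convert it before any "compilation" step.
graph = Call("add", (Var("x", "float32"), Call("relu", (Var("i", "int32"),))))
converted = downcast(graph)
```

After the rewrite, `x` sits under a `Cast` to `float16` while the `int32` input is unchanged, so the casts are already part of the graph by the time it reaches code generation.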