Assuming you're looking for the low level code, you can find the cuda cast 
generator in `tvm/src/target/source/ codegen_cuda` in the `VisitExpr(const 
CastNode* op, std::ostream& os)` function. However, you probably want to do the 
casting ahead of time in relay rather than on device. If you use a pass like 
the one I posted above, you convert the operations int he graph before they're 
compiled.





---
[Visit Topic](https://discuss.tvm.ai/t/cuda-fp16-example/6190/9) to respond.

You are receiving this because you enabled mailing list mode.

To unsubscribe from these emails, [click 
here](https://discuss.tvm.ai/email/unsubscribe/d2d53b5ee42c5b9ac9724b4601043ab627b6b4646d131ea882343ec40805a3cf).

Reply via email to