To get better performance, you can try auto tuning or auto scheduling You may find these tutorials helpful https://tvm.apache.org/docs/how_to/tune_with_autotvm/tune_relay_cuda.html https://tvm.apache.org/docs/how_to/tune_with_autoscheduler/tune_network_cuda.html?highlight=tune%20relay
--- [Visit Topic](https://discuss.tvm.apache.org/t/using-tvm-quantize-model-is-too-slower-than-not-quantize/11487/2) to respond. You are receiving this because you enabled mailing list mode. To unsubscribe from these emails, [click here](https://discuss.tvm.apache.org/email/unsubscribe/ef0d159449eab0d1c3cbad50738c72732ba3de6c079e3f11383a5ae2dfed34ff).