Hi @zpu, here is a related discussion: [Quantized models are slower than float models on GPUs - Questions - Apache TVM Discuss](https://discuss.tvm.apache.org/t/quantized-models-are-slower-than-float-models-on-gpus/15271/3)