How much speedup does INT8 give compared with FP32 on the Raspberry Pi 4? About 1.5×?
I saw some speedup results [here](https://github.com/tvmai/meetup-slides/tree/master/tvm-meetup-shanghai-Nov-16-2019) showing that TVM is about 1.3× (= 2.08/1.60) faster than QNNPACK for MobileNet-v2 on a Raspberry Pi 3B+ (AArch64). They reported a clear speedup for both MobileNet-v1 and MobileNet-v2.

However, you say QNNPACK INT8 is better than TVM INT8 on the Raspberry Pi 4. Which conclusion is more reliable? If QNNPACK is better, then why does TVM develop its own INT8 implementation instead of using QNNPACK?