[TVM Discuss] [Questions] Is there any speed comparison of quantization on cpu

kindlehe via TVM Discuss Thu, 09 Apr 2020 18:39:46 -0700


How much speedup does FP32 compared INT8 at rasp4？1.5×？


I saw some speedup conclusion 
[here](https://github.com/tvmai/meetup-slides/tree/master/tvm-meetup-shanghai-Nov-16-2019)
 saying that tvm is about 1.3×（=2.08/1.60）at mobilenet-v2@rasp 3b+AARCH64 than 
QNNPACK.

They reported apparent speedup for both mobilenet-v1 and mobilene-v2：
![image|690x431](upload://oqRljzqKWe45ll979kPI6Z8PeOE.jpeg) 

However，you say qnnpack-int8 is better than tvm-int8 @rasp4，which conclusion is 
more reliable？

If qnnpack is better，than why tvm develop int8 of its own instead of using 
qnnpack？





---
[Visit 
Topic](https://discuss.tvm.ai/t/is-there-any-speed-comparison-of-quantization-on-cpu/6256/19)
 to respond.

You are receiving this because you enabled mailing list mode.

To unsubscribe from these emails, [click 
here](https://discuss.tvm.ai/email/unsubscribe/7459dfeb62fb3b24217c6b013ca003831848260a54a7e5fd0e93b87958d65c55).

[TVM Discuss] [Questions] Is there any speed comparison of quantization on cpu

Reply via email to