How much speedup does FP32 compared INT8 at rasp4?1.5×?

I saw some speedup conclusion 
[here](https://github.com/tvmai/meetup-slides/tree/master/tvm-meetup-shanghai-Nov-16-2019)
 saying that tvm is about 1.3×(=2.08/1.60)at mobilenet-v2@rasp 3b+AARCH64 than 
QNNPACK.

They reported apparent speedup for both mobilenet-v1 and mobilene-v2:
![image|690x431](upload://oqRljzqKWe45ll979kPI6Z8PeOE.jpeg) 

However,you say qnnpack-int8 is better than tvm-int8 @rasp4,which conclusion is 
more reliable?

If qnnpack is better,than why tvm develop int8 of its own instead of using 
qnnpack?





---
[Visit 
Topic](https://discuss.tvm.ai/t/is-there-any-speed-comparison-of-quantization-on-cpu/6256/19)
 to respond.

You are receiving this because you enabled mailing list mode.

To unsubscribe from these emails, [click 
here](https://discuss.tvm.ai/email/unsubscribe/7459dfeb62fb3b24217c6b013ca003831848260a54a7e5fd0e93b87958d65c55).

Reply via email to