[quote="janimesh, post:5, topic:3920"] MobileNet models have slowdown because they use Depthwise convolution that has not been configured to use VNNI instructions. [/quote]
This might be the reason why TVM is slower than QNNPACK. See [this post](https://discuss.tvm.ai/t/quantization-story/3920/5?u=kindlehe).