My guess is that TVM stops quantizing after the global average pooling to 
preserve accuracy.

In modern CNNs, the global average pooling is usually followed by the classifier 
(a dense layer). To preserve accuracy, that computation is performed in 32-bit 
precision (instead of 8-bit).
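
To make the accuracy concern concrete, here is a minimal pure-Python sketch that runs the same dense layer once in floating point and once with 8-bit inputs and weights. This is not TVM's implementation: the helpers `quantize`, `dense`, and `dense_int8` are hypothetical names, the data is random, the quantization scheme (symmetric, per-tensor) is an assumption, and Python's 64-bit floats stand in for the 32-bit path.

```python
import random

QMAX = 127  # largest magnitude of a signed 8-bit code in a symmetric scheme


def quantize(values, scale=None):
    """Symmetric per-tensor quantization: returns integer codes and the scale used."""
    if scale is None:
        max_abs = max(abs(v) for v in values)
        scale = max_abs / QMAX if max_abs > 0 else 1.0
    codes = [max(-QMAX, min(QMAX, round(v / scale))) for v in values]
    return codes, scale


def dense(x, weights):
    """Floating-point dense layer: one dot product per output unit."""
    return [sum(xi * wi for xi, wi in zip(x, row)) for row in weights]


def dense_int8(x, weights):
    """Same layer with 8-bit codes, integer accumulation, then rescaling to float."""
    qx, sx = quantize(x)
    # One shared scale for the whole weight matrix (per-tensor assumption).
    _, sw = quantize([w for row in weights for w in row])
    qw = [quantize(row, sw)[0] for row in weights]
    return [sum(a * b for a, b in zip(qx, row)) * sx * sw for row in qw]


rng = random.Random(0)
# Hypothetical pooled features (output of global average pooling) and classifier weights.
features = [rng.uniform(0.0, 1.0) for _ in range(64)]
weights = [[rng.uniform(-1.0, 1.0) for _ in range(64)] for _ in range(10)]

logits_fp = dense(features, weights)
logits_q = dense_int8(features, weights)
max_err = max(abs(a - b) for a, b in zip(logits_fp, logits_q))
print("max abs logit error introduced by 8-bit quantization:", max_err)
```

The printed error is what the classifier would see on its logits. When two classes have close scores, an error of that size can be enough to change the predicted class, which would explain keeping this last layer at higher precision.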




