My guess is that tvm stops quantizing after the global average pooling for accuracy purposes.
Usually in modern CNN after the global average pooling, you have the classifier (dense layer). In order to preserve accuracy the computation will be performed on 32 bit (instead of 8bit) --- [Visit Topic](https://discuss.tvm.apache.org/t/why-stop-quantize-after-first-nn-global-avg-pool2d/8225/3) to respond. You are receiving this because you enabled mailing list mode. To unsubscribe from these emails, [click here](https://discuss.tvm.apache.org/email/unsubscribe/299c435bc3a55f02d23494d5a58150a878b1feb58c60350d95b3a529f683d881).