Thanks @jackwish for confirming the python lowering looks good.

For max pooling, we used casting, because we have to subtract the zero point 
from the quantized tensor. That subtract needs to happen in higher precision 
than (u)int8. Correct me if I am wrong.

-- 
You are receiving this because you are subscribed to this thread.
Reply to this email directly or view it on GitHub:
https://github.com/dmlc/tvm/issues/3617#issuecomment-517521048

Reply via email to