@anijain2305 I understand your point, and I agree we should keep the API
minimal. However, whichever way we go, q_conv2d's int32 output has to be
clamped into the uint8 range. If you don't pass min / max, you still need to do
`output = std::max(output, 0)` and `output = std::min(output, 255)` before
returning the output. So why not set the defaults to output_min = 0 /
output_max = 255 and make the computation `output = std::max(output,
output_min)` and `output = std::min(output, output_max)`, which would suit
tflite / mxnet / qnnpack and so on? API design is very important, so we should
consider as many cases as we can (tflite / mxnet, and other libraries as well;
qnnpack is a very high-performance library on ARM CPUs, so in my opinion we
cannot avoid discussing it). Otherwise we will have to resort to tricky
workarounds later. This is the point I wanted to make earlier.
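
To make the proposal concrete, here is a minimal sketch of the clamp with defaulted bounds. The helper name `clamp_output` and its signature are hypothetical, chosen only for illustration; this is not an existing TVM, tflite, or qnnpack function.

```cpp
#include <algorithm>
#include <cstdint>

// Hypothetical helper for illustration only (not an existing TVM API).
// Clamps an int32 accumulator into [output_min, output_max] and narrows it
// to uint8. The defaults 0 / 255 reproduce plain uint8 saturation.
inline std::uint8_t clamp_output(std::int32_t output,
                                 std::int32_t output_min = 0,
                                 std::int32_t output_max = 255) {
  output = std::max(output, output_min);  // lower bound
  output = std::min(output, output_max);  // upper bound
  return static_cast<std::uint8_t>(output);
}
```

With the defaults, `clamp_output(300)` returns 255 and `clamp_output(-5)` returns 0. A caller that needs a narrower range can pass its own bounds, for example `clamp_output(200, 0, 128)` returns 128, so the same code path covers both cases.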
