> > > It appears to me this would let them simulate smaller than 8 bit > > > quantizations. > > > > > > If _simulating 8 smaller bit_ is the case, 8 bit should be able to hold > > activation min/max value. > > 8 bits could hold. But what the value output_min / output_max is ? I think > @jnorwood want to express this point. Because we can not just simply use > `out_dtype` to decide what the value range is. But if we insert `clip` op in > frontend, I think it also could handle. Need some logic to calculate the min > / max. see my next comment.
I was saying the *It appears to me this would let them simulate smaller than 8 bit quantizations* reasoning could be somehow not the only possibility. -- You are receiving this because you are subscribed to this thread. Reply to this email directly or view it on GitHub: https://github.com/dmlc/tvm/issues/2351#issuecomment-502508779