[quote="electriclilies, post:21, topic:9775, full:true"]
@mikeseven
Yes, the goal is to create a fully quantized graph, and we do recognize that 
this transformation will change the output of the graph. For this reason, we're 
not going to present the rewrite as a Relay pass. And I definitely agree that 
we should let there be user-defined handling.

Also, we definitely have been thinking about simulating accumulation in affine 
space. For int8 input datatypes with int32 accumulation, simulating int32 
accumulation is probably not super important since there's a low likelihood of 
overflow. Therefore we're hoping to deal with it in the multi-dtype extension. 
One option for doing this is creating another simulated QNN op that simulates 
overflow for a given dtype.
[/quote]

Thanks Lily. Agree ;-)





---
[Visit 
Topic](https://discuss.tvm.apache.org/t/rfc-quantization-a-new-quantization-framework-in-tvm-initial-rfc-1-4/9775/23)
 to respond.

You are receiving this because you enabled mailing list mode.

To unsubscribe from these emails, [click 
here](https://discuss.tvm.apache.org/email/unsubscribe/b99563bb4fc2943481843c8a89db6f7aeaffd99e02be7cd74995f9523f0b0ae4).

Reply via email to