Re: [dmlc/tvm] [QNN] [RFC] QNN Dialect -- Prequantize Models (#3591)

Animesh Jain Tue, 23 Jul 2019 08:42:17 -0700

@jnorwood Yes, bias is kept outside as a separate operator. But, this can be 
fused with the qnn.con2d.


Regarding the accumulation point, if we perform fusion and add the bias in 
`int32` in the accumulator at the end, is it any different than preloading the 
accumulator? We need to ensure that op is fused i.e. the bias addition happens 
in the same accumulator where conv2d has just finished.

-- 
You are receiving this because you are subscribed to this thread.
Reply to this email directly or view it on GitHub:
https://github.com/dmlc/tvm/issues/3591#issuecomment-514266390

Re: [dmlc/tvm] [QNN] [RFC] QNN Dialect -- Prequantize Models (#3591)

Reply via email to