@jnorwood Yes, bias is kept outside as a separate operator. But, this can be fused with the qnn.con2d.
Regarding the accumulation point, if we perform fusion and add the bias in `int32` in the accumulator at the end, is it any different than preloading the accumulator? We need to ensure that op is fused i.e. the bias addition happens in the same accumulator where conv2d has just finished. -- You are receiving this because you are subscribed to this thread. Reply to this email directly or view it on GitHub: https://github.com/dmlc/tvm/issues/3591#issuecomment-514266390