Hi all,
I'm interested in deploying pre-quantized networks, specially with 2-bit and 
4-bit quantization with VTA but I know the usage for the accelerator is still 
restricted to 8bit and only certain models are supported. Nevertheless, the 
tech report and the hardware design state that it's possible to have lower 
integer precision data types. My approach was to generate a custom bitstream 
with the desired data widths, considering always the buffer sizes but I still 
haven't been able to come up with a configuration that works.
As this issue was mentioned in the v0.6 roadmap but wasn't addressed I thought 
I could get some further guidelines in order to bring this enhancement. Can 
anybody help me out?

Thanks in advance





---
[Visit 
Topic](https://discuss.tvm.ai/t/vta-support-for-data-types-lower-than-int8/6946/1)
 to respond.

You are receiving this because you enabled mailing list mode.

To unsubscribe from these emails, [click 
here](https://discuss.tvm.ai/email/unsubscribe/4e3516ad6901054232c614fd683aacfadf4627b9d166e7720c6e7b0aaaa898ea).

Reply via email to