@srkreddy1238 @yzhliu Thanks for the comments! If you all agree, I will switch the TFLite frontend from NCHW to NHWC.
@yzhliu Yes, the quantization support has not been upstreamed yet. It involves many changes, and I plan to upstream it in the 0.6 dev cycle.

My original plan was to keep the TFLite frontend on NCHW, because we could then leverage the existing NCHW auto-tuning and see how the quantized model performs. In our initial work, the quantized MobileNet V1 runs about 30% faster than FP32 using spatial pack. We also found that this appears to be the limit for the quantized model with spatial pack; to do better, we could tensorize `q_conv2d`.

However, if we change the layout from NCHW to NHWC, there is additional work to do, for example auto-tuning support for NHWC (including conv2d and depthwise convolution). Alright, I can start on this first: support TFLite NHWC and upstream it before the quantization part, since this work is much easier than the quantization part.