Re: [dmlc/tvm] [RFC] Frontend layout transformation (#2519)

Zhao Wu Thu, 18 Apr 2019 19:21:34 -0700

@srkreddy1238 @yzhliu Thanks for the comments!

If you all agree, I will switch the TFLite frontend from NCHW to NHWC.


@yzhliu Yes, the quantization support has not been upstreamed yet. It involves 
many changes, and I plan to upstream it during the 0.6 dev cycle. My original 
plan for TFLite was based on NCHW because we could leverage the existing NCHW 
auto-tuning and measure the performance of the quantized model. Our initial 
result is 30% faster than FP32 on MobileNet V1 using spatial pack. We also 
found that this is the limit for the quantized model with that approach; to 
go further, we could tensorize `q_conv2d`. 
However, if we change the layout from NCHW to NHWC, there is additional work 
to do, for example auto-tuning support for NHWC (including conv2d and 
depthwise convolution). Alright, I can do this work first: support TFLite 
NHWC and upstream it before the quantization part, because it is much easier 
than the quantization part.
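
For reference, here is a minimal sketch of what the NCHW to NHWC change means at the data level. It uses plain NumPy only, not any TVM API, and the helper names `nchw_to_nhwc` and `nhwc_to_nchw` are hypothetical names chosen for illustration.

```python
import numpy as np


def nchw_to_nhwc(x):
    """Move the channel axis last: (N, C, H, W) -> (N, H, W, C)."""
    return np.transpose(x, (0, 2, 3, 1))


def nhwc_to_nchw(x):
    """Move the channel axis back: (N, H, W, C) -> (N, C, H, W)."""
    return np.transpose(x, (0, 3, 1, 2))


# Illustrative tensor: batch 2, 3 channels, height 4, width 5 (NCHW).
x_nchw = np.arange(2 * 3 * 4 * 5, dtype=np.float32).reshape(2, 3, 4, 5)
x_nhwc = nchw_to_nhwc(x_nchw)
```

The element at `x_nchw[n, c, h, w]` ends up at `x_nhwc[n, h, w, c]`, so the values are unchanged and only the axis order differs. This is why operator schedules and tuning work done for one layout do not carry over directly to the other.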

-- 
You are receiving this because you are subscribed to this thread.
Reply to this email directly or view it on GitHub:
https://github.com/dmlc/tvm/issues/2519#issuecomment-484746869
