No. TVM also generates CUDA code. It's because only X86 has NCHW[c]c layout that needs graph tuning to optimize layout transform overhead between different NCHW[x]c layouts. In GPU, all conv2d are in NCHW layout, so we don't have layout transform overhead between ops, so we don't need graph tuner.
--- [Visit Topic](https://discuss.tvm.apache.org/t/graph-optimization-not-exist-in-gpu/8051/4) to respond. You are receiving this because you enabled mailing list mode. To unsubscribe from these emails, [click here](https://discuss.tvm.apache.org/email/unsubscribe/d982860c5718e41186c7454fb7584db2e24585e057cced4871e07f5bb722c15e).