[quote="Hzfengsy, post:1, topic:7872"] #### TensorIR:
``` with tir.block([10]) as vi: B[vi] = A0[vi] with tir.block([10]) as vi: B[vi + 10] = A1[vi] with tir.block([10]) as vi: B[vi + 20] = A2[vi] ``` The critical improvement is performance. In TIR we optimize the program by deprecating the `if` branch, which is impossible in the TE schedule. [/quote] This looks confusing to me because there is no example showing how to schedule the three blocks. I'm curious if TIR's Block construct is strong enough to support holistic fusion like [Rammer](https://www.usenix.org/system/files/osdi20-ma.pdf) and [HFuse](https://arxiv.org/pdf/2007.01277.pdf), where different Block has heterogeneous workload but we can fuse them in a single kernel. If so we can make such fusion a primitive in TIR schedule and enlarge the auto-scheduling search space. --- [Visit Topic](https://discuss.tvm.apache.org/t/rfc-tensorir-a-schedulable-ir-for-tvm/7872/56) to respond. You are receiving this because you enabled mailing list mode. To unsubscribe from these emails, [click here](https://discuss.tvm.apache.org/email/unsubscribe/efec97d52d96bebf2d6c5966d951ac12d9a81dfec8fd6988ec19ddee04582b85).