One thing I've thought about is asynchronous execution support in Relax. I don't know if this is already planned as part of either [Heterogenous execution](https://github.com/apache/tvm/issues/15101) or [DistIR](https://github.com/apache/tvm/pull/15289) work, but just wanted to mention it in the discussion.
Even though we have async support in TIR, async support at the graph level could open up a lot of optimization opportunities, but it would also of course need to planned out properly. --- [Visit Topic](https://discuss.tvm.apache.org/t/discuss-tvm-community-strategy-for-foundational-models/15401/5) to respond. You are receiving this because you enabled mailing list mode. To unsubscribe from these emails, [click here](https://discuss.tvm.apache.org/email/unsubscribe/2de34fec36f4df0631375fe4332039ce95f2c9ea9e7150ab087f04a4e4d25166).