To clarify a bit: we do not have to ask for everything to be done as a form of schedule, so it is OK, for example, to generate a compute definition that already contains packing (you can view that as one special dispatch pass).
The main ask is that the TIR schedule pass should detect the already-packed TIR and continue scheduling it (one way might be to detect a tag attached to the block). That being said, I think it should be possible to insert array packing through cache_read and transform layout.
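To make the two ideas above concrete, here is a rough sketch in plain Python (deliberately not TVM API). It assumes a packing factor `PACK` and an index map `(k, n) -> (n // PACK, k, n % PACK)`, which is the kind of mapping one might hand to a layout transform applied to a cached copy of the weight. The annotation key `"prepacked"` and all function names are hypothetical and only illustrate how a schedule pass could skip re-packing when a block is already tagged.

```python
PACK = 4  # hypothetical packing factor along the N axis


def pack_index(k, n):
    """Index map for the packed layout: (k, n) -> (n // PACK, k, n % PACK)."""
    return (n // PACK, k, n % PACK)


def pack(B):
    """Copy a K x N matrix into the packed (N // PACK) x K x PACK layout."""
    K, N = len(B), len(B[0])
    assert N % PACK == 0, "sketch assumes N is a multiple of PACK"
    packed = [[[0] * PACK for _ in range(K)] for _ in range(N // PACK)]
    for k in range(K):
        for n in range(N):
            no, kk, ni = pack_index(k, n)
            packed[no][kk][ni] = B[k][n]
    return packed


def matmul_packed(A, packed):
    """Matmul that reads the weight through the packed layout."""
    M, K = len(A), len(A[0])
    N = len(packed) * PACK
    C = [[0] * N for _ in range(M)]
    for m in range(M):
        for n in range(N):
            for k in range(K):
                no, kk, ni = pack_index(k, n)
                C[m][n] += A[m][k] * packed[no][kk][ni]
    return C


def matmul_ref(A, B):
    """Reference matmul on the original K x N layout."""
    M, K, N = len(A), len(B), len(B[0])
    return [[sum(A[m][k] * B[k][n] for k in range(K)) for n in range(N)]
            for m in range(M)]


def is_prepacked(annotations):
    """Hypothetical tag check: has this block already been packed?"""
    return bool(annotations.get("prepacked", False))


def schedule(A, B_or_packed, annotations):
    """Pack only when the block is not already tagged as packed."""
    packed = B_or_packed if is_prepacked(annotations) else pack(B_or_packed)
    return matmul_packed(A, packed)
```

Either path, packing inside the schedule or receiving an already-packed definition with the tag set, ends up in the same `matmul_packed` computation, which is the property the schedule pass needs to preserve.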