I am bit confused, maybe I misunderstood your suggestion.
I am using the debug executor to measure the latency of the individual (fused) TIRfunctions, but I cannot tell which function corresponds to which part of the original/optimized relay graph. (example of TIR function name: fused_layout_transform_nn_batch_flatten) So I am aware of the n:m mapping between Relay nodes and TIR functions, however, I would like to keep information about filter sizes and which operations are fused in the TIR functions. As the model to predict the performance needs additional information. --- [Visit Topic](https://discuss.tvm.apache.org/t/profile-on-relay-level/9568/5) to respond. You are receiving this because you enabled mailing list mode. To unsubscribe from these emails, [click here](https://discuss.tvm.apache.org/email/unsubscribe/d10b066cd67568d1f570326694040424bea91f09f14c9858f0c03f2ff441656b).