
I'd like the post-processing part of the model to be executed on the CPU instead of the accelerator, is there a method to tell TVM to 'stop' partitioning for external BYOC compiler at some specific Relay operators (e.g., the `ArgMax` and `Gather` op in this case)? Should I write a custom pass or use the existing functionality to achieve this (like using `AnnotateTarget` and defining a custom pattern to isolate this subgraph)? --- [Visit Topic](https://discuss.tvm.apache.org/t/byoc-how-to-partition-specific-region-of-a-relay-graph-to-cpu/11758/1) to respond. You are receiving this because you enabled mailing list mode. To unsubscribe from these emails, [click here](https://discuss.tvm.apache.org/email/unsubscribe/22b667ec3edb9b37edab15522403fda0cfbc4c3d79fc97378c089949443d16b6).