I cannot reproduce the results you are getting. For me, the graph runtime and the VM are within 10% of each other in profiling. And they are pretty close to the benchmark results too.
Here are some questions that might help you debug this: - Have you tried running on a different machine? - Have you tried using a target that is specific to your machine? (Something like `llvm -mcpu=core-avx2 -model=epyc-7452`) - Have you tried running without graph tuning? - Have you tried different networks? --- [Visit Topic](https://discuss.tvm.apache.org/t/difference-in-profiler-outputs/11255/6) to respond. You are receiving this because you enabled mailing list mode. To unsubscribe from these emails, [click here](https://discuss.tvm.apache.org/email/unsubscribe/b3de5dabab61f24ffafe512d75c4169cc12bf7df5069a7784b77b2cf97736135).