Threads in python aren't actually executed concurrently due to the GIL. So the reason everything is slower is because you aren't actually doing anything in parallel. Also, it could be a lot of you time is actually spent in the python interpreter instead of executing you model. You should try using time evaluator (https://tvm.apache.org/docs/api/python/runtime.html#tvm.runtime.Module.time_evaluator) instead of your own python loop.
--- [Visit Topic](https://discuss.tvm.apache.org/t/multithread-threadpool-performance-degradation-when-running-relay-module-in-multiple-threads/10374/11) to respond. You are receiving this because you enabled mailing list mode. To unsubscribe from these emails, [click here](https://discuss.tvm.apache.org/email/unsubscribe/3e436386ee4915f592b2eec9812e5e7802fd90a4dfcafe8ceef5dcbe54d85d4d).