[Apache TVM Discuss] [Questions] Quantizatition and pruned model

Wang Yucheng via Apache TVM Discuss Thu, 24 Jun 2021 04:47:53 -0700


You can add your own legalize strategy; see this PR for reference:
https://github.com/apache/tvm/pull/8222 
However, I'm not sure you will actually get a speedup after padding. You could 
also consider directly modifying the CUDA strategy for your conv2d_int8 case so 
that it dispatches to a TOPI implementation that does not require dp4a: 
https://github.com/apache/tvm/blob/5fa1c6dae0903f4dc31d39d42fcf582190ac1a68/python/tvm/relay/op/strategy/cuda.py#L119
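
To make the padding trade-off concrete, here is a small plain-Python sketch. It is not TVM code: `dp4a`, `pad_to_multiple`, and `dot_with_dp4a` are hypothetical helpers that only emulate the idea, assuming the constraint is that the reduction length (for example the channel count) must be a multiple of 4 because dp4a consumes four int8 values per instruction.

```python
def dp4a(a4, b4, acc):
    """Emulate dp4a: a 4-element int8 dot product added to an accumulator."""
    assert len(a4) == 4 and len(b4) == 4
    return acc + sum(x * y for x, y in zip(a4, b4))


def pad_to_multiple(values, multiple=4):
    """Zero-pad a list so its length becomes a multiple of `multiple`."""
    remainder = len(values) % multiple
    if remainder == 0:
        return list(values)
    return list(values) + [0] * (multiple - remainder)


def dot_with_dp4a(a, b):
    """Dot product computed in groups of four, padding with zeros if needed."""
    a, b = pad_to_multiple(a), pad_to_multiple(b)
    acc = 0
    for i in range(0, len(a), 4):
        acc = dp4a(a[i:i + 4], b[i:i + 4], acc)
    return acc


# Hypothetical example: 3 input channels, as in a pruned layer.
channels_a = [1, 2, 3]
channels_b = [4, 5, 6]
padded_result = dot_with_dp4a(channels_a, channels_b)
plain_result = sum(x * y for x, y in zip(channels_a, channels_b))
assert padded_result == plain_result  # zero padding does not change the result
```

The zeros do not change the result, but they are still multiplied: going from 3 to 4 channels means a quarter of the work in each group is wasted. That overhead is why the speedup after padding is not guaranteed.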
