Hi, All.
Existing `TensorIntrin` support "reduce_init" and "reduce_body" which could
cover most cases, which is very good. However, when I was trying to implement a
tensor intrinsic like "matmul_with_relu", current TensorIntrin is not
sufficient to describe it.
The TIR I'm looking for is something like:
<pre>
if (k == K - 1) {
# call "matmul_with_relu" kernel, currently this part is MISSING.
} else if (k == 0) {
# call "matmul_beta_0" kernel, which is exactly what "reduce_init" is doing.
} else {
# call "matmul_beta_1" kernel, which is exactly what "reduce_update" is doing.
}
</pre>
Do we have plan to support the "reduce_last" attribute for TensorIntrin.
---
[Visit
Topic](https://discuss.tvm.apache.org/t/tensorize-support-reduce-last-for-tensorintrin/10392/1)
to respond.
You are receiving this because you enabled mailing list mode.
To unsubscribe from these emails, [click
here](https://discuss.tvm.apache.org/email/unsubscribe/c6cd7241e001fe577f34964622c913b0f87a99a4e343a30e83a8683962ad2a1a).