everton.constantino added a comment.

@fhahn That was my first idea however its not as simple as it looks. I tried 
moving the adds but splats make it considerably harder to find a pattern that 
catches this and fuses the multiplies specially with bigger matrices. My real 
wish was to actually add a new IR instruction to handle matrices because the 
MADD is but a simple example of other more interesting optimizations that can 
be done, like using matrix associative properties to reduce the number of 
calculations. I found that path too complicated however and I opted for a 
compromise at the moment. I wish to start writing some GEMM micro-kernels with 
this extension and this builtin was the shortest path.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D99433/new/

https://reviews.llvm.org/D99433

_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

Reply via email to