https://gcc.gnu.org/bugzilla/show_bug.cgi?id=51119
--- Comment #17 from Jerry DeLisle <jvdelisle at gcc dot gnu.org> --- I have done some experimenting. Since gcc supports OMP and I think to some extent ACC why not come up with a MATMUL that exploits these if present? On the darwin platform discussed in comment #12, the performance is excellent. Does darwin implementation provided exploit OpenCL? What is it using? Why not enable that on other platforms if present. I am going to explore OpenCL and clBLAS to see if I can get it to work. If I am successful, I would like to hide it behind MATMUL if possible. Any other opinions?