On 03/01/2017 01:00 PM, Thomas Koenig wrote:
> Hello world,
>
> the attached patch enables FMA for the AVX2 and AVX512F variants of
> matmul. This should bring a very nice speedup (although I have
> been unable to run benchmarks due to lack of a suitable machine).
>
> Question: Is this still appropriate for the current state of trunk?
> Or rather, OK for when gcc 8 opens (which might still be some time
> in the future)?
I think it may be appropriate now, because you are adjusting a feature that
was only just added.
I would prefer that it be tested on the actual target platform first. Does
anyone on this list have access to one of these machines to test?
Jerry