Re: [patch, libfortran] AMD-specific versions of library matmul

2017-05-26 Thread Bill Seurer
On 05/26/2017 12:41 AM, Andrew Pinski wrote: On Thu, May 25, 2017 at 6:43 PM, Jerry DeLisle wrote: On 05/25/2017 02:57 PM, Thomas Koenig wrote: Hi everybody, I have committed the patch (with the corrections for the name) as rev 248472. The infrastructure is in place, so we will be able to m

Re: [patch, libfortran] AMD-specific versions of library matmul

2017-05-25 Thread Andrew Pinski
On Thu, May 25, 2017 at 6:43 PM, Jerry DeLisle wrote: > On 05/25/2017 02:57 PM, Thomas Koenig wrote: >> >> Hi everybody, >> >> I have committed the patch (with the corrections for the name) >> as rev 248472. >> >> The infrastructure is in place, so we will be able to make >> any fine-tuning easily

Re: [patch, libfortran] AMD-specific versions of library matmul

2017-05-25 Thread Jerry DeLisle
On 05/25/2017 02:57 PM, Thomas Koenig wrote: Hi everybody, I have committed the patch (with the corrections for the name) as rev 248472. The infrastructure is in place, so we will be able to make any fine-tuning easily. Regards Thomas Based on my testing I think it is close enough as i

Re: [patch, libfortran] AMD-specific versions of library matmul

2017-05-25 Thread Thomas Koenig
Hi everybody, I have committed the patch (with the corrections for the name) as rev 248472. The infrastructure is in place, so we will be able to make any fine-tuning easily. Regards Thomas

Re: [patch, libfortran] AMD-specific versions of library matmul

2017-05-25 Thread Jerry DeLisle
On 05/25/2017 10:20 AM, Janne Blomqvist wrote: On Thu, May 25, 2017 at 1:45 PM, Thomas Koenig wrote: Hello world, the attached patch speeds up the library version of matmul for AMD chips by selecting AVX128 instructions and, depending on which instructions are supported, either FMA3 (aka FMA)

Re: [patch, libfortran] AMD-specific versions of library matmul

2017-05-25 Thread Janne Blomqvist
On Thu, May 25, 2017 at 1:45 PM, Thomas Koenig wrote: > Hello world, > > the attached patch speeds up the library version of matmul for AMD chips > by selecting AVX128 instructions and, depending on which instructions > are supported, either FMA3 (aka FMA) or FMA4. > > Jerry tested this on his AMD

Re: [patch, libfortran] AMD-specific versions of library matmul

2017-05-25 Thread Thomas Koenig
Hi Jerry, Yes, OK. Maybe test Ryzen first? Sure, I can wait for a bit :-) I just confirmed access to the Ryzen machines so I plan to get set up and test there. The gcc compile farm machines? My ssh key does not work there... I have based the choice of FMA(3) over FMA4 when both are avail

Re: [patch, libfortran] AMD-specific versions of library matmul

2017-05-25 Thread Jerry DeLisle
On 05/25/2017 03:45 AM, Thomas Koenig wrote: Hello world, the attached patch speeds up the library version of matmul for AMD chips by selecting AVX128 instructions and, depending on which instructions are supported, either FMA3 (aka FMA) or FMA4. Jerry tested this on his AMD systems, and found