We aren't after average HPC programmers...

Even good compilers (Intel) are very very limited in their optimisations.
We got factors of 2x and 3x by hand writing SSSE3 commands on standard
Xeon's rather than let the compiler do its thing...  Compiler limitations
isn't particular to Phi.

On Wed, Jun 20, 2018 at 2:47 AM Prentice Bisbal <pbis...@pppl.gov> wrote:

>
> If you organize your code correctly, and call the compiler with the right
> optimization flags, shouldn't the compiler automatically handle a good
> portion of this 'low-level' stuff? I understand that hand-coding this stuff
> usually still give you the best performance (See GotoBLAS/OpenBLAS, for
> example), but does your average HPC programmer trying to get decent
> performance need to hand-code that stuff, too?
>
>
>

-- 
Dr Stuart Midgley
sdm...@gmail.com
_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf

Reply via email to