In the recent HPCG discussion, it was proposed to use the now widely used likwid benchmark to estimate memory bandwidth. It gives excellent estimates of what the hardware is capable of.
Am I right that likwid uses its own hand-optimized assembly code for each specific hardware platform? If so, then for the HPC user STREAM gives the more relevant estimate: the application is translated by the compiler (users do not write in assembler, except for modules from mathematical libraries), so STREAM gives a realistic estimate of the bandwidth the application will actually get.

Mikhail Kuzminsky
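To illustrate the point about compiler-generated code, here is a minimal sketch of a STREAM-style triad kernel in plain C. It is not the actual STREAM source: the names `triad`, `now_seconds`, and `triad_gb_per_s` are hypothetical, the loop is single-threaded (STREAM is normally built with OpenMP), and timing uses C11 `timespec_get`. The byte count follows the STREAM convention of three arrays of doubles per iteration. Whatever instructions the loop becomes is decided by the compiler and its flags, which is exactly the situation an application is in.

```c
#include <assert.h>
#include <stdlib.h>
#include <time.h>

/* STREAM-style triad: a[i] = b[i] + scalar * c[i].
   Plain C, so the machine code is whatever the compiler emits. */
static void triad(double *restrict a, const double *restrict b,
                  const double *restrict c, double scalar, size_t n)
{
    for (size_t i = 0; i < n; i++)
        a[i] = b[i] + scalar * c[i];
}

/* Wall-clock time in seconds via C11 timespec_get. */
static double now_seconds(void)
{
    struct timespec ts;
    timespec_get(&ts, TIME_UTC);
    return (double)ts.tv_sec + 1e-9 * (double)ts.tv_nsec;
}

/* Hypothetical helper: runs the triad `reps` times over arrays of n doubles
   and returns the best observed bandwidth in GB/s, or -1.0 on failure.
   The last element of the result is stored in *check so the compiler
   cannot discard the loop. */
double triad_gb_per_s(size_t n, int reps, double *check)
{
    assert(n > 0 && reps > 0 && check != NULL);

    double *a = malloc(n * sizeof *a);
    double *b = malloc(n * sizeof *b);
    double *c = malloc(n * sizeof *c);
    if (!a || !b || !c) {
        free(a); free(b); free(c);
        return -1.0;
    }

    /* Initialise (and touch) every page before timing. */
    for (size_t i = 0; i < n; i++) {
        a[i] = 0.0;
        b[i] = 1.0;
        c[i] = 2.0;
    }

    double best = -1.0;
    for (int r = 0; r < reps; r++) {
        double t0 = now_seconds();
        triad(a, b, c, 3.0, n);
        double elapsed = now_seconds() - t0;
        if (elapsed > 0.0 && (best < 0.0 || elapsed < best))
            best = elapsed;
    }

    *check = a[n - 1];   /* 1.0 + 3.0 * 2.0 */
    free(a); free(b); free(c);

    if (best < 0.0)
        return -1.0;     /* timer resolution too coarse for this n */

    /* STREAM convention for triad: two arrays read, one written. */
    double bytes = 3.0 * (double)sizeof(double) * (double)n;
    return bytes / best / 1e9;
}
```

For a meaningful number, n should be large enough that the three arrays together far exceed the last-level cache, and the file should be built with the same compiler and optimization flags as the application of interest.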