In message from Bill Broadley <b...@cse.ucdavis.edu> (Fri, 14 Aug 2009
16:13:21 -0700):
Mikhail Kuzminsky wrote:
Your results look excellent, so I wouldn't be surprised if they are
running at 1333.
I get 12-18 GB/s with 4 threads of STREAM (ifort) and DDR3-1066 on a
dual E5520 server. But it runs under a "numa-bad" kernel, with no
control over NUMA-efficient allocation.
Sounds pretty bad.
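On a kernel that does expose NUMA placement, one common way to take the
allocation question out of the picture is to launch the benchmark under
numactl. A minimal sketch, assuming numactl is installed and the STREAM
binary sits at the hypothetical path ./stream; it only builds the command
line and environment, and is not a description of either poster's setup:

```python
import os


def stream_launch(binary="./stream", threads=8, interleave=True):
    """Build (argv, env) for running a STREAM binary under numactl.

    `binary` and `threads` are illustrative assumptions, not values
    taken from the thread.
    """
    argv = ["numactl"]
    if interleave:
        # Spread pages round-robin over all NUMA nodes.
        argv.append("--interleave=all")
    else:
        # Pin both CPUs and memory to node 0 to measure a single socket.
        argv += ["--cpunodebind=0", "--membind=0"]
    argv.append(binary)

    env = dict(os.environ)
    # Standard OpenMP variable controlling the thread count.
    env["OMP_NUM_THREADS"] = str(threads)
    return argv, env


if __name__ == "__main__":
    argv, env = stream_launch(threads=8)
    print("OMP_NUM_THREADS=" + env["OMP_NUM_THREADS"], " ".join(argv))
```

Interleaving is only one possible policy; comparing it against the
node-0-only variant gives a rough idea of how much placement matters.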
Why 4 threads? You need 8 cores to keep all 6 memory busses busy.
For comparison with your tests: you have only 4 cores. With 8 threads I
get 20-26 GB/s.
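For scale, the measured figures can be set against the theoretical peak
of the memory system. A rough sketch, assuming 64-bit (8-byte) DDR3
channels, the 6 channels mentioned above for the dual-socket box, and
decimal GB; the function names are illustrative:

```python
def peak_bandwidth_gbs(mega_transfers_per_s, channels, bytes_per_transfer=8):
    """Theoretical peak bandwidth in GB/s (1 GB = 1e9 bytes)."""
    return mega_transfers_per_s * 1e6 * bytes_per_transfer * channels / 1e9


def fraction_of_peak(measured_gbs, peak_gbs):
    """Measured bandwidth as a fraction of the theoretical peak."""
    return measured_gbs / peak_gbs


if __name__ == "__main__":
    for speed in (1066, 1333):
        peak = peak_bandwidth_gbs(speed, channels=6)
        print(f"DDR3-{speed}, 6 channels: peak {peak:.1f} GB/s")
        # Measured STREAM figures quoted in the thread.
        for measured in (12, 18, 20, 26):
            pct = 100 * fraction_of_peak(measured, peak)
            print(f"  {measured} GB/s is {pct:.0f}% of peak")
```

Under these assumptions the peak is roughly 51 GB/s for DDR3-1066 and
64 GB/s for DDR3-1333, so 20-26 GB/s on 8 threads is about 39-51% of the
DDR3-1066 peak.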
Which compiler?
The ifort mentioned above is Intel Fortran 11.0.38.
Mikhail
open64 does substantially better than gcc.
_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing
To change your subscription (digest mode or unsubscribe) visit
http://www.beowulf.org/mailman/listinfo/beowulf