In message from Bill Broadley <b...@cse.ucdavis.edu> (Fri, 14 Aug 2009 16:13:21 -0700):
Mikhail Kuzminsky wrote:
Your results look excellent, so I wouldn't be surprised if they are
running at 1333.

I have 12-18 GB/s on 4 threads of stream/ifort w/DDR3-1066 on dual E5520
server. But it works under "numa-bad" kernel w/o control of
numa-efficient allocation.

Sounds pretty bad.

Why 4 threads?  You need 8 cores to keep all 6 memory busses busy.

For comparison w/your tests: you have only 4 cores. On 8 threads I have 20-26 GB/s.

Which compiler?
ifort pointed above means intel fortran 11.0.38.

Mikhail

open64 does substantially better than gcc.

--
üÔÏ ÓÏÏÂÝÅÎÉÅ ÂÙÌÏ ÐÒÏ×ÅÒÅÎÏ ÎÁ ÎÁÌÉÞÉÅ × ÎÅÍ ×ÉÒÕÓÏ×
É ÉÎÏÇÏ ÏÐÁÓÎÏÇÏ ÓÏÄÅÒÖÉÍÏÇÏ ÐÏÓÒÅÄÓÔ×ÏÍ
MailScanner, É ÍÙ ÎÁÄÅÅÍÓÑ
ÞÔÏ ÏÎÏ ÎÅ ÓÏÄÅÒÖÉÔ ×ÒÅÄÏÎÏÓÎÏÇÏ ËÏÄÁ.


_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf

Reply via email to