"The Case of the Missing Supercomputing Performance"
I wondered if you were talking about that paper but it's from lanl not sandia,
it should be essential reading for everyone working with large clusters.
I love this paper. but it's critical to realize that it's all about
very large, very tightly-coupled, frequent-global-collective-using
applications. you could easily have a 2k-node cluster (I'd call it large)
dedicated to 1-to-100-core jobs and gleefully ignore jitter. or be running
an 8k-core montecarlo that never needs any global synchronization, etc.
I'd actually love to see data on whether jitter affects apps
other than ah, "stockpile stewardship" ;)
_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org
To change your subscription (digest mode or unsubscribe) visit
http://www.beowulf.org/mailman/listinfo/beowulf