----- "Jason Clinton" <[email protected]> wrote:

Hi Jason,

> We saw a similar power-off issue on a customer of ours who upgraded
> from 2220's to Barcelona's on a similar board; it was reproducible at
> the same failure rate on approximately 160 nodes. After trying just
> about everything under the sun, we wholesale replaced all the memory
> in the entire cluster. The power-offs ceased immediately thereafter
> and have not returned.

We saw that with Barcelona's, but instead going to the 2.3GHz (75W)
Shanghai's solved the issue for us - we were rather surprised to see
it reappear with the 2.4GHz (55W) Shanghai. :-(

cheers,
Chris
--
Christopher Samuel - (03) 9925 4751 - Systems Manager
The Victorian Partnership for Advanced Computing
P.O. Box 201, Carlton South, VIC 3053, Australia
VPAC is a not-for-profit Registered Research Agency
