Hi all. This is not a direct HPC question per se, but your clusters are an excellent source of the information I need, so here goes:
/Could those of you running ECC memory give me an updated figure on the number of errors detected/corrected per day per system?/

We are working on self-healing mechanisms and we need real data on the number of errors that state-of-the-art systems are facing today. You can imagine why I envy your farms...

I have an old figure of about 1 bit error per day per system at sea level, but I would like to know whether it is getting worse or better.

Thanks in advance,
Ariel
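
P.S. In case it helps anyone willing to report numbers: below is a rough sketch of how the per-day figure could be collected on a Linux node. It assumes the kernel EDAC driver is loaded and exposes its counters under /sys/devices/system/edac/mc/mc*/ce_count and ue_count, and that those counters have not been reset since boot (they are cleared on reboot or driver reload, so treat the result as an estimate). The function names are my own, not part of any tool.

```python
from pathlib import Path

# Linux EDAC sysfs tree; only present when an EDAC driver is loaded (assumption).
EDAC_ROOT = Path("/sys/devices/system/edac/mc")


def read_edac_counts(root=EDAC_ROOT):
    """Sum corrected (ce) and uncorrected (ue) error counts over all memory controllers."""
    totals = {"ce": 0, "ue": 0}
    for mc in sorted(Path(root).glob("mc[0-9]*")):
        for key in totals:
            counter = mc / f"{key}_count"
            if counter.is_file():
                totals[key] += int(counter.read_text().strip())
    return totals


def uptime_days(proc_uptime="/proc/uptime"):
    """Return system uptime in days; the first field of /proc/uptime is seconds since boot."""
    seconds = float(Path(proc_uptime).read_text().split()[0])
    return seconds / 86400.0


def errors_per_day(count, days):
    """Average error rate; NaN if the observation window is empty."""
    return count / days if days > 0 else float("nan")
```

Calling errors_per_day(read_edac_counts()["ce"], uptime_days()) on each node would give corrected errors per day per system, which is the figure I am after. Installed memory size and altitude would be useful alongside it.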
_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org
To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf