Hello All -

I have a compute node that has started dropping off. When I say drop off, I mean the node (while running a job) will lose all connectivity and the machine does not respond. I have viewed the logs and can find no reason for the node to cease functioning. Let me state that this behavior did not occur until after a processor upgrade, BIOS upgrade and OS upgrade. I went in to the BIOS and made a few changes that seemed to prolong it even though its occurrence was mostly random. If I leave the node idle, it will run for days.

Has anyone ever seen such behavior?

Tim
begin:vcard
fn:Timothy Moore
n:Moore;Timothy
org:Trident Consulting Group;Special Programs
adr:Suite 106;;600 Boulevard South;Huntsville;AL;35802;USA
email;internet:[EMAIL PROTECTED]
title:President & Chief Scientist
tel;work:(256) 882-1001
tel;fax:(256) 882-1002
tel;cell:(256) 348-9702
url:http://www.tcg-hsv.com
version:2.1
end:vcard

_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf

Reply via email to