s://github.com/ganglia/gmond_python_modules/tree/master/gpu/nvidia
Adam
On Friday, June 5, 2015, Kevin Abbey wrote:
Hi,
I recently installed a Nvidia K80 gpu in a server. Can anyone share
methods and procedures for monitoring and ensuring the card is cooled
sufficiently by the server fans? I
Hi,
I recently installed a Nvidia K80 gpu in a server. Can anyone share
methods and procedures for monitoring and ensuring the card is cooled
sufficiently by the server fans? I need to set this up and test before
running any compute tests.
Thanks,
Kevin
--
Kevin Abbey
Systems
I tried this on a Supermicro board and a Sun box. On both systems the
system would reboot randomly so I tuned it off. This is a serious
problem of false positives. In a cluster, you may need to notify the
scheduler in someway when a node reboots. Can someone elaborate on
this? Specifically
Hi Joe,
Can that 9% difference be due to the Intel capability to overclock one
core and turn the others off?
Or is does this Intel feature require manual switch somewhere?
Thank you,
Kevin
Joe Landman wrote:
Hi folks:
Thought you might like to see this. I rewrote the interior loop for
o