Graphically, Ganglia works well.

text bases you might consider using pdsh to do an hourly uptime for load info then run that through awk to clean it up and put it in a readable format. In the past, I've also run "top" with a count of 1 and used awk to clean it up.

It can be good to know not only memory and load, but what jobs are running at a given time.


Honestly though, I mainly use ganglia these days with job details being provided by SGE via the qstat command.


Mike Davis




A Lenzo wrote:
Hello Cluster Colleagues,

I would like to begin monitoring my cluster in order to see what the usage is at different times of day. A simple method would work - I am looking for advice on this. The mosmon utility is enticing since it shows the usage on all nodes at once, but of course, I can't pipe the output to a text file. If I can find the right tool for the job, I am sure I can keep it running every hour or so with Cron. Ideally, I'd love to measure memory usage and also CPU usage.

Suggestions?

Thanks!
Tony


------------------------------------------------------------------------

_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf

_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf

Reply via email to