Re: [Beowulf] Cluster Metrics? (Upper management view)

2010-08-20 Thread Mark Hahn
I think measuring a clusters success based on the number of jobs run or cpu's used is a bad measure of true success. I would be more inclined to consider a cluster a success by speaking with the people who use it and find out not only whether they can use it effectively and/or what new science ha

Re: [Beowulf] typical protocol for cleanup of /tmp: on reboot? cron job? tmpfs?

2010-08-20 Thread Mark Hahn
What's the typical protocol about the cleanup of /tmp folders? Do we just leave the default (2 week) cron-driven tmpwatch in place. we don't have a lot of users who bother with /tmp, though, since we have reasonable lustre-based storage on all our big clusters. if we took the time to do it righ

Re: [Beowulf] typical protocol for cleanup of /tmp: on reboot? cron job? tmpfs?

2010-08-20 Thread Michael Di Domenico
Redhat and the likes have a utility called 'tmpwatch' On Fri, Aug 20, 2010 at 6:53 PM, Rahul Nabar wrote: > On Fri, Aug 20, 2010 at 4:40 PM, Reuti wrote: >> Am 20.08.2010 um 22:40 schrieb Rahul Nabar: > > Thanks Jon and Reuti! > >> are you using any queuing system? I try to get all applications

Re: [Beowulf] Cluster Metrics? (Upper management view)

2010-08-20 Thread Alan Louis Scheinine
The measure of a cluster depends on how it is intended to be used. Big computer centers tend to measure the percentage of time the CPUs are running jobs. In contrast, if a cluster is used for program develop there must be idle CPUs in order to reduce that wait time, keeping in mind when people wa

Re: [Beowulf] typical protocol for cleanup of /tmp: on reboot? cron job? tmpfs?

2010-08-20 Thread Rahul Nabar
On Fri, Aug 20, 2010 at 4:40 PM, Reuti wrote: > Am 20.08.2010 um 22:40 schrieb Rahul Nabar: Thanks Jon and Reuti! > are you using any queuing system? I try to get all applications set up in > such a way, that they write all their stuff to $TMPDIR. It's in [OS]GE and I > think also in Torque fo

Re: [Beowulf] typical protocol for cleanup of /tmp: on reboot? cron job? tmpfs?

2010-08-20 Thread Reuti
Hi, Am 20.08.2010 um 22:40 schrieb Rahul Nabar: > What's the typical protocol about the cleanup of /tmp folders? Do > people clean them on each reboot or at intervals with a cron (sounds a > bad idea). I was always under the impression that a reboot cleans them > but apparantly not on my CentOS d

Re: [Beowulf] typical protocol for cleanup of /tmp: on reboot? cron job? tmpfs?

2010-08-20 Thread Jon Forrest
On 8/20/2010 1:40 PM, Rahul Nabar wrote: What's the typical protocol about the cleanup of /tmp folders? Do people clean them on each reboot or at intervals with a cron (sounds a bad idea). I was always under the impression that a reboot cleans them but apparantly not on my CentOS distro, by defau

[Beowulf] typical protocol for cleanup of /tmp: on reboot? cron job? tmpfs?

2010-08-20 Thread Rahul Nabar
What's the typical protocol about the cleanup of /tmp folders? Do people clean them on each reboot or at intervals with a cron (sounds a bad idea). I was always under the impression that a reboot cleans them but apparantly not on my CentOS distro, by default. I was burnt earlier today when ompi-ps

Re: [Beowulf] Cluster Metrics? (Upper management view)

2010-08-20 Thread Reuti
Hi, Am 20.08.2010 um 19:34 schrieb Stuart Barkley: > What sort of business management level metrics do people measure on > clusters? Upper management is asking for us to define and provide > some sort of "numbers" which can be used to gage the success of our > cluster project. > > We currently

Re: [Beowulf] Cluster Metrics? (Upper management view)

2010-08-20 Thread Prentice Bisbal
I couldn't have said it better myself. Be wary of suits asking for asking for numbers. Michael Di Domenico wrote: > I think measuring a clusters success based on the number of jobs run > or cpu's used is a bad measure of true success. I would be more > inclined to consider a cluster a success b

Re: [Beowulf] Cluster Metrics? (Upper management view)

2010-08-20 Thread Michael Di Domenico
I think measuring a clusters success based on the number of jobs run or cpu's used is a bad measure of true success. I would be more inclined to consider a cluster a success by speaking with the people who use it and find out not only whether they can use it effectively and/or what new science hav

[Beowulf] Cluster Metrics? (Upper management view)

2010-08-20 Thread Stuart Barkley
What sort of business management level metrics do people measure on clusters? Upper management is asking for us to define and provide some sort of "numbers" which can be used to gage the success of our cluster project. We currently have both SGE and Torque/Moab in use and need to measure both if