On Wed, 16 Jan 2008, Craig Tierney wrote:
> Our queue limits are 8 hours. ... Did that sysadmin who set 24 hour time limits ever analyze the amount of lost computational time because of larger time limits?
While I agree with the idea behind short job runtime limits and the reasons for them, I disagree with your formulation. Having been involved many times in discussions about what runtime limits should be set, I wouldn't make a statement like yours myself; I would instead say: YMMV. In other words: choose what best fits the job mix that your users are actually running. If you have determined that an 8h maximum runtime is appropriate for _your_ cluster and that increasing it to 24h would waste computational time given the reliability of _your_ cluster, then you've done your job well. But saying that everybody should use this limit is wrong.
Furthermore, although you mention that system-level checkpointing comes with a performance hit, you seem to assume that user-level checkpointing is much lighter, which is most often not the case. Apart from the obvious I/O limitations that can slow the saving and loading of checkpoint data, there are applications whose developers have chosen not to store certain data but to recompute it every time it is needed, because the effort of saving, storing and loading it is higher than the computational effort of recreating it. This most likely means that the data has to be recomputed on every restart of the application - and smaller maximum runtimes mean more restarts are needed to reach the same total runtime...
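To make the trade-off concrete, here is a rough back-of-the-envelope sketch (mine, not from the original discussion; all figures are illustrative assumptions) of how much compute is burned on recomputation when a long job is chopped into chunks by the queue limit:

```python
import math

def restart_overhead(job_hours, limit_hours, recompute_hours):
    """Hours of extra compute spent recreating unsaved data when a
    job needing job_hours of work runs under a per-job runtime limit
    of limit_hours, paying recompute_hours at each restart.
    A simplification: it ignores checkpoint write/read time itself."""
    chunks = math.ceil(job_hours / limit_hours)   # submissions needed
    restarts = chunks - 1                          # first run is free
    return restarts * recompute_hours

# Hypothetical 72h job that must recompute 0.5h of data per restart:
print(restart_overhead(72, 8, 0.5))    # 8h limit  -> 8 restarts -> 4.0h lost
print(restart_overhead(72, 24, 0.5))   # 24h limit -> 2 restarts -> 1.0h lost
```

Under these assumed numbers, tightening the limit from 24h to 8h quadruples the recomputation overhead for that job - which is exactly why the right limit depends on the local job mix and failure rate rather than on a universal rule.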
--
Bogdan Costescu
IWR - Interdisziplinaeres Zentrum fuer Wissenschaftliches Rechnen
Universitaet Heidelberg, INF 368, D-69120 Heidelberg, GERMANY
Telephone: +49 6221 54 8869, Telefax: +49 6221 54 8868
E-mail: [EMAIL PROTECTED]
_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org
To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf