On Fri, Aug 20, 2010 at 4:40 PM, Reuti <re...@staff.uni-marburg.de> wrote: > Am 20.08.2010 um 22:40 schrieb Rahul Nabar:
Thanks Jon and Reuti! > are you using any queuing system? I try to get all applications set up in > such a way, that they write all their stuff to $TMPDIR. It's in [OS]GE and I > think also in Torque for some time now, to be created automatically (as job > specific directory on a node) and removed after the job. Yes. I'm using Torque. That's an interesting feature! I'll check it out. I guess one other option is to put an epilogue that does rm -fr /tmp/* Do you use a HDD temp or a tmpfs in RAM? > > A load sensor which checks the space on a node in /scratch and put the queue > instance into alarm state, if it falls under a certain value, can in addition > prevent a black hole in the cluster, where one after the other job crashes > due to missing scratch space. > That seems neat too! But I am not sure if torque can do an alarm state on a queue too that way. -- Rahul _______________________________________________ Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf