On Fri, Aug 20, 2010 at 4:40 PM, Reuti <re...@staff.uni-marburg.de> wrote:
> Am 20.08.2010 um 22:40 schrieb Rahul Nabar:

Thanks Jon and Reuti!

> are you using any queuing system? I try to get all applications set up in 
> such a way, that they write all their stuff to $TMPDIR. It's in [OS]GE and I 
> think also in Torque for some time now, to be created automatically (as job 
> specific directory on a node) and removed after the job.

Yes. I'm using Torque. That's an interesting feature! I'll check it
out. I guess one other option is to put an epilogue that does rm -fr
/tmp/*

Do you use a HDD temp or a tmpfs in RAM?

>
> A load sensor which checks the space on a node in /scratch and put the queue 
> instance into alarm state, if it falls under a certain value, can in addition 
> prevent a black hole in the cluster, where one after the other job crashes 
> due to missing scratch space.
>

That seems neat too! But I am not sure if torque can do an alarm state
on a queue too that way.

-- 
Rahul

_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf

Reply via email to