On 1/24/22 11:17 am, Tom Harvill wrote:

We use a 'fair share' feature of our scheduler (SLURM) and have our decay half-life (the time needed for priority penalty to halve) set to 30 days.  Our maximum job runtime is 7 days.  I'm wondering what others use, please let me know if you can spare a minute.  Thank you!

We use Slurm but we don't use fairshare, instead we have a priority threshold which jobs have to age to before they can get a forward reservation on nodes (they can of course backfill before then). We configure things so that jobs age at ~1 priority point per minute and then set our QOS's so that the start time is a certain amount of time away from that threshold.

We also set things up so that only 2 jobs per user+account+qos association can age, and once one starts running the next in line will begin ageing.

All the best,
Chris
--
  Chris Samuel  :  http://www.csamuel.org/  :  Berkeley, CA, USA
_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing
To change your subscription (digest mode or unsubscribe) visit 
https://beowulf.org/cgi-bin/mailman/listinfo/beowulf

Reply via email to