Plus one for Arbiter. We used to use the banhammer but implemented this about a year ago with considerable success.
We did make a couple modifications around handling cached memory- fortunately it's written in Python and fairly straightforward to customize. - Michael On Fri, Mar 26, 2021 at 7:28 AM Prentice Bisbal via Beowulf < beowulf@beowulf.org> wrote: > Yes, there's a tool developed specifically for this called Arbiter that > uses Linux cgroups to dynamically limit resources on a login node based > on it's current load. It was developed at the University of Utah: > > https://dylngg.github.io/resources/arbiterTechPaper.pdf > > https://gitlab.chpc.utah.edu/arbiter2/arbiter2 > > Prentice > > On 3/26/21 9:56 AM, Michael Di Domenico wrote: > > does anyone have a recipe for limiting the damage people can do on > > login nodes on rhel7. i want to limit the allocatable cpu/mem per > > user to some low value. that way if someone kicks off a program but > > forgets to 'srun' it first, they get bound to a single core and don't > > bump anyone else. > > > > i've been poking around the net, but i can't find a solution, i don't > > understand what's being recommended, and/or i'm implementing the > > suggestions wrong. i haven't been able to get them working. the most > > succinct answer i found is that per user cgroup controls have been > > implemented in systemd v239/240, but since rhel7 is still on v219 > > that's not going to help. i also found some wonkiness that runs a > > program after a user logs in and hacks at the cgroup files directly, > > but i couldn't get that to work. > > > > supposedly you can override the user-{UID}.slice unit file and jam in > > the cgroup restrictions, but I have hundreds of users clearly that's > > not maintainable > > > > i'm sure others have already been down this road. any suggestions? > > _______________________________________________ > > Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing > > To change your subscription (digest mode or unsubscribe) visit > https://beowulf.org/cgi-bin/mailman/listinfo/beowulf > _______________________________________________ > Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing > To change your subscription (digest mode or unsubscribe) visit > https://beowulf.org/cgi-bin/mailman/listinfo/beowulf >
_______________________________________________ Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing To change your subscription (digest mode or unsubscribe) visit https://beowulf.org/cgi-bin/mailman/listinfo/beowulf