Hi,
I am trying to help a sysadmin colleague (and to understand for myself) trying
to configure a new slurm server and he struggles to understand if there is an
alternative way to config slurm managing job policy submission per user without
necessarily installing an accounting mariadb service.
Hi,
Surfing during days on the net and seeking talks/tutos on schedmd website, I
didn’t really find a tuto (that works on a systemd env) how to install,
configure and deploy a slurm system on a single compute server with many cores
and many memory. Explanations and tutos in administration I hav
Hi Mahmood,
Try the LBNL Node Health Check tool. Nodes which are determined to be
"unhealthy" can be marked as down or offline so as to prevent jobs from being
scheduled or run on them.
https://github.com/mej/nhc/blob/master/README.md#lbnl-node-health-check-nhc
Regards,
Richard
@cnscfr
--
Sent