Re: [slurm-users] ActiveFeatures job submission

2022-02-09 Thread Paul Brunk
Hi Alexander: This is a great case for using Node Health Check (https://github.com/mej/nhc). We use this so that each node periodically runs an admin-selected set of tests (e.g. "is /work readable?"), and automatically Drains a node which fails any of them, and puts the reason in the node's Re

[slurm-users] ActiveFeatures job submission

2022-02-01 Thread Alexander Block
Hello experts, I hope someone is out there having some experience with the "ActiveFeatures" and "AvailableFeatures" in the node configuration and can give some advise. We have configured 4 nodes with certain features, e.g. "NodeName=thin1 Arch=x86_64 CoresPerSocket=24    CPUAlloc=0 CPUTot=96