Re: [slurm-users] systemctl enable slurmd.service Failed to execute operation: No such file or directory

2022-01-30 Thread Nousheen
The same error shows up on compute node which is as follows: [root@c103008 ~]# systemctl enable slurmd.service [root@c103008 ~]# systemctl start slurmd.service [root@c103008 ~]# systemctl status slurmd.service ● slurmd.service - Slurm node daemon Loaded: loaded (/etc/systemd/system/slurmd.servi

Re: [slurm-users] systemctl enable slurmd.service Failed to execute operation: No such file or directory

2022-01-30 Thread Nousheen
Dear Jeffrey, Thank you for your response. I have followed the steps as instructed. After the copying the files to their respective locations "systemctl status slurmctld.service" command gives me an error as follows: (base) [nousheen@exxact system]$ systemctl daemon-reload (base) [nousheen@exxact

Re: [slurm-users] Fairshare within a single Account (Project)

2022-01-30 Thread Renfro, Michael
You can. We use: sacctmgr show assoc where account=researchgroup format=user,share to see current fairshare within the account, and: sacctmgr modify user where name=someuser account=researchgroup set fairshare=N to modify a particular user's fairshare within the account. From:

[slurm-users] How to tell SLURM to ignore specific GPUs

2022-01-30 Thread Paul Raines
I have a large compute node with 10 RTX8000 cards at a remote colo. One of the cards on it is acting up "falling of the bus" once a day requiring a full power cycle to reset. I want jobs to avoid that card as well as the card it is NVLINK'ed to. So I modified gres.conf on that node as follows:

[slurm-users] Fairshare within a single Account (Project)

2022-01-30 Thread Tomislav Maric
Hello everyone, We are a small research group that shares an account on the cluster and we thought we would be able to use usage reports to balance the CPUh from different users: we were wrong. Is it possible to set up Fairshare within a single Acco