Re: [slurm-users] job is pending but resources are available

2021-10-13 Thread Adam Xu
On 10/13/21 16:30, Ole Holm Nielsen wrote: On 10/13/21 9:59 AM, Adam Xu wrote: On 2021/10/13 9:22, Brian Andrus wrote: Something is very odd when you have the node reporting: RealMemory=1 AllocMem=0 FreeMem=47563 Sockets=2 Boards=1 What do you get when you run ‘slurmd -C’ on the node? # slurmd

Re: [slurm-users] job is pending but resources are available

2021-10-13 Thread Adam Xu
ThreadsPerCore=1 RealMemory=128306 UpTime=22-16:14:48 Brian Andrus *From: *Adam Xu <mailto:adam...@adagene.com.cn> *Sent: *Tuesday, October 12, 2021 6:07 PM *To: *slurm-users@lists.schedmd.com *Subject: *Re: [slurm-users] job is pending but resources are available On 2021/10/12 21:21, Adam
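
The two memory figures quoted in this thread (RealMemory=1 reported for the node, RealMemory=128306 printed by slurmd -C) can be compared mechanically. The following is a minimal Python sketch, assuming both outputs are whitespace-separated Key=Value tokens as quoted in the messages. The helper names parse_kv and memory_mismatch are hypothetical, and the excerpts shown here do not confirm that this mismatch is what keeps the job pending.

```python
def parse_kv(text):
    """Parse whitespace-separated Key=Value tokens into a dict of strings."""
    fields = {}
    for token in text.split():
        key, sep, value = token.partition("=")
        if sep:  # skip tokens that contain no '='
            fields[key] = value
    return fields


def memory_mismatch(configured, detected):
    """Return True when the two RealMemory values (in MB) differ."""
    return int(configured["RealMemory"]) != int(detected["RealMemory"])


# Values copied from the messages in this thread.
configured = parse_kv("RealMemory=1 AllocMem=0 FreeMem=47563 Sockets=2 Boards=1")
detected = parse_kv("ThreadsPerCore=1 RealMemory=128306 UpTime=22-16:14:48")

if memory_mismatch(configured, detected):
    print("RealMemory differs:",
          configured["RealMemory"], "vs", detected["RealMemory"])
```

In practice the two strings would come from the node record and from slurmd -C on the node itself; they are hard-coded here only to keep the sketch self-contained.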

Re: [slurm-users] job is pending but resources are available

2021-10-12 Thread Adam Xu
On 2021/10/12 21:21, Adam Xu wrote: Hi All, OS: Rocky Linux 8.4 Slurm version: 20.11.7 The partition's name is apollo. The node's name is apollo too. The node has 36 CPU cores and 8 GPUs in it. Partition info: $ scontrol show partition apollo PartitionName=apollo    Allow

[slurm-users] job is pending but resources are available

2021-10-12 Thread Adam Xu
remaining resources are sufficient. But why is the job pending with reason "Resources"? -- Adam Xu