在 10/13/21 16:30, Ole Holm Nielsen 写道:
On 10/13/21 9:59 AM, Adam Xu wrote:
在 2021/10/13 9:22, Brian Andrus 写道:
Something is very odd when you have the node reporting:
RealMemory=1 AllocMem=0 FreeMem=47563 Sockets=2 Boards=1
What do you get when you run ‘slurmd -C’ on the node?
# slurmd
ThreadsPerCore=1 RealMemory=128306
UpTime=22-16:14:48
Brian Andrus
*From: *Adam Xu <mailto:adam...@adagene.com.cn>
*Sent: *Tuesday, October 12, 2021 6:07 PM
*To: *slurm-users@lists.schedmd.com
*Subject: *Re: [slurm-users] job is pending but resources are available
在 2021/10/12 21:21, Adam
在 2021/10/12 21:21, Adam Xu 写道:
Hi All,
OS: Rocky Linux 8.4
slurm version: 20.11.7
the partition's name is apollo. the node's name is apollo too. the
node has 36 cpu cores and 8GPUs in it.
partition info
$ scontrol show partition apollo
PartitionName=apollo
Allow
remaining resources are sufficient. but why the job is
pending with reason "Resources"?
--
Adam Xu