You are getting close :)
You can see why n010 is able to have multiple jobs. It shows more
resources available.
What are the specific requests for resources from a job?
Nodes, Cores, Memory, threads, etc?
Brian Andrus
On 1/28/2021 12:52 PM, Chandler wrote:
OK I'm getting this same output on nodes n[011-013]:
# slurmd -C
NodeName=n011 slurmd: error: FastSchedule will be removed in 20.02, as
will the FastSchedule=0 functionality. Please consider removing this
from your configuration now.
slurmd: Considering each NUMA node as a socket
slurmd: error: Thread count (255) not multiple of core count (128)
CPUs=255 Boards=1 SocketsPerBoard=8 CoresPerSocket=16 ThreadsPerCore=1
RealMemory=1031878
UpTime=86-20:59:54
#
but on n010 it looks like:
# slurmd -C
NodeName=n010 CPUs=256 Boards=1 SocketsPerBoard=2 CoresPerSocket=64
ThreadsPerCore=2 RealMemory=1031887
UpTime=20-00:01:31
#