From: slurm-users On Behalf Of Sean
Crosby
Sent: Thursday, April 8, 2021 10:18 AM
To: Slurm User Community List
Subject: Re: [slurm-users] [EXT] slurmctld error
The reason why your nodes are drained is "Low RealMemory"
This reason is because you have told Slurm about the RAM on the node, but it is
les
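A minimal sketch of how a "Low RealMemory" mismatch like the one quoted above is usually resolved, assuming the cause is a RealMemory value in slurm.conf that is larger than what slurmd detects on the node. The node name wn029 and the CPU count of 2 are read off the sinfo output quoted elsewhere in this thread; the value 3900 is purely hypothetical and would have to come from slurmd -C on the real node.

```
# On an affected node, print the hardware that slurmd actually detects:
#   slurmd -C
#
# slurm.conf (fragment): RealMemory is given in MB and should not be larger
# than the RealMemory value reported by slurmd -C on that node.
NodeName=wn029 CPUs=2 RealMemory=3900   # 3900 is a hypothetical value

# After correcting slurm.conf and restarting the daemons, the drained node
# can be returned to service with:
#   scontrol update NodeName=wn029 State=RESUME
```

Setting RealMemory slightly below the detected value leaves some headroom if the reported memory changes after a kernel or firmware update.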
tuc 127.0.0.1 6817 8704 1
=n/s
Reason=Low RealMemory [root@2021-04-
From: slurm-users On Behalf Of Sean
Crosby
Sent: Tuesday, April 6, 2021 2:11 PM
To: Slurm User Community List
Subject: Re: [slurm-users] [EXT] slurmctld error
I just checked my cluster and my spool dir is
SlurmdSpoolDir=/var/spool/s
drained 0/0/2/2 3934 TUC* up
wn029 drained 0/0/2/2 3934 TUC* up
wn030 drained 0/0/2/2 3934 TUC* up
wn031 drained 0/0/2/2 3934 TUC* up
wn032 drained 0/0/2/2 3934 TUC* up
wn033 d
wn035 drained 0/0/2/2 3934 TUC* up
wn036 drained 0/0/2/2 3934 TUC* up
wn037 drained 0/0/2/2 3934 TUC* up
wn038 drained 0/0/2/2 3934 TUC* up
wn039 drained 0/0/2/2 3934 TUC* up
wn040 drained 0/0/2/2 3934 TUC* up
wn041 drained 0/0/2/2 3934 TUC* up
wn042 drained 0/0/2/2 3934 TUC* up
wn043 drained 0/0/2/2 3934 TUC* up
wn044 drained 0/0/
From: slurm-users On Behalf Of Sean
Crosby
Sent: Tuesday, April 6, 2021 12:47 PM
To: Slurm User Community List
Subject: Re: [slurm-users] [EXT] slurmctld error
It looks like your attachment of sinfo -R didn't come through
It also looks like your dbd isn't set up correctly
Can you also show the output of
sacctmgr list cluster
and
scontrol show config | grep ClusterName
Sea
slurmdbd: PERSIST_RC is -1 from DBD_FLUSH_JOBS(1408): (null)
[2021-04-06T12:10:35.702] debug: backfill: beginning
[2021-04-06T12:10:35.702] debug: backfill: no jobs to backfill
[2021-04-06T12:10:37.001] debug: slurmdbd: PERSIST_RC is -1 from DBD_FLUSH_JOBS(1
From: slurm-users <slurm-users-boun...@lists.schedmd.com> On Behalf Of Sean Crosby
Sent: Tuesday, April 6, 2021 12:49 AM
To: Slurm User Community List <slurm-users@lists.schedmd.com>
Subject: Re: [slurm-user
Hi Ioannis,
On 06-04-2021 07:56, Ioannis Botsis wrote:
> slurmctld is active and running but on system reboot doesn’t start
> automatically... I have to start it manually
Maybe you will find my Slurm Wiki pages of use for setting up your Slurm
system: https://wiki.fysik.dtu.dk/niflheim/SLURM
Fo
Hi Sean,
slurmctld is active and running, but it doesn’t start automatically on system
reboot; I have to start it manually.
jb
From: slurm-users On Behalf Of Sean
Crosby
Sent: Tuesday, April 6, 2021 7:54 AM
To: Slurm User Community List
Subject: Re: [slurm-users] [EXT] slurmctld error
I changed DbdAddr and DbdHost to localhost and now slurmctld is active and
running.
Thanks
jb
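For reference, the change reported above would look roughly like this in slurmdbd.conf. This is only a sketch of what the poster describes, assuming slurmdbd and slurmctld run on the same machine (the thread states that se01 hosts both); on a setup with separate hosts, localhost would not be appropriate.

```
# slurmdbd.conf (fragment): values as reported in the message above
DbdHost=localhost   # the slurmdbd documentation describes this as the name of the
                    # machine running slurmdbd, so the real hostname may be preferable
DbdAddr=localhost   # address slurmctld uses to reach slurmdbd
```

slurmdbd has to be restarted for changes to slurmdbd.conf to take effect.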
From: slurm-users On Behalf Of Sean
Crosby
Sent: Tuesday, April 6, 2021 7:54 AM
To: Slurm User Community List
Subject: Re: [slurm-users] [EXT] slurmctld error
The other thing I
se01.grid.tuc.gr systemd[1]: Starting Slurm DBD accounting daemon...
Apr 05 13:52:35 se01.grid.tuc.gr systemd[1]: slurmdbd.service: Can't open PID file /run/slurmdbd.pid (yet?) after start: Operation not permitted
Apr 05 13:52:35 se01.grid.tuc.gr
On Tue, 6 Apr 2021 at 05:00, wrote:
> Hi Sean,
>
>
>
> 10.0.0.100 is the dbd and ctld host with name se01. Firewall is inactive…
systemd[1]: Started Slurm DBD accounting daemon.
The file /run/slurmdbd.pid exists and contains the same PID that pidof slurmdbd returns.
From: slurm-users On Behalf Of Sean
Crosby
Sent: Tuesday, April 6, 2021 12:49 AM
To: Slurm User Community List
Subject: Re: [slurm-users] [EXT] slurmctld error
What's the
Connection not working
>
> give me back ….. Connection not working
>
> jb
>
> *From:* slurm-users *On Behalf Of
> *Sean Crosby
> *Sent:* Monday, April 5, 2021 2:52 PM
> *To:* Slurm User Community List
> *Subject:* Re: [slurm-us
To: Slurm User Community List
Subject: Re: [slurm-users] [EXT] slurmctld error
The error shows
slurmctld: debug2: Error connecting slurm stream socket at 10.0.0.100:6819: Connection refused
slurmctld: error: slurm_persist_conn_open_without_init: failed to open pers
> Thank you for your prompt response. I made the changes you suggested, but
> slurmctld refuses to run. Please find attached the new slurmctld -D output.
>
> jb
>
> *From:* slurm-users *On Behalf Of
> *Sean Crosby
> *Sent:* Monday, April 5, 2021 11:46 AM
> *To:
Hi Jb,
You have set AccountingStoragePort to 3306 in slurm.conf, which is the
MySQL port running on the DBD host.
AccountingStoragePort is the port for the Slurmdbd service, and not for
MySQL.
Change AccountingStoragePort to 6819 and it should fix your issues.
I also think you should comment ou
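To illustrate the port mix-up described above, here is a hedged sketch of how the two ports are normally split between the two configuration files. The host name se01 is taken from elsewhere in this thread; whether it is the right value for AccountingStorageHost on this cluster is an assumption.

```
# slurm.conf (fragment, read by slurmctld)
AccountingStorageType=accounting_storage/slurmdbd
AccountingStorageHost=se01      # assumption: host running slurmdbd
AccountingStoragePort=6819      # port of the slurmdbd service, not of MySQL

# slurmdbd.conf (fragment, read by slurmdbd)
DbdPort=6819                    # port slurmdbd listens on; should match the value above
StorageType=accounting_storage/mysql
StoragePort=3306                # the MySQL port belongs here
```

In short, slurmctld only ever talks to slurmdbd on 6819, and slurmdbd is the only Slurm daemon that talks to MySQL on 3306.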