On Wed, Dec 13, 2023 at 08:16:39PM +, Jackson, Gary L. wrote:
Hi Gary,
> The SlurmctldHost value is set like the following in my slurm.conf:
>
> SlurmctldHost=host0,host1
>
> That seems to be legal according to the documentation. However, I get error
> messages like the following:
>
> $ sr
I'll apologize because I don't have a complete answer. I'm not sure why
that doesn't work, but my understanding of how it should work for failover
scenarios is a "SlurmctldHost" line for each of the controllers, e.g.:
SlurmctldHost=host1
SlurmctldHost=host2
...
The list format seems to be used i
The SlurmctldHost value is set like the following in my slurm.conf:
SlurmctldHost=host0,host1
That seems to be legal according to the documentation. However, I get error
messages like the following:
$ srun id
srun: error: get_addr_info: getaddrinfo() failed: Name or service not known