On 4/13/23 11:49, Shaghuf Rahman wrote:
I am setting up Slurmdb in my system and I need some inputs

My current setup is like
server1 : 192.168.123.12(slurmctld)
server2: 192.168.123.13(Slurmctld)
server3: 192.168.123.14(Slurmdbd) which is pointing to both Server1 and Server2.
database: MySQL

I have 1 more server named as server 4: 192.168.123.15 which I need to make it as a secondary database server. I want to configure this server4 which will sync the database and make it either Active-Active slurmdbd or Active-Passive.

Could anyone please help me with the *steps* how to configure and also how am i going to *sync* my *database* on both the servers simultaneously.

Slurm administrators have different opinions about the usefulness versus complexity of HA setups. You could read SchedMD's presentation from page 38 and onwards: https://slurm.schedmd.com/SLUG19/Field_Notes_3.pdf

Some noteworthy slides state:

Separating slurmctld and slurmdbd in normal production use
is recommended.
Master/backup slurmctld is common, and - as long as the
performance for StateSaveLocation is kept high - not that
difficult to implement.

For slurmdbd, the critical element in the failure domain is
MySQL, not slurmdbd. slurmdbd itself is stateless.

IMNSHO, the additional complexity of a redundant MySQL
deployment is more likely to cause an outage than it is to
prevent one.
So don’t bother setting up a redundant slurmdbd, keep
slurmdbd + MySQL local to a single server.

I hope this helps.

/Ole

Reply via email to