On 2/2/21 3:40 PM, Benson Muite wrote:
On 2/2/21 3:30 PM, Zainul Abiddin wrote:
Hi All,
I am new to Slurm and trying to setup Slurm20.11.2 on Centos 7
My environment is Master node (smaster) + compute Node (snode)
and i am using
https://www.slothparadise.com/how-to-install-slurm-on-centos-7-cluster/ <https://www.slothparadise.com/how-to-install-slurm-on-centos-7-cluster/>
link to setup Slurm on Master and compute nodes.
I have tried installing Munge on both the nodes and it's running fine.
However when i try to run the Munge command from Master to Node its
asking password.
export MUNGEUSER=1001
groupadd -g $MUNGEUSER munge
useradd -m -c "MUNGE Uid 'N' Gid Emporium" -d /var/lib/munge -u
$MUNGEUSER -g munge -s /sbin/nologin munge
export SlurmUSER=1002
groupadd -g $SlurmUSER slurm
useradd -m -c "Slurm workload manager" -d /var/lib/slurm -u
$SlurmUSER -g slurm -s /bin/bash slurm
yum install -y epel-release
yum install munge munge-libs munge-devel -y
yum install rng-tools -y
rngd -r /dev/urandom
/usr/sbin/create-munge-key -r
dd if=/dev/urandom bs=1 count=1024 > /etc/munge/munge.key
chown munge: /etc/munge/munge.key
chmod 400 /etc/munge/munge.key
scp /etc/munge/munge.key root@snode:/etc/munge
chown munge: /etc/munge/munge.key
chmod 400 /etc/munge/munge.key
chown -R munge: /etc/munge/ /var/log/munge/
chmod 0700 /etc/munge/ /var/log/munge/
systemctl enable munge
systemctl start munge
systemctl status munge
[root@smaster ~]# systemctl status munge
? munge.service - MUNGE authentication service
Loaded: loaded (/usr/lib/systemd/system/munge.service; enabled;
vendor preset: disabled)
Active: active (running) since Mon 2021-02-01 12:52:54 IST; 1h
4min ago
Docs: man:munged(8)
Process: 2547 ExecStart=/usr/sbin/munged (code=exited,
status=0/SUCCESS)
Main PID: 2550 (munged)
Tasks: 4
CGroup: /system.slice/munge.service
+-2550 /usr/sbin/munged
Feb 01 12:52:54 smaster.calligotech.com
<http://smaster.calligotech.com> systemd[1]: Starting MUNGE
authentication service...
Feb 01 12:52:54 smaster.calligotech.com
<http://smaster.calligotech.com> systemd[1]: Started MUNGE
authentication service.
[root@smaster ~]# munge -n
MUNGE:AwQDAAAg5PQzQhz/D4h7OGUU4Cx4QAgZ4z/0MMt0SP+uhuP927Xcl2t8EC4izsUj6xpMRslnIb2g4RCz2vayu0wW1o8mNNuy7cVv/PmsuO9XsAJ7aLl1n/M=:
[root@smaster ~]#
Below is the screenshot for reference.
Smaster:
image.png
Snode:
image.png
Am I configuring properly or Do I need to set up passwordless
authentication on Master to Node and vice-versa?
Please clarify to me, whether Mugne will do passwordless login else
do we need to setup passwordless.
Please guide me with a proper setup link/Doc which includes Munge
Configuration, Slurm account database Daemon configuration and Slurm
installation and configuration with testing simple jobs on Master and
Compute Nodes.
--
*Regards*
*Zain*
Are you able to do passwordless ssh between the nodes?
May also find the following helpful:
https://github.com/dun/munge/wiki/Installation-Guide
https://southgreenplatform.github.io/trainings/hpc/slurminstallation/
See also
https://wiki.fysik.dtu.dk/niflheim/Slurm_installation