At least for Grid Engine/OpenMPI the preferred mechanism ("tight integration") involves the shepherds running on each exec hosts to start MPI, without any SSH/RSH required at all. I'm not sure if you've run across this documentation, but it might help to figure out what's going on:
https://slurm.schedmd.com/mpi_guide.html#intel_mpi I'm guessing you're using the "srun" method right now. Skylar On Thu, Apr 5, 2018 at 8:10 AM, Faraz Hussain <i...@feacluster.com> wrote: > Here's something quite baffling. I have a cluster running slurm but have > not setup passwordless ssh for a user yet. So when the user runs "mpirun -n > 2 -hostfile hosts hostname", it will hang because of ssh issue. That is > expected. > > Now the baffling thing is the mpirun command works inside a slurm script! > How can it work if passwordless ssh has not been configured? Does slurm use > some different authentication (munge?) to login to the hosts and execute > the hostname command? > > Or does slurm have some fancy behind the scenes integration with Intel mpi > ? > > _______________________________________________ > Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing > To change your subscription (digest mode or unsubscribe) visit > http://www.beowulf.org/mailman/listinfo/beowulf >
_______________________________________________ Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf