[slurm-users] Re: [EXTERNAL] Re: [EXTERN] Re: Slurm 24.05 and OpenMPI

2025-03-27 Thread Pritchard Jr., Howard via slurm-users
ith though. Howard From: Davide DelVento Date: Thursday, March 27, 2025 at 9:00 AM To: "Pritchard Jr., Howard" Cc: Matthias Leopold, Slurm User Community List Subject: Re: [EXTERNAL] [slurm-users] Re: [EXTERN] Re: Slurm 24.05 and OpenMPI

[slurm-users] Re: [EXTERNAL] Re: [EXTERN] Re: Slurm 24.05 and OpenMPI

2025-03-27 Thread Pritchard Jr., Howard via slurm-users
h OpenMPI outside of containers (with ompi examples or python mpi4py). Matthias On 27.03.25 at 15:46, Pritchard Jr., Howard wrote: > Hi Matthias, > > It looks like the Open MPI in the containers was not built with PMI1 or > PMI2 support, so it's defaulting to using PMIx. >

[slurm-users] Re: [EXTERNAL] Re: [EXTERN] Re: Slurm 24.05 and OpenMPI

2025-03-27 Thread Pritchard Jr., Howard via slurm-users
Hi Matthias, It looks like the Open MPI in the containers was not built with PMI1 or PMI2 support, so it's defaulting to using PMIx. You are seeing this error message because the call within Open MPI 4.1.x’s runtime system to PMIx_Init returned an error, namely that there was no PMIx server to co

Re: [slurm-users] [EXTERNAL] Re: Question about PMIX ERROR messages being emitted by some child of srun process

2023-05-23 Thread Pritchard Jr., Howard
Thanks Christopher, This doesn't seem to be related to Open MPI at all, except that for our 5.0.0 and newer releases one has to use PMIx to talk to the job launcher. I built MPICH 4.1 on Perlmutter using the --with-pmix option and see a similar message from srun --mpi=pmix hpp@nid008589:~/ompi/examples>

[slurm-users] Question about PMIX ERROR messages being emitted by some child of srun process

2023-05-19 Thread Pritchard Jr., Howard
Hi, I’m testing the use of an Open MPI 5.0.0 pre-release with the Slurm/PMIx setup currently on the NERSC Perlmutter system. First off, if I use the PRRTE launch system, I don’t see the issue I’m raising here. But many NERSC users prefer to use the srun “native” launch method with applications co

Re: [slurm-users] [EXTERNAL] OpenMPI and Slurm clarification?

2023-03-27 Thread Pritchard Jr., Howard
en I'm gonna have to figure out a way to determine what that was/is. On 3/27/23 15:28, Pritchard Jr., Howard wrote: Hi Craig, Your use of --with-pmix on the Open MPI configure line is important. Without any arguments to this configure option, Open MPI's configure will first check if there’s

Re: [slurm-users] [EXTERNAL] OpenMPI and Slurm clarification?

2023-03-27 Thread Pritchard Jr., Howard
'm not sure that tells me much about how I am supposed to be building OpenMPI? On 3/27/23 14:41, Pritchard Jr., Howard wrote: Hi Craig, If you run srun --mpi=list, what does Slurm report? That will help in determining what argument you want to supply for the --mpi srun option. Howard Fr

Re: [slurm-users] [EXTERNAL] OpenMPI and Slurm clarification?

2023-03-27 Thread Pritchard Jr., Howard
Hi Craig, If you run srun --mpi=list, what does Slurm report? That will help in determining what argument you want to supply for the --mpi srun option. Howard From: slurm-users on behalf of Craig Reply-To: Slurm User Community List Date: Monday, March 27, 2023 at 12:38 PM To: "slurm-users@

Re: [slurm-users] [EXTERNAL] --no-alloc breaks mpi?

2021-03-08 Thread Pritchard Jr., Howard
Hi Chris, What’s happening, I suspect (this is speculation, since I don’t have permission to use --no-alloc), is that SLURM_JOBID is not set but SLURM_NODELIST may be, which is confusing ORTE. Could you list which SLURM environment variables are set in the shell in which you're running the srun command? Howard From
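One way to collect the information requested above is to dump every SLURM_* variable visible in the shell. The following is a minimal sketch, not something from the thread; the helper name slurm_env is hypothetical, and it only reads the environment of the process that runs it.

```python
import os


def slurm_env(environ=None):
    """Return the SLURM_* variables from an environment mapping, sorted by name.

    `slurm_env` is a hypothetical helper for illustration; by default it
    inspects the environment of the current process.
    """
    env = os.environ if environ is None else environ
    return {name: env[name] for name in sorted(env) if name.startswith("SLURM_")}


if __name__ == "__main__":
    # Print one NAME=value pair per line, e.g. to paste into a reply.
    for name, value in slurm_env().items():
        print(f"{name}={value}")
```

Run it in the same shell that issues the srun command, so that it sees the same environment.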

Re: [slurm-users] [EXTERNAL] problems with OpenMPI 4.0.3

2020-06-01 Thread Pritchard Jr., Howard
Hello Angelines, Do you know how the Open MPI 4.0.3 package was configured and built? That information would be useful to help diagnose the problem. Thanks, Howard From: slurm-users on behalf of "Alberto Morillas, Angelines" Reply-To: Slurm User Community List Date: Friday, May 29, 2020

Re: [slurm-users] [EXTERNAL] problems with OpenMPI 4.0.3

2020-06-01 Thread Pritchard Jr., Howard
2. Re: [EXTERNAL] problems with OpenMPI 4.0.3 (Pritchard Jr., Howard) 3. Re: Slurm Job Count Credit system (Songpon Srisawai) -- Message: 1 Date: Mon, 1 Jun

[slurm-users] salloc --no-shell question

2019-01-24 Thread Pritchard Jr., Howard
Hello Slurm experts, We have a workflow in which a script invokes salloc --no-shell and then launches a series of MPI jobs using srun with the --jobid= option to make use of the reservation we got from the salloc invocation. We need to do things this way because the script itself ne
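The workflow described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the poster's script: it assumes salloc reports the allocation with a line of the form "salloc: Granted job allocation <id>", and the helper names parse_job_id and srun_command, the task count, and the program path are all hypothetical.

```python
import re

# Assumption: salloc announces the allocation with a line such as
# "salloc: Granted job allocation 12345".
GRANTED = re.compile(r"Granted job allocation (\d+)")


def parse_job_id(salloc_output):
    """Extract the job ID from captured salloc output, or None if absent."""
    match = GRANTED.search(salloc_output)
    return match.group(1) if match else None


def srun_command(job_id, ntasks, program):
    """Build an srun argument list that runs `program` inside an existing allocation."""
    return ["srun", f"--jobid={job_id}", f"--ntasks={ntasks}", *program]


if __name__ == "__main__":
    # Hypothetical captured output; a real script would capture salloc's stderr.
    job_id = parse_job_id("salloc: Granted job allocation 12345\n")
    print(srun_command(job_id, 4, ["./mpi_app"]))
```

The argument list can be passed to subprocess.run once for each MPI job in the series, all reusing the same job ID.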

Re: [slurm-users] Heterogeneous job one MPI_COMM_WORLD

2018-10-10 Thread Pritchard Jr., Howard
Hi Christopher, We hit some problems at LANL trying to use this Slurm feature. At the time, I think SchedMD said there would need to be fixes to the Slurm PMI2 library to get this to work. What version of Slurm are you using? Howard -- Howard Pritchard B Schedule HPC-ENV Office 9, 2nd floor