> What's the best way to troubleshoot this when orted fails but doesn't give
> any sort of error to indicate what the root cause of the failure might be?
> And I also can't predictably induce the failure, just have to wait until it
> randomly chokes.
You can try increasing the Open MPI verbos
;> On Jan 7, 2025, at 13:26, Ole Holm Nielsen via slurm-users
>>> wrote:
>>>
>>> Hi Jeffrey,
>>>
>>> Thanks a lot, I'd like to try out snodelist. Not knowing much about CMake,
>>> I couldn't build the tool :-( I've opened an
st/issues/1
> Can you help me out?
>
> On 07-01-2025 16:27, Jeffrey Frey via slurm-users wrote:
>> We use a tool that's compiled against the Slurm library itself so that the
>> expansion/contraction of lists is always 100% in sync with Slurm itself:
>> https://git
We use a tool that's compiled against the Slurm library itself so that the
expansion/contraction of lists is always 100% in sync with Slurm itself:
https://github.com/jtfrey/snodelist
> On Jan 7, 2025, at 10:12, Davide DelVento via slurm-users
> wrote:
>
> Wonderful. Thanks Ole for the re