Hi,
We set up a slurm system with email notification, this is the slurm.conf
`MailProg=/usr/sbin/sendmail`
But the email that I get has not status, just an empty message:
[image: image.png]
no subject, no info, what are we missing?
Thanks~
--
slurm-users mailing list -- slurm-users@lists.schedm
r whatever you’re using), but
> that’ll potentially have serious side effects for others’ jobs.
>
>
>
> So back to the original question: why **not** pack 20 jobs onto fewer
> nodes if those nodes have the capacity to run the full set of jobs? You
> shouldn’t have a constraint w
not** pack 20 jobs onto fewer
> nodes if those nodes have the capacity to run the full set of jobs? You
> shouldn’t have a constraint with memory or CPUs. Are you trying to spread
> out an I/O load somehow? Networking?
>
>
>
> *From: *Oren via slurm-users
> *Date: *Tuesday, Decemb
jobs.
>
>
>
> So back to the original question: why **not** pack 20 jobs onto fewer
> nodes if those nodes have the capacity to run the full set of jobs? You
> shouldn’t have a constraint with memory or CPUs. Are you trying to spread
> out an I/O load somehow? Networking?
>
Hi,
I have a cluster of 20-nodes, and I want to run a jobarray on that cluster,
but I want each node to get one job per node.
When I do the following:
#!/bin/bash
#SBATCH --job-name=process_images_train# Job name
#SBATCH --time=50:00:00 # Time limit hrs:min:sec
#SBATCH --tasks=1
t;
slurm-users@lists.schedmd.com> wrote:
> Hello Oren,
>
> On 8/5/24 3:20 PM, Oren via slurm-users wrote:
> > When I am running this command:
> > `salloc --nodelist=gpu03 -p A4500_Features --gres=gpu:1`
> > and then automatically ssh to the job, what should I see when I run
>
Hello,
When I am running this command:
`salloc --nodelist=gpu03 -p A4500_Features --gres=gpu:1`
and then automatically ssh to the job, what should I see when I run
nvidia-smi? All the GPUs in the host or just a single one?
Thanks
--
slurm-users mailing list -- slurm-users@lists.schedmd.com
To un