Re: [slurm-users] introduce short delay starting multiple parallel jobs with srun

2017-11-10 Thread Yakupov, Renat /DZNE
Community List Subject: Re: [slurm-users] introduce short delay starting multiple parallel jobs with srun Renat, Not to toot our own horn too much, but TACC develops a tool designed for launching lots of individual serial tasks on a parallel system and distributing them well inside a single scheduler

Re: [slurm-users] introduce short delay starting multiple parallel jobs with srun

2017-11-10 Thread Bill Barth
introduce short delay starting multiple parallel jobs with srun Hi Renat, On Fri, Nov 10, 2017 at 10:03:37AM +0100, Yakupov, Renat /DZNE wrote: > slurm 2.5.0! seeing today's announcement about a double digit version > release, that is... ancient! accor

Re: [slurm-users] introduce short delay starting multiple parallel jobs with srun

2017-11-10 Thread Yakupov, Renat /DZNE
] On Behalf Of Gennaro Oliva [oliv...@na.icar.cnr.it] Sent: Friday, November 10, 2017 10:51 AM To: Slurm User Community List Subject: Re: [slurm-users] introduce short delay starting multiple parallel jobs with srun Hi Renat, On Fri, Nov 10, 2017 at 10:03:37AM +0100, Yakupov, Renat /DZNE wrote

Re: [slurm-users] introduce short delay starting multiple parallel jobs with srun

2017-11-10 Thread Gennaro Oliva
Hi Renat, On Fri, Nov 10, 2017 at 10:03:37AM +0100, Yakupov, Renat /DZNE wrote: > slurm 2.5.0! seeing today's announcement about a double digit version > release, that is... ancient! according to the NEWS file, native jobarray support was introduced in version 2.6.0pre1 ... bad luck. You can sti

Re: [slurm-users] introduce short delay starting multiple parallel jobs with srun

2017-11-10 Thread Yakupov, Renat /DZNE
[slurm-users-boun...@lists.schedmd.com] On Behalf Of John Hearns [hear...@gmail.com] Sent: Friday, November 10, 2017 10:11 AM To: Slurm User Community List Subject: Re: [slurm-users] introduce short delay starting multiple parallel jobs with srun Renat, I understand better here. It does look like

Re: [slurm-users] introduce short delay starting multiple parallel jobs with srun

2017-11-10 Thread Gennaro Oliva
Hi Renat, On Fri, Nov 10, 2017 at 09:44:06AM +0100, Yakupov, Renat /DZNE wrote: > How can I find that out? Dont see anything like that in slurm.conf... srun -V Regards, -- Gennaro Oliva

Re: [slurm-users] introduce short delay starting multiple parallel jobs with srun

2017-11-10 Thread John Hearns
Behalf Of > Loris Bennett [loris.benn...@fu-berlin.de] > Sent: Friday, November 10, 2017 10:00 AM > To: Slurm User Community List > Subject: Re: [slurm-users] introduce short delay starting multiple > parallel jobs with srun > > "Yakupov, Renat /DZNE" writes: >

Re: [slurm-users] introduce short delay starting multiple parallel jobs with srun

2017-11-10 Thread Yakupov, Renat /DZNE
er 10, 2017 10:00 AM To: Slurm User Community List Subject: Re: [slurm-users] introduce short delay starting multiple parallel jobs with srun "Yakupov, Renat /DZNE" writes: > How can I find that out? Dont see anything like that in slurm.conf... squeue --versio

Re: [slurm-users] introduce short delay starting multiple parallel jobs with srun

2017-11-10 Thread Yakupov, Renat /DZNE
vember 09, 2017 4:39 PM To: Slurm User Community List Subject: Re: [slurm-users] introduce short delay starting multiple parallel jobs with srun Renat, I know that this is not going to be helpful. I can understand that perhaps if you are using NFS storage then 20(*) processes might not

Re: [slurm-users] introduce short delay starting multiple parallel jobs with srun

2017-11-10 Thread Loris Bennett
liva [oliv...@na.icar.cnr.it] > Sent: Thursday, November 09, 2017 3:55 PM > To: Slurm User Community List > Subject: Re: [slurm-users] introduce short delay starting multiple parallel > jobs with srun > > Hi Renat, > > On Thu, Nov 09, 2017 at 03:46:23PM +0100, Yakupov, Rena

Re: [slurm-users] introduce short delay starting multiple parallel jobs with srun

2017-11-10 Thread Yakupov, Renat /DZNE
Community List Subject: Re: [slurm-users] introduce short delay starting multiple parallel jobs with srun Hi Renat, On Thu, Nov 09, 2017 at 03:46:23PM +0100, Yakupov, Renat /DZNE wrote: > I tried that. It doesnt even queue the job with an error: > sbatch: unrecognized option '--array=1-2

Re: [slurm-users] introduce short delay starting multiple parallel jobs with srun

2017-11-09 Thread John Hearns
Renat, I know that this is not going to be helpful. I can understand that perhaps if you are using NFS storage then 20(*) processes might not be able to open files at the same time. I would consider the following: a) looking at your storage. This is why HPC systems have high performance and pa

Re: [slurm-users] introduce short delay starting multiple parallel jobs with srun

2017-11-09 Thread Gennaro Oliva
Hi Renat, On Thu, Nov 09, 2017 at 03:46:23PM +0100, Yakupov, Renat /DZNE wrote: > I tried that. It doesnt even queue the job with an error: > sbatch: unrecognized option '--array=1-24' > sbatch: error: Try help for more information. what version of slurm are you using? Regards -- Gennaro Oliva

Re: [slurm-users] introduce short delay starting multiple parallel jobs with srun

2017-11-09 Thread Yakupov, Renat /DZNE
Of Gennaro Oliva [oliv...@na.icar.cnr.it] Sent: Thursday, November 09, 2017 3:40 PM To: Slurm User Community List Subject: Re: [slurm-users] introduce short delay starting multiple parallel jobs with srun Hi Renat, On Thu, Nov 09, 2017 at 03:09:17PM +0100, Yakupov, Renat /DZNE wrote: > I wo

Re: [slurm-users] introduce short delay starting multiple parallel jobs with srun

2017-11-09 Thread Gennaro Oliva
Hi Renat, On Thu, Nov 09, 2017 at 03:09:17PM +0100, Yakupov, Renat /DZNE wrote: > I would like some suggestions on how to spread out in time the start > of multiple parallel jobs with srun. I would use: sbatch --array=... As far as I know srun doesn't support arrays. > Is there a way to get a

[slurm-users] introduce short delay starting multiple parallel jobs with srun

2017-11-09 Thread Yakupov, Renat /DZNE
Dear SLURM users, I would like some suggestions on how to spread out in time the start of multiple parallel jobs with srun. I have a very basic script which specifies number of nodes and tasks with just one command: srun myjob. The problem is that 10-20 tasks start accessing files at the same t