Re: [slurm-users] Topology configuration questions:

2019-01-17 Thread Nicholas McCollum
odes using the --constraint=whatever flag. Nicholas McCollum Alabama Supercomputer Authority From: "Fulcomer, Samuel" Sent: Thursday, January 17, 2019 5:58 PM To: Slurm User Community List Subject: Re: [slurm-users] Topology configuration questions: We

Re: [slurm-users] "fatal: can't stat gres.conf"

2018-07-23 Thread Nicholas McCollum
You may want to check and make sure your GPUs are in persistance mode. You can enable it through the nvidia-smi utility. Nicholas McCollum Alabama Supercomputer Authority From: Alex Chekholko Sent: Monday, July 23, 2018 6:00 PM To: Slurm User Community List

Re: [slurm-users] Problem launching interactive jobs using srun

2018-03-09 Thread Nicholas McCollum
You may have to use... # systemctl stop firewalld # systemctl start firewalld If you use firewalld. --- Nicholas McCollum - HPC Systems Expert Alabama Supercomputer Authority - CSRA On 03/09/2018 02:45 PM, Andy Georges wrote: Hi all, Cranked up the debug level a bit Job was not star

Re: [slurm-users] How to deal with user running stuff in frontend node?

2018-02-15 Thread Nicholas McCollum
it until I saw this thread. If you'd like a copy of the shell scripts, just send me an e-mail. --- Nicholas McCollum - HPC Systems Expert Alabama Supercomputer Authority - CSRA On 02/15/2018 03:05 PM, Ryan Cox wrote: Manuel, We set up cgroups and also do cputime limits (60 minutes in

Re: [slurm-users] giving smaller jobs higher priority

2017-11-23 Thread Nicholas McCollum
e = "9000" end return slurm.SUCCESS end --- Nicholas McCollum HPC Systems Administrator Alabama Supercomputer Authority On Wed, Nov 22, 2017 at 01:14:16PM -0500, Satrajit Ghosh wrote: > hi sam, > > thanks for that pointer. we already have: > > PriorityFavorSmall=YES

Re: [slurm-users] Graphing job metrics

2017-11-15 Thread Nicholas McCollum
u need more details, I'll be glad to answer your questions. Regards, Carlos On Tue, Nov 14, 2017 at 6:10 PM, Nicholas McCollum mailto:nmccol...@asc.edu>> wrote: All, I went to the SchedMD booth last night and talked with the guys. Tim told me that the Barcelona Supercomputing Center is w

Re: [slurm-users] Graphing job metrics

2017-11-14 Thread Nicholas McCollum
te at the recommendation of some people for performance improvements when querying hundreds of jobs at the same time. If anyone wants a specific time to meet, just e-mail me directly. I will be at the SC17 convention center all week. --- Nicholas McCollum HPC Systems Administrator Al

[slurm-users] Graphing job metrics

2017-11-13 Thread Nicholas McCollum
share it on github. I am also at SC17 if anyone wants to meet up and check it out in person. Thanks! --- Nicholas McCollum HPC Systems Administrator Alabama Supercomputer Authority