[slurm-users] %x in job names

2021-05-28 Thread Bill Barth
wards to stop this error. Best regards, Bill. -- Bill Barth, Ph.D., Director, FutureTechnologies bba...@tacc.utexas.edu| Phone: (512) 232-7069 Office: ROC 1.435| Fax: (512) 475-9445

Re: [slurm-users] Slurm doesn't call mpiexec or mpirun when run through a GUI app

2019-03-22 Thread Bill Barth
h the others who think that the environment inside the script is likely screwed up. Throwing in a printenv and saving that can't hurt. Bill. -- Bill Barth, Ph.D., Director, HPC bba...@tacc.utexas.edu| Phone: (512) 232-7069 Office: ROC 1.435| Fax: (512) 475-9445

Re: [slurm-users] siesta jobs with slurm, an issue

2018-07-22 Thread Bill Barth
All I can suggest is to check that all the paths you have provided SIESTA are correct (the path to the executable is clearly fine b/c SIESTA starts, but can it fine prime.fdf?). Otherwise start with your local support team. Best, Bill. -- Bill Barth, Ph.D., Director, HPC bba...@tacc.

Re: [slurm-users] Slurm overhead

2018-04-24 Thread Bill Barth
> Hi Bill, > In order to shutdown the slurm process on the compute node, is it fine > to kill /usr/sbin/slurm? Or there is a better and safer way for that? > > Regards, > Mahmood > > > > >> On Sun, Apr 22, 2018 at 5:44 PM, Bill Barth wrote: >> Mahmo

Re: [slurm-users] Slurm overhead

2018-04-22 Thread Bill Barth
handle it for them. Maybe you should look into that after you eliminate direct interference from Slurm. Best, Bill. -- Bill Barth, Ph.D., Director, HPC bba...@tacc.utexas.edu| Phone: (512) 232-7069 Office: ROC 1.435| Fax: (512) 475-9445 On 4/22/18, 1:06 AM, "

Re: [slurm-users] ulimit in sbatch script

2018-04-15 Thread Bill Barth
memory that the node has (minus some padding for the OS, etc.). IS UsePAM enabled in your slurm.conf, maybe that’s doing it. Best, Bill. -- Bill Barth, Ph.D., Director, HPC bba...@tacc.utexas.edu| Phone: (512) 232-7069 Office: ROC 1.435| Fax: (512) 475-9445 On 4/15

Re: [slurm-users] ulimit in sbatch script

2018-04-15 Thread Bill Barth
wants (cgroups, perhaps?). Best, Bill. -- Bill Barth, Ph.D., Director, HPC bba...@tacc.utexas.edu| Phone: (512) 232-7069 Office: ROC 1.435| Fax: (512) 475-9445 On 4/15/18, 1:41 PM, "slurm-users on behalf of Mahmood Naderan" wrote: Excuse me... I

Re: [slurm-users] ulimit in sbatch script

2018-04-15 Thread Bill Barth
/pam.d/sshd file has pam_limits.so in it, that’s probably where the unlimited setting for root is coming from. Best, Bill. -- Bill Barth, Ph.D., Director, HPC bba...@tacc.utexas.edu| Phone: (512) 232-7069 Office: ROC 1.435| Fax: (512) 475-9445 On 4/15/18, 1:26 PM

Re: [slurm-users] What's the best way to suppress core dump files from jobs?

2018-03-21 Thread Bill Barth
better forms of these, but they’re working for us. I guess this counts now as being documented in a public place! Best, Bill. -- Bill Barth, Ph.D., Director, HPC bba...@tacc.utexas.edu| Phone: (512) 232-7069 Office: ROC 1.435| Fax: (512) 475-9445 On 3/21/18, 7:49 AM

Re: [slurm-users] What's the best way to suppress core dump files from jobs?

2018-03-21 Thread Bill Barth
, Bill. -- Bill Barth, Ph.D., Director, HPC bba...@tacc.utexas.edu| Phone: (512) 232-7069 Office: ROC 1.435| Fax: (512) 475-9445 On 3/21/18, 6:08 AM, "slurm-users on behalf of Ole Holm Nielsen" wrote: We experience problems with MPI jobs dumping lots

Re: [slurm-users] #SBATCH options as Bash script parameters

2018-03-18 Thread Bill Barth
going to depend on the shebang line (as to what’s being invoked) bash? csh? python? perl? /usr/bin/env X? So, I’d be surprised if there was a mode for this. Also, would you expect Slurm to delete any options it used from your command line or leave them? Best, Bill. -- Bill Barth, Ph.D., Director

Re: [slurm-users] Automatically setting OMP_NUM_THREADS=SLURM_CPUS_PER_TASK?

2018-03-06 Thread Bill Barth
We do the same at TACC in our base module (which happens to be called “TACC”), and then we document it. Best, Bill. -- Bill Barth, Ph.D., Director, HPC bba...@tacc.utexas.edu| Phone: (512) 232-7069 Office: ROC 1.435| Fax: (512) 475-9445 On 3/6/18, 5:13 PM, "

Re: [slurm-users] Over-riding array limits

2018-02-24 Thread Bill Barth
ThatParameter=100’ or whatever you like to change it. -- Bill Barth, Ph.D., Director, HPC bba...@tacc.utexas.edu| Phone: (512) 232-7069 Office: ROC 1.435| Fax: (512) 475-9445 On 2/23/18, 11:13 PM, "slurm-users on behalf of ~Stack~" wrote: Greetings,

Re: [slurm-users] How to deal with user running stuff in frontend node?

2018-02-15 Thread Bill Barth
We kick them off and lock them out until they respond. Disconnections are common enough that it doesn’t always get their attention. Inability to log back in always does. Best, Bill. Sent from my phone. > On Feb 15, 2018, at 9:25 AM, Patrick Goetz wrote: > > The simple solution is to tell p

Re: [slurm-users] Single user consuming all resources of the cluster

2018-02-07 Thread Bill Barth
e probably other ways to do this, but the infrastructure is now historical and set in some stone. Best, Bill. -- Bill Barth, Ph.D., Director, HPC bba...@tacc.utexas.edu| Phone: (512) 232-7069 Office: ROC 1.435| Fax: (512) 475-9445 On 2/7/18, 12:28 AM, "slu

Re: [slurm-users] Single user consuming all resources of the cluster

2018-02-06 Thread Bill Barth
file with job records which our local accounting system consumes to decrement allocation balances, if you care to know). Best, Bill. -- Bill Barth, Ph.D., Director, HPC bba...@tacc.utexas.edu| Phone: (512) 232-7069 Office: ROC 1.435| Fax: (512) 475-9445 On 2/6/18

Re: [slurm-users] Slurm and available libraries

2018-01-17 Thread Bill Barth
use Lmod to make it available and visible to our users. There are more strategies for this than you can imagine, so settle on a few and keep it simple for you! Best, Bill. -- Bill Barth, Ph.D., Director, HPC bba...@tacc.utexas.edu| Phone: (512) 232-7069 Office: ROC 1.435

Re: [slurm-users] lmod and slurm

2017-12-20 Thread Bill Barth
://sourceforge.net/p/lmod/mailman/) which is very active and monitored by the author and a very knowledgeable community. Best, Bill. -- Bill Barth, Ph.D., Director, HPC bba...@tacc.utexas.edu| Phone: (512) 232-7069 Office: ROC 1.435| Fax: (512) 475-9445 On 12/19/17, 8:43 AM

Re: [slurm-users] introduce short delay starting multiple parallel jobs with srun

2017-11-10 Thread Bill Barth
install is much more recent and does support them) for internal reasons, so we provide the Launcher for folks who have similar needs to you. Best, Bill. -- Bill Barth, Ph.D., Director, HPC bba...@tacc.utexas.edu| Phone: (512) 232-7069 Office: ROC 1.435| Fax: (512) 475