Re: [slurm-users] Slurm Install on Remote System

2019-05-25 Thread Raymond Wan
On 25/5/2019 7:37 PM, John Hearns wrote: > > On Sat, 25 May 2019 at 11:38, Priya Mishra > > <26priy...@gmail.com > wrote: > > > > I am trying to install slurm on a remote system for > > which I don't have root access. > > After following the instructions

Re: [slurm-users] Kinda Off-Topic: data management for Slurm clusters

2019-02-26 Thread Raymond Wan
Hi Janne, On Tue, Feb 26, 2019 at 3:56 PM Janne Blomqvist wrote: > When reaping, it searches for these special .datasync directories (up to > a configurable recursion depth, say 2 by default), and based on the > LAST_SYNCED timestamps, deletes entire datasets starting with the oldest > LAST_SYNC

Re: [slurm-users] Kinda Off-Topic: data management for Slurm clusters

2019-02-22 Thread Raymond Wan
Hi Will, On 23/2/2019 1:50 AM, Will Dennis wrote: For one of my groups, on the GPU servers in their cluster, I have provided a RAID-0 md array of multi-TB SSDs (for I/O speed) mounted on a given path ("/mnt/local" for historical reasons) that they can use for local scratch space. Their othe

Re: [slurm-users] Kinda Off-Topic: data management for Slurm clusters

2019-02-22 Thread Raymond Wan
Hi Will, On 23/2/2019 12:50 AM, Will Dennis wrote: ... would be considered “scratch space”, not for long-term data storage, but for use over the lifetime of a job, or maybe perhaps a few sequential jobs (given the nature of the work.) “Permanent” storage would remain the existing NFS serve

Re: [slurm-users] Help With Slurm 18.08 Installation on Ubuntu Server 18.04

2018-12-16 Thread Raymond Wan
On Sun, Dec 16, 2018 at 4:30 AM Kenneth Russell wrote: > For several days, I have been poring through on-line sites to learn the > specifics of what files need to be in what directories, etc. Progress > has been very slow. > > 1. Does anyone know of a detailed procedure for installing Slurm 18.

Re: [slurm-users] possible to set memory slack space before killing jobs?

2018-12-09 Thread Raymond Wan
Hi, On 7/12/2018 6:23 PM, Bjørn-Helge Mevik wrote: Raymond Wan writes: However, a more general question... I thought there is no fool-proof way to watch the amount of memory a job is using. What if within the script they ran another program using "nohup", for example. Wouldn

Re: [slurm-users] possible to set memory slack space before killing jobs?

2018-12-06 Thread Raymond Wan
On Thu, Dec 6, 2018 at 10:01 PM Eli V wrote: > On Thu, Dec 6, 2018 at 2:08 AM Loris Bennett > wrote: > > > Anyone have some thoughts/ideas about this? Seems like it should be > > > relatively straightforward to implement, though of course using it > > > effectively will require some tuning. > >

Re: [slurm-users] swap size

2018-09-23 Thread Raymond Wan
Hi Chris, On Mon, Sep 24, 2018 at 7:36 AM Christopher Samuel wrote: > On 24/09/18 00:46, Raymond Wan wrote: > > > Hmm, I'm way out of my comfort zone but I am curious about what > > happens. Unfortunately, I don't think I'm able to read kernel code

Re: [slurm-users] swap size

2018-09-23 Thread Raymond Wan
Hi Chris, On Sunday, September 23, 2018 09:34 AM, Chris Samuel wrote: On Saturday, 22 September 2018 4:19:09 PM AEST Raymond Wan wrote: SLURM's ability to suspend jobs must be storing the state in a location outside of this 512 GB. So, you're not helping this by allocating more

Re: [slurm-users] swap size

2018-09-21 Thread Raymond Wan
Hi Ashton, On Sat, Sep 22, 2018 at 5:34 AM A wrote: > So I'm wondering if 20% is enough, or whether it should scale by the number > of single jobs I might be running at any one time. E.g. if I'm running 10 > jobs that all use 20 gb of ram, and I suspend, should I need 200 gb of swap? Perhaps

Re: [slurm-users] ubuntu 16.04 > 18.04

2018-09-13 Thread Raymond Wan
Hi Ashton, On Thu, Sep 13, 2018 at 1:46 AM A wrote: > Thinking about upgrading to Ubuntu 18.04 on my workstation, where I am > running a single node slurm setup. Any issues any one has run across in the > update? I had no issues with SLURM per se... But upgrading from Ubuntu 16.04 to 18.04 c

Re: [slurm-users] Configuration issue on Ubuntu

2018-08-28 Thread Raymond Wan
Hi, On Tuesday, August 28, 2018 09:43 PM, Umut Arus wrote: # COMPUTE NODES NodeName=umuta CPUs=1 State=UNKNOWN I'm not sure what's the cause of your problem, but one thing I noticed is that the line above should be replaced with the output of the first line of "slurmd -C". The

Re: [slurm-users] AccountingStorageLoc=

2018-05-22 Thread Raymond Wan
Hi Eric, On Wednesday, May 23, 2018 12:33 AM, Eric F. Alemany wrote: > Can I choose any path that I want to store the filetxt? e.g. > /AccountingStorageLoc=/media/slurmlog/accounting/ > OR > do I need to stick > with AccountingStorageLoc=/var/log/slurm/accounting? I believe you can, though I

Re: [slurm-users] Python and R installation in a SLURM cluster

2018-05-10 Thread Raymond Wan
Hi Eric, On 10/05/18 23:35, Eric F. Alemany wrote: > I know this might sounds as a very basic question: where in > the cluster should I install Python and R? > Headnode? > Execute nodes ? I don't think there is a fixed rule for a question like this and it depends on the compromise between what

Re: [slurm-users] sacct: error

2018-05-03 Thread Raymond Wan
Hi Eric, On Fri, May 4, 2018 at 6:04 AM, Eric F. Alemany wrote: > # COMPUTE NODES > NodeName=radonc[01-04] NodeAddr=10.112.0.5 10.112.0.6 10.112.0.14 > 10.112.0.16 CPUs=32 RealMemory=64402 Sockets=2 CoresPerSocket=8 > ThreadsPerCore=2 State=UNKNOWN > PartitionName=debug Nodes=radonc[01-04] Def

Re: [slurm-users] SLURM on Ubuntu 18.04

2018-05-03 Thread Raymond Wan
Hi Eric and Patrick, On Fri, May 4, 2018 at 5:25 AM, Eric F. Alemany wrote: > I am not sure why but I didn’t see /etc/slurm. When I installed slurmctld and > slurmd from the Ubuntu packages it created a slurm-llnl directory > - /etc/slurm-llnl > Putting slurm.conf in /etc/slurm-llnl/ s

Re: [slurm-users] SLURM on Ubuntu 18.04

2018-05-03 Thread Raymond Wan
Hi Eric, On Thu, May 3, 2018 at 11:21 PM, Eric F. Alemany wrote: > I will follow your advice. It doesn't hurt to try right (?) > > Thank you for your quick reply No, it doesn't hurt to try. If this was an upgrade of some sort, then the only concern might be downtime and its effect on users.

Re: [slurm-users] SLURM on Ubuntu 18.04

2018-05-03 Thread Raymond Wan
Hi Eric, On Thu, May 3, 2018 at 10:51 PM, Eric F. Alemany wrote: > > My question is: once I edit the slurm.conf file, where do I save it on the > master and nodes? In what path or directory should slurm.conf reside? On Ubuntu 17.10, I placed it in /etc/slurm-llnl/, but I don't know if it's c

Re: [slurm-users] [slurm-dev] Re: Installing SLURM locally on Ubuntu 16.04

2017-11-12 Thread Raymond Wan
Hi Will, On Sun, Nov 12, 2017 at 11:07 PM, Will L wrote: > In general, is Debian more cooperative with HPC systems than Ubuntu? Because > I may just kill my KDE Neon installation and trying Debian + KDE Plasma. By "HPC" systems, do you mean a server? Or a supercomputer?? As far as I know, no

Re: [slurm-users] [slurm-dev] Re: Installing SLURM locally on Ubuntu 16.04

2017-11-12 Thread Raymond Wan
Hi Rajiv, On Sun, Nov 12, 2017 at 9:47 PM, Rajiv Nishtala wrote: > It may be that I’m missing context here - but in regards to munge, it makes > much more sense to follow the instructions from the GitHub of > munge. > Remember, the permissions for the key and the /var/ and /etc fold

Re: [slurm-users] [slurm-dev] Re: Installing SLURM locally on Ubuntu 16.04

2017-11-09 Thread Raymond Wan
Hi Will, On Thu, Nov 9, 2017 at 11:01 AM, Will L wrote: > > Thanks for the suggestions. Munge seems to be working just fine. At one point > I tried to build SLURM from the source, but when I could not make it work, I > `sudo make uninstall`ed it and opted for the pre-built apt version all over