[slurm-users] Re: Slurm versions 24.11.2 and 24.05.6 are now available

2025-02-26 Thread Markus Köberl via slurm-users
On Tuesday, 25 February 2025 22:10:02 CET Marshall Garey via slurm-users wrote: > We are pleased to announce the availability of Slurm versions 24.11.2 > and 24.05.6. On the download page the wrong md5sum is displayed for slurm-24.11.2.tar.bz2 regards Markus Köberl -- Markus Koeber

[slurm-users] Re: Scheduling oddity with multiple GPU types in same partition

2024-10-29 Thread Markus Köberl via slurm-users
u=4,mem=64G,node=1,billing=144,gres/gpu=1,gres/gpu:h100=1 > > 8338676 500060 > cpu=4,mem=64G,node=1,billing=144,gres/gpu=1,gres/gpu:h100=1 Do you have Backfill Scheduling configured with bf_continue? regards Markus Köberl -- Markus Koeberl Graz University of Technol

[slurm-users] Re: problem with squeue --json with version 24.05.1

2024-08-05 Thread Markus Köberl via slurm-users
For me the problem is now fixed with SLURM 24.05.2 regards Markus Köberl On Wednesday, 3 July 2024 15:34:37 CEST Ümit Seren wrote: > We experience the same issue. > > SLURM 24.05.1 segfaults with squeue –json and squeue --json=v0.0.41 but > works with squeue --json=v0.0.40 > &g

[slurm-users] Re: problem with squeue --json with version 24.05.1

2024-07-03 Thread Markus Köberl via slurm-users
On Wednesday, 3 July 2024 13:26:25 CEST Joshua Randall wrote: > Markus, > > I had a similar problem after upgrading from v23 to v24 but found that > specifying _any_ valid data version worked for me, it was only > specifying `--json` without a version that triggered an error (which > in my case wa

[slurm-users] problem with squeue --json with version 24.05.1

2024-07-02 Thread Markus Köberl via slurm-users
$ squeue --version slurm 24.05.1 $ squeue --json malloc(): invalid size (unsorted) Aborted forcing an older data_parser version works: $ squeue --json=v0.0.40 regards Markus Köberl -- Markus Koeberl Graz University of Technology Signal Processing and Speech Communication Laboratory E-mail

Re: [slurm-users] Cluster not booting after upgrade to debian jessie

2018-01-08 Thread Markus Köberl
off * > *(initramfs)* > > Maybe did you ever had this type of problem? Where is your root file system located? If it is on a local disk check your /etc/fstab Maybe the device location has changed with the newer kernel? regards Markus Köberl -- Markus Koeberl Graz University of Technology Signal Processing and Speech Communication Laboratory E-mail: markus.koeb...@tugraz.at

Re: [slurm-users] Graphing job metrics

2018-01-05 Thread Markus Köberl
thub.com/firehol/netdata) also provides such information collected from cgroups in real-time with 1 hour history. It can be configured to use back-ends to archive the metrics. regards Markus Köberl -- Markus Koeberl Graz University of Technology Signal Processing and Speech Communication Laboratory E-mail: markus.koeb...@tugraz.at

Re: [slurm-users] Limit number of CPU in a partition

2018-01-05 Thread Markus Köberl
(in case of a CPU with hyper threading it means 10*20 threads) regards Markus Köberl -- Markus Koeberl Graz University of Technology Signal Processing and Speech Communication Laboratory E-mail: markus.koeb...@tugraz.at

Re: [slurm-users] Limit number of CPU in a partition

2018-01-05 Thread Markus Köberl
On Friday, 5 January 2018 10:55:47 CET Nicolò Parmiggiani wrote: > Hi, > > can someone help me? How can I limit the maximum number of CPUs that a > partition can use. Have a look at the option MaxCPUsPerNode for partitons regards Markus Köberl -- Markus Koeberl Graz University o

Re: [slurm-users] Query about Compute + GPUs

2017-11-30 Thread Markus Köberl
On Tuesday, 21 November 2017 16:38:48 CET Ing. Gonzalo E. Arroyo wrote: > I have a problem detecting RAM and Arch (maybe some more), check this... > > NodeName=fisesta-21-3 Arch=x86_64 CoresPerSocket=1 >CPUAlloc=0 CPUErr=0 CPUTot=2 CPULoad=0.01 >AvailableFeatures=rack-21,2CPUs >ActiveF

Re: [slurm-users] Query about Compute + GPUs

2017-11-21 Thread Markus Köberl
s,qos,WCKey AccountingStorageType=accounting_storage/slurmdbd AccountingStoreJobComment=YES AccountingStorageTRES=CPU,Mem,Gres/gpu JobAcctGatherFrequency=30 JobAcctGatherType=jobacct_gather/cgroup regards Markus Köberl -- Markus Koeberl Graz University of Technology Signal Processing and Speech Communication Laboratory E-mail: markus.koeb...@tugraz.at

Re: [slurm-users] Query about Compute + GPUs

2017-11-21 Thread Markus Köberl
hat could be? I am using slurm 16.05.9 on debian stretch. regards Markus Köberl -- Markus Koeberl Graz University of Technology Signal Processing and Speech Communication Laboratory E-mail: markus.koeb...@tugraz.at