That means the system has no idea who that user is.
If you are using /etc/passwd, that file is not synced to the Slurm
master node(s).
If you are part of a domain or other shared directory (LDAP, etc.), your
master is likely not configured correctly.
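Either way, a quick test is to ask NSS directly on the master node
itself, not only on the login node. A minimal sketch, assuming getent
is available there; check_user is just a helper name I made up:

```shell
# Report whether a UID or username resolves through NSS on this host.
# getent consults the sources listed in /etc/nsswitch.conf, so it
# covers /etc/passwd, LDAP, SSSD and so on.
check_user() {
    if getent passwd "$1" >/dev/null; then
        echo "resolved: $1"
    else
        echo "NOT resolved: $1"
        return 1
    fi
}
```

Running "check_user 621049927" on the master should print
"resolved: 621049927". If it does not, the lookup problem is on that
host rather than in the Slurm accounting database.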
If you are using SSSD, it is also possible that its cache lifetime is
too long. Run "sss_cache -E" to invalidate all cached entries.
Note: that doesn't always work well, so you can instead stop sssd,
delete the cache files and restart it.
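The stop/delete/restart sequence could look roughly like this. It is a
sketch only: it assumes systemd and the usual cache directory
/var/lib/sss/db (check your distribution), and the function names and
the DRY_RUN switch are my own invention:

```shell
# Hard-reset the SSSD cache: stop the daemon, delete the cache files,
# start it again. Must run as root on the affected node.
# Assumption: the cache lives in /var/lib/sss/db; override with SSS_DB.
SSS_DB="${SSS_DB:-/var/lib/sss/db}"

# Run a command, or only print it when DRY_RUN=1 is set.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

reset_sssd_cache() {
    run systemctl stop sssd
    run find "$SSS_DB" -type f -delete
    run systemctl start sssd
}
```

Try "DRY_RUN=1 reset_sssd_cache" first to see what it would do. After
a real run the first lookups will be slow while the cache refills.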
If you have a forest, it could be that the information has not yet
propagated to all the servers, in which case you have to wait.
I've been places where that can take 24 hours.
Brian Andrus
On 9/29/2020 6:18 AM, Diego Zuccato wrote:
Hello all.
One of the users is unable to submit jobs to our cluster.
The first time he tries, he gets
$ sbatch test.job
sbatch: fatal: Invalid user id: 621049927
then:
$ sbatch test.job
sbatch: error: Batch job submission failed: Invalid account or account/partition combination specified
In slurmctl.log I see:
[2020-09-29T13:55:20.018] error: User 621049927 not found
[2020-09-29T13:55:20.019] _job_create: invalid account or partition for user 621049927, account '(null)', and partition 'matrix'
[2020-09-29T13:55:20.019] _slurm_rpc_submit_batch_job: Invalid account or account/partition combination specified
(621049927 is the user's correct UID).
I suspected a PBIS glitch, but both "id studenti#name.surname" and
"id UID" returned the correct info.
I found an old thread suggesting adding DefaultAccount, so I tried that
even though it had already been set by the script that imports AD users
into Slurm accounting.
# sacctmgr modify user studenti#name.surname cluster=oph set DefaultAccount=materia
The db seems OK:
# sacctmgr list user STUDENTI#name.surname WithAssoc
User Def Acct Admin Cluster Account Partition
Share MaxJobs MaxNodes MaxCPUs MaxSubmit MaxWall MaxCPUMins
QOS Def QOS
---------- ---------- --------- ---------- ---------- ----------
--------- ------- -------- -------- --------- ----------- -----------
-------------------- ---------
studenti#+ materia None oph materia
2 20
normal
What am I missing?
Thanks.