You can check with something like this inside a job:

    cat /sys/fs/cgroup/cpuset/slurm/uid_$UID/job_$SLURM_JOB_ID/cpuset.cpus

That lists which CPUs you have access to.
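
The same check from Python, if that's easier to drop into a job script (a sketch that assumes the cgroup v1 cpuset layout above; cgroup v2 puts the file elsewhere):

    import os

    # Assumes the cgroup v1 hierarchy used in the cat command above.
    path = "/sys/fs/cgroup/cpuset/slurm/uid_{}/job_{}/cpuset.cpus".format(
        os.getuid(), os.environ["SLURM_JOB_ID"])
    with open(path) as f:
        print("CPUs available to this job:", f.read().strip())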

On 5/14/21 4:40 PM, Renfro, Michael wrote:

Untested, but prior experience with cgroups indicates that if things are working correctly, even if your code tries to run as many processes as you have cores, those processes will be confined to the cores you reserve.

Try a more compute-intensive worker function that will take some seconds or minutes to complete, and watch the reserved node with 'top' or a similar program. If, for example, the job reserved only 1 core and tried to run 20 processes, you'd see 20 processes in 'top', each at about 5% CPU time.

To make the code a bit more polite, you can import the os module and create a new variable from the SLURM_CPUS_ON_NODE environment variable to guide Python into starting the correct number of processes:

    cpus_reserved = int(os.environ['SLURM_CPUS_ON_NODE'])
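
Untested, but a minimal sketch of how that could look (the worker function here is just a placeholder):

    import os
    import multiprocessing

    def worker(j):
        # Placeholder for a real compute kernel.
        return j ** 2

    if __name__ == '__main__':
        # Size the pool from Slurm's allocation instead of relying on
        # multiprocessing's default, which is the node's full CPU count.
        cpus_reserved = int(os.environ['SLURM_CPUS_ON_NODE'])
        with multiprocessing.Pool(processes=cpus_reserved) as pool:
            print(pool.map(worker, range(20)))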

*From: *slurm-users <slurm-users-boun...@lists.schedmd.com> on behalf of Rodrigo Santibáñez <rsantibanez.uch...@gmail.com>
*Date: *Friday, May 14, 2021 at 5:17 PM
*To: *Slurm User Community List <slurm-users@lists.schedmd.com>
*Subject: *Re: [slurm-users] Exposing only requested CPUs to a job on a given node.


Hi all,

I'm replying so I get notified of answers to this question. I have a user whose Python script used almost all CPUs, even though the job was configured to use only 6 CPUs per task. I reviewed the code, and it doesn't have an explicit call to multiprocessing or anything similar, so the user is unaware of what causes this behavior (and so am I).
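
A quick way to see the mismatch from inside a job is below; os.sched_getaffinity is Linux-only, and reports the CPUs the process is actually allowed to run on (i.e. the job's cpuset), while multiprocessing.cpu_count() reports every CPU on the node:

    import os
    import multiprocessing

    # cpu_count() sees the whole node; sched_getaffinity(0) sees only the
    # CPUs this process may run on (the set the job's cpuset grants it).
    print("cpu_count():         ", multiprocessing.cpu_count())
    print("sched_getaffinity(0):", len(os.sched_getaffinity(0)))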

Running Slurm 20.02.6.

Best!

On Fri, May 14, 2021 at 1:37 PM Luis R. Torres <lrtor...@gmail.com> wrote:

    Hi Folks,

    We are currently running on SLURM 20.11.6 with cgroup constraints
    for memory and CPU/core.  Can the scheduler expose only the
    requested number of CPUs/cores to a job?  We have some users who
    employ Python scripts with the multiprocessing module, and the
    scripts apparently use all of the CPUs/cores on a node, despite
    options that constrain a task to a given number of CPUs.  We would
    like several multiprocessing jobs to run simultaneously on the
    nodes without stepping on each other.

    The sample script I use for testing is below; I'm looking for
    something similar to what can be done with the GPU Gres
    configuration, where only the GPUs requested are exposed to the
    job requesting them.

    #!/usr/bin/env python3

    import multiprocessing

    def worker():
        print("Worker on CPU #%s" % multiprocessing.current_process().name)
        result = 0
        for j in range(20):
            result += j**2
        print("Result on CPU {} is {}".format(
            multiprocessing.current_process().name, result))

    if __name__ == '__main__':
        print("This host exposed {} CPUs".format(multiprocessing.cpu_count()))
        # Start one worker per CPU the node reports; .start() returns None,
        # so keep the Process object itself to join on later.
        jobs = []
        for i in range(multiprocessing.cpu_count()):
            p = multiprocessing.Process(target=worker, name=str(i))
            p.start()
            jobs.append(p)
        for p in jobs:
            p.join()

    Thanks,

--
    ----------------------------------------
    Luis R. Torres


--
Ryan Cox
Director
Office of Research Computing
Brigham Young University
