I have a cluster of 4 nodes with 64 CPU cores each. I installed SLURM, and it seems to be working: if I call sbatch, I get the proper allocation and queueing. However, if I request more than 64 cores (i.e., more than one node), SLURM allocates the correct number of nodes, but when I ssh into the allocated nodes I only see actual work on one of them. The rest just sit there doing nothing.
My code is complex, and it uses multiprocessing. I create pools with around 300 workers, so I don't think that should be the problem.
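A stripped-down sketch of how I use the pool (the real worker function is much more involved; work and n_items here are just placeholders):

from multiprocessing import Pool

def work(item):
    # placeholder for the real, much more complex computation
    return item * item

if __name__ == "__main__":
    n_items = 10000  # placeholder input size
    # roughly 300 workers, as mentioned above
    with Pool(processes=300) as pool:
        results = pool.map(work, range(n_items))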
What I would like to achieve is to call sbatch myscript.py with, say, 200 cores and have SLURM distribute my run across those 200 cores, rather than just allocating the correct number of nodes while actually using only one.
The header of my Python script looks like this:
#!/usr/bin/python3
#SBATCH --output=SLURM_%j.log
#SBATCH --partition=part
#SBATCH -n 200
and I call the script with sbatch myscript.py.
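In case it helps with diagnosing this, I can add something like the following at the top of the script to print what SLURM actually hands the job (these are the standard SLURM environment variables, as far as I know):

import os

# dump the SLURM-provided environment so the allocation can be inspected
for var in ("SLURM_JOB_NODELIST", "SLURM_JOB_NUM_NODES",
            "SLURM_NTASKS", "SLURM_CPUS_ON_NODE"):
    print(var, "=", os.environ.get(var))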