
Run MPI jobs

We deployed the MPI Operator from Kubeflow to run MPI jobs on the DSRI.

The MPI Operator makes it easy to run allreduce-style distributed training on Kubernetes. Please check out this blog post for an introduction to MPI Operator and its industry adoption.
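To give an intuition for what "allreduce-style" means: every worker contributes a value (typically a gradient) and every worker ends up with the combined result. The toy sketch below simulates that pattern with in-process "workers" summing around a ring; it is purely illustrative and not part of the MPI Operator example (real ring allreduce also chunks the data to overlap communication):

```python
# Illustrative sketch only: a toy sum-allreduce over in-process "workers",
# showing the communication pattern that mpirun distributes across real pods.

def ring_allreduce(worker_values):
    """Sum-allreduce: every worker ends up with the sum of all values."""
    n = len(worker_values)
    values = list(worker_values)
    # Each step, every worker receives its ring neighbour's partial sum and
    # adds its own value; after n - 1 steps every slot holds the full sum.
    for _ in range(n - 1):
        values = [values[(i - 1) % n] + worker_values[i] for i in range(n)]
    return values

print(ring_allreduce([1.0, 2.0, 3.0]))  # every worker gets 6.0
```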

Run MPI jobs on CPU

Check out the MPI Operator repository for a complete example of an MPI job: the Python script, the Dockerfile, and the job deployment YAML.

  1. Clone the repository, and go to the example folder:
git clone https://github.com/kubeflow/mpi-operator.git
cd mpi-operator/examples/horovod
  2. Open the tensorflow-mnist.yaml file, and fix the apiVersion on the first line:
# From
apiVersion: kubeflow.org/v1
# To
apiVersion: kubeflow.org/v1alpha2

You will also need to specify that these containers can run as the root user, by adding serviceAccountName: anyuid between spec: and containers: in both the launcher and the worker templates:

    template:
      spec:
        serviceAccountName: anyuid
        containers:
        - image: docker.io/kubeflow/mpi-horovod-mnist

Your tensorflow-mnist.yaml file should look like this:

apiVersion: kubeflow.org/v1alpha2
kind: MPIJob
metadata:
  name: tensorflow-mnist
spec:
  slotsPerWorker: 1
  cleanPodPolicy: Running
  mpiReplicaSpecs:
    Launcher:
      replicas: 1
      template:
        spec:
          serviceAccountName: anyuid
          containers:
          - image: docker.io/kubeflow/mpi-horovod-mnist
            name: mpi-launcher
            command:
            - mpirun
            args:
            - -np
            - "2"
            - --allow-run-as-root
            - -bind-to
            - none
            - -map-by
            - slot
            - -x
            - LD_LIBRARY_PATH
            - -x
            - PATH
            - -mca
            - pml
            - ob1
            - -mca
            - btl
            - ^openib
            - python
            - /examples/tensorflow_mnist.py
            resources:
              limits:
                cpu: 1
                memory: 2Gi
    Worker:
      replicas: 2
      template:
        spec:
          serviceAccountName: anyuid
          containers:
          - image: docker.io/kubeflow/mpi-horovod-mnist
            name: mpi-worker
            resources:
              limits:
                cpu: 2
                memory: 4Gi

  3. Once this has been set, create the job in your current project on the DSRI (switch projects with oc project my-project):
oc create -f tensorflow-mnist.yaml

You should see the 2 workers and the launcher running on your project's Topology page in the DSRI web UI. You can then easily check the logs of the launcher and the workers.
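If you prefer the command line over the web UI, you can also inspect the job with oc. These commands require access to the cluster, and the job and pod names below are assumptions derived from the MPIJob name (the MPI Operator typically names resources after it):

```shell
# List the pods created for the MPIJob: 2 workers plus a launcher
# (names assumed to derive from the MPIJob name "tensorflow-mnist")
oc get pods | grep tensorflow-mnist

# Stream the launcher logs to follow the training output
# (the launcher is assumed to run as a Job named tensorflow-mnist-launcher)
oc logs -f job/tensorflow-mnist-launcher

# Check a worker's logs (worker pods assumed to be numbered from 0)
oc logs tensorflow-mnist-worker-0
```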

To run your own MPI job on the DSRI, you can take a look at, and edit, the different files provided by the MPI Operator example:

🐍 tensorflow_mnist.py: the python script with the actual job to run

🐳 Dockerfile.cpu: the Dockerfile to define the image of the containers in which your job will run (install dependencies)

⛵️ tensorflow-mnist.yaml: the YAML file to define the MPI deployment on Kubernetes (number and resource limits of workers, mpirun command, etc.)

Visit the Kubeflow documentation on creating an MPI job for more details.

Contact us

Feel free to contact us on the DSRI Slack #helpdesk channel to discuss the use of MPI on the DSRI.