Parallel Computing#
Course Description#
This course will provide an introduction to parallel programming. Participants will gain practical experience in writing parallel software, understanding how to decompose problems for efficient execution across multiple processes and threads.
Course Objectives#
On completion of this series of workshops, participants will:
- Be able to explain what is meant by distributed and shared-memory parallelism. 
- Know how to write software that can run across multiple processes using MPI. 
- Be able to write code that utilizes multithreading for parallel execution. 
- Be able to identify how a problem can be divided and parallelised effectively. 
- Gain hands-on experience writing and optimizing parallel code. 
Pre-requisite Knowledge#
This course is for participants who already have some programming experience with Python. If you are not familiar with Python, our Introduction to Python course, is available here.
The interactive network visualisation below displays the prerequisite structure for this course within the training program. Each node represents a course that you may need to complete beforehand, and the arrows show the recommended order in which to take them, leading up to your selected course. You can click on any course node to view more information about that course. This interactive tool helps you clearly see the learning path required to access this course, making it easier to plan your progress with the Coding for Reproducible Research Training (CfRR) initiative.
Pre-Reqs Subnetwork
Sign-up#
To check for upcoming course dates and to register, please visit the Workshop Schedule and Sign-up page available here.
import pandas as pd
from datetime import datetime
from IPython.display import display, HTML
# Define the course that is being looked at
course_name = "Parallel Computing"
# Load the CSV file
file_path = '../data/workshop_info.csv'  # Adjust the path to your file location
courses_df = pd.read_csv(file_path)
# Strip any extra spaces in the column names
courses_df.columns = courses_df.columns.str.strip()
# Convert date columns to datetime
courses_df['Start Date'] = pd.to_datetime(courses_df['Start Date'], dayfirst=True, errors='coerce')
courses_df['End Date'] = pd.to_datetime(courses_df['End Date'], dayfirst=True, errors='coerce')
# Get today's date
today = datetime.now()
# Function to generate markdown text based on the course dates
def generate_html(row):
    if pd.notna(row['Start Date']) and pd.notna(row['End Date']):
        if row['Start Date'] <= today <= row['End Date']:
            return f"<div style='font-weight: bold;'>This course is currently accepting applications.</div>"
    return ""
# Apply the function and create a new column for Markdown
courses_df['HTML'] = courses_df.apply(generate_html, axis=1)
# Variable for course name
# Filter the DataFrame for the given course name and display the HTML text
html_output = courses_df[courses_df['Course Name'] == course_name]['HTML'].values[0]
display(HTML(html_output))
Installation Instructions#
Installing Python#
If you do now have a working Python installation on your machine (you can check this with which python), you can follow the installation instructions from the CFRR Intro to Python course here.
Installing MPI#
The Message Passing Interface (MPI) is a standard for passing messages between multiple networked processes running a parallel program. As MPI is a standard, rather than a piece of software, there is not a single software package that you need to install. There are a few different ‘flavours’ of MPI, and the majority of HPC systems will have their own versions which are tuned for their specific systems. It is highly recommended that you use a system’s built-in MPI libraries if they are available, but if not let’s go through the process of installing an MPI library.
Note
Requirements
For this workshop, you will need a multi-core machine which can run a Unix-based terminal (i.e. Linux/WSL or Mac).
MacOS MPI installation#
MPI can be easily installed with Homebrew. Check your machine has homebrew installed with
$ which brew
If this returns the location of the brew executable, then you can proceed with:
$ brew install open-mpi
Linux/WSL MPI installation#
There are two ways to ways to install MPI on Linux or WSL platforms is using the Spack package manager (this can also be done for MacOS but requires some additional steps), or using conda. Each of these two methods will be explained below.
Using Spack package manager#
In our case, we can install OpenMPI, which is a free and open source MPI implementation. OpenMPI can be installed in a number of different ways, but the recommended way is to use the Spack HPC package manager, which is in a class of its own in the way it handles different MPI implementations.
Spack is really simple to install, all you need to need to do is clone the Spack repository:
$ git clone --depth=100 --branch=releases/v0.21 https://github.com/spack/spack.git ~/spack
and source the included setup script:
$ source ~/spack/share/spack/setup-env.sh
Every time you want to use Spack you will need to source this script, so it may be easier to add this to your shell login script, (i.e. ~/.bashrc, ~/.zshrc, etc.).
We need to let Spack find any compilers in our system, which we can do with:
$ spack compiler find
Note
Next you have to install MPI with spack. This method requires you to have a working set of compilers for C and Fortran. If you don’t have these on your system the simplest way to get them is to install them using your system package manager.
We can use Spack to install an MPI library, which will default to installing OpenMPI. If we run
$ spack spec mpi
we can see what Spack will install, and we can use
$ spack install mpi
to execute the installation. Once this is done, we can load the new mpi module with
$ spack load mpi
and check the installation with
$ which mpirun
This command tells us where the mpirun command has been installed, which is the primary way that we can launch an executable across multiple process with MPI. With the installation complete we are ready to run some programs with MPI.
Using Conda package manager#
First download the Miniforge installer from https://conda-forge.org/miniforge/ . Then run the installer using bash within your WSL terminal:
$ bash Miniforge3-xx.x.x-x-<operating-system>-x86_64.sh
Next create a conda environment and then activate the conda base environment.
$ conda activate
Now create a new environment, for example, named mpi_env
$ conda activate
$ conda create -n mpi_env python=3.9  # Replace 3.9 with your desired Python version
Finally install OpenMPI:
$ conda install -c conda-forge openmpi openmpi-mpicc openmpi-mpicxx openmpi-mpifort compilers
Installing MPI for Python (MPI4Py)#
Once you have installed MPI, either via spack or conda, the next step is install a Python interface to this library. There are many different interfaces to MPI for many different languages, but we’ve chosen Python for the benefits it provides to write examples in an easy-to-understand format. Whilst the specific syntax of the commands learned in this part of the course wont be applicable across different languages, the overall code structures and concepts are highly transferable, so once you have a solid grasp of the fundamentals of MPI you should be able to take those concepts to any language with an MPI interface and write parallel code!
The python package that we will be using in this course to implement MPI command is the MPI4Py package. To install the package, make sure you have loaded your spack or conda environment with the MPI library install.
The MPI4Py package can be installed via pip as follows:
pip install mpi4py
Self Study Material Link#
The self-study material for this course is available here.
Acknowledgements#
This course was adapted from the Software Carpentries Programming with Python. It has been developed by the University of Exeter Research Software Engineering Group and a team of generous volunteers who are enthusiastic about sharing their skills with the wider research community.
Its provision is dependent on the time of these volunteers. If you have benefited in any way from this course and want to support its long term sustainability then please take the time to complete our feedback survey, recommend it to your colleagues, and enthuse about it to your senior leadership team!
Developers#
The contributors to this course include:
- Ed Hone 
- Ricky Olivier 
Course Delivery Content#
There is currently no additional content that is used outside of the self-study notes to deliver this course.
License Info#
Instructional Material
The instructional material in this course is copyright © 2024 University of Exeter and is made available under the Creative Commons Attribution 4.0 International licence (https://creativecommons.org/licenses/by/4.0/). Instructional material consists of material that is contained within the “individual_modules/parallel_computing” directory, and images folders in this directory, with the exception of code snippets and example programs found in files within these folders. Such code snippets and example programs are considered software for the purposes of this licence.
Software
Except where otherwise noted, software provided in this repository is made available under the MIT licence (https://opensource.org/licenses/MIT).
Copyright © 2024 University of Exeter
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the “Software”), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
The software in this repository is adapted from software that is covered by the following copyright and permission notice:
Copyright © 2024 Software Carpentry
Permission is hereby granted, free of charge, to any person obtaining
a copy of this software and associated documentation files (the
"Software"), to deal in the Software without restriction, including
without limitation the rights to use, copy, modify, merge, publish,
distribute, sublicense, and/or sell copies of the Software, and to
permit persons to whom the Software is furnished to do so, subject to
the following conditions:
The above copyright notice and this permission notice shall be
included in all copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND,
EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF
MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND
NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE
LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION
OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION
WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
