Message Passing Interface¶
How do we practically realize this parallelism?
Let us focus on what we have discussed so far:
We have “machines” with multiple processors, whose main memory is partitioned into separate fragments,
We have algorithms that can divide a problem of size \(N\) among these processors so that they can run (almost) independently,
With a certain degree of approximation, we know how to estimate the best improvement we can expect from a parallel program with \(M\) processors on a problem of size \(N\).
What we need to discuss now is: “How can we actually implement these algorithms on real machines?”
We need a way to define a parallel environment in which every processor is accounted for,
We need to have data formats that are aware of the fact that we have a distributed memory,
We need to exchange data between the various memory fragments.
“MPI (Message Passing Interface) is a specification for a standard library for message passing that was defined by the MPI Forum, a broadly based group of parallel computer vendors, library writers, and applications specialists.” – W. Gropp, E. Lusk, N. Doss, A. Skjellum, “A high-performance, portable implementation of the MPI message passing interface standard”, Parallel Computing, 22 (6), 1996.
MPI implementations consist of a specific set of routines directly callable from C, C++, Fortran, Python;
MPI uses Language Independent Specifications for calls and language bindings;
The MPI interface provides essential virtual topology, synchronization, and communication functionality between a set of processes.
There exist many implementations of the MPI specification, e.g., MPICH, Open MPI, etc.
Our First MPI Program¶
Throughout the course we are going to use MPI from within Python programs; nevertheless, let us start from the classical helloworld program in C:
%%file ccode/helloworld.c
#include "mpi.h"
#include <stdio.h>
int main(int argc, char **argv){
    MPI_Init(&argc, &argv);
    printf("Hello, world!\n");
    MPI_Finalize();
    return 0;
}
Overwriting ccode/helloworld.c
We can compile it by doing
mpicc helloworld.c -o helloworld
mpicc is a wrapper for a C compiler provided by the Open MPI implementation of MPI. The option -o sets the name of the compiled (executable) file.
Let us see what is happening behind the curtains:
you can first discover which compiler you are using by executing
mpicc --version
which will print something like
gcc (Ubuntu 7.4.0-1ubuntu1~18.04.1) 7.4.0
Copyright (C) 2017 Free Software Foundation, Inc.
or discover the library inclusion and linking options by asking for
mpicc --showme:compile
and
mpicc --showme:link
respectively. In general, looking at the output of the
man mpicc
command is always a good idea.
“If you find yourself saying, ‘But I don’t want to use wrapper compilers!’, please humor us and try them. See if they work for you. Be sure to let us know if they do not work for you.” – https://www.open-mpi.org/faq/?category=mpi-apps
Note
A piece of advice: if your program is anything more realistic than a classroom exercise, use make, and save yourself from writing painfully long compile commands and from dealing with complex dependencies more than once.
“Make gets its knowledge of how to build your program from a file called the makefile, which lists each of the non-source files and how to compute it from other files.”
A very simple Makefile for our first test would be
MPICC = mpicc #The wrapper for the compiler
CFLAGS += -g #Useful for debug symbols
all: helloworld
helloworld: helloworld.c
$(MPICC) $(CFLAGS) $(LDFLAGS) $? $(LDLIBS) -o $@
clean:
rm -f helloworld
Let us run our first parallel program by doing:
mpirun [ -np X ] [ --hostfile <filename> ] ./helloworld
or by using its synonym
mpiexec [ -np X ] [ --hostfile <filename> ] ./helloworld
mpiexec will run X copies of helloworld in your current run-time environment, scheduling (by default) in a round-robin fashion by CPU slot.
If running under a supported resource manager, Open MPI’s mpirun will usually automatically use the corresponding resource manager process starter, as opposed to, for example, rsh or ssh, which require the use of a hostfile; otherwise it will default to running all X copies on the localhost.
As always, look at the manual by doing
man mpirun
!(cd ccode && make helloworld)
!mpiexec -np 4 ./ccode/helloworld
make[1]: Entering directory "/home/cirdan/Documenti/RTDa-PISA/CorsoCalcoloParallelo2021/introtoparallelcomputing/intrompi/ccode"
mpicc -march=nocona -mtune=haswell -ftree-vectorize -fPIC -fstack-protector-strong -fno-plt -O2 -ffunction-sections -pipe -isystem /home/cirdan/anaconda3/envs/parallel/include -g -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,-rpath,/home/cirdan/anaconda3/envs/parallel/lib -Wl,-rpath-link,/home/cirdan/anaconda3/envs/parallel/lib -L/home/cirdan/anaconda3/envs/parallel/lib helloworld.c -lm -ldl -o helloworld
make[1]: Leaving directory "/home/cirdan/Documenti/RTDa-PISA/CorsoCalcoloParallelo2021/introtoparallelcomputing/intrompi/ccode"
Hello, world!
Hello, world!
Hello, world!
Hello, world!
Every process executes the printf line: it is a local routine!
A procedure is local if completion of the procedure depends only on the local executing process.
A procedure is non-local if its completion may require the execution of some MPI procedure on another process. Such an operation may require communication with another user process.
The MPI parallel environment¶
Let us modify our helloworld to investigate the MPI parallel environment. Specifically, we want to answer, from within the program, the following questions:
How many processes are there?
Who am I?
%%file ccode/hamlet.c
#include "mpi.h"
#include <stdio.h>
int main( int argc, char **argv ){
int rank, size;
MPI_Init( &argc, &argv );
MPI_Comm_rank( MPI_COMM_WORLD, &rank );
MPI_Comm_size( MPI_COMM_WORLD, &size );
printf( "Hello world! I'm process %d of %d\n",rank, size );
MPI_Finalize();
return 0;
}
Overwriting ccode/hamlet.c
How many? is answered by a call to MPI_Comm_size, which returns the number of processes as an int value,
Who am I? is answered by a call to MPI_Comm_rank, which returns an int value that is conventionally called the rank and is a number between 0 and size-1.
The last keyword we need to describe is MPI_COMM_WORLD: this is the default Communicator object, containing all the processes of our run.
Communicator: A Communicator object connects a group of processes in one MPI session. There can be more than one communicator in an MPI session, each of them gives each contained process an independent identifier and arranges its contained processes in an ordered topology.
This provides
a safe communication space, which guarantees that the code can communicate as needed without conflicting with communication extraneous to the present code, e.g., when other parallel libraries are in use,
a unified object for conveniently denoting the communication context and the group of communicating processes, and for housing abstract process naming.
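As a quick illustration of the fact that more than one communicator can coexist in a session, the following is a minimal sketch (not part of the course code; the file layout and variable names are our own choice) that uses MPI_Comm_split to partition MPI_COMM_WORLD into two sub-communicators, one for the even and one for the odd world ranks; each process then gets an independent rank inside its sub-communicator.

/* A minimal sketch (names ours): splitting MPI_COMM_WORLD into two
   sub-communicators, one for even and one for odd world ranks. */
#include "mpi.h"
#include <stdio.h>

int main(int argc, char **argv){
    int world_rank, world_size, sub_rank, sub_size;
    MPI_Comm subcomm;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &world_rank);
    MPI_Comm_size(MPI_COMM_WORLD, &world_size);
    /* color selects the sub-communicator, key orders the ranks inside it */
    MPI_Comm_split(MPI_COMM_WORLD, world_rank % 2, world_rank, &subcomm);
    MPI_Comm_rank(subcomm, &sub_rank);
    MPI_Comm_size(subcomm, &sub_size);
    printf("World rank %d of %d is rank %d of %d in its sub-communicator\n",
           world_rank, world_size, sub_rank, sub_size);
    MPI_Comm_free(&subcomm);
    MPI_Finalize();
    return 0;
}

Running it with, e.g., mpiexec -np 4 shows each process reporting both its world rank and its (different) rank in the sub-communicator.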
If we have saved our inquiring MPI program in the file hamlet.c, we can then update our Makefile by modifying/adding the lines
all: helloworld hamlet
hamlet: hamlet.c
$(MPICC) $(CFLAGS) $(LDFLAGS) $? $(LDLIBS) -o $@
clean:
rm -f helloworld hamlet
Then, we compile everything by doing make hamlet (or, simply, make).
!(cd ccode && make hamlet)
!mpiexec -np 6 ./ccode/hamlet
make[1]: Entering directory "/home/cirdan/Documenti/RTDa-PISA/CorsoCalcoloParallelo2021/introtoparallelcomputing/intrompi/ccode"
mpicc -march=nocona -mtune=haswell -ftree-vectorize -fPIC -fstack-protector-strong -fno-plt -O2 -ffunction-sections -pipe -isystem /home/cirdan/anaconda3/envs/parallel/include -g -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,-rpath,/home/cirdan/anaconda3/envs/parallel/lib -Wl,-rpath-link,/home/cirdan/anaconda3/envs/parallel/lib -L/home/cirdan/anaconda3/envs/parallel/lib hamlet.c -lm -ldl -o hamlet
make[1]: Leaving directory "/home/cirdan/Documenti/RTDa-PISA/CorsoCalcoloParallelo2021/introtoparallelcomputing/intrompi/ccode"
Hello world! I'm process 1 of 6
Hello world! I'm process 4 of 6
Hello world! I'm process 0 of 6
Hello world! I'm process 2 of 6
Hello world! I'm process 3 of 6
Hello world! I'm process 5 of 6
We can rewrite the same code in Python as
%%file hamlet.py
"""
Hello (parallel) world!
"""
from mpi4py import MPI
comm = MPI.COMM_WORLD
rank = comm.Get_rank()
size = comm.Get_size()
print("Hello world! I'm process ",rank," of ",size)
Overwriting hamlet.py
What have we done here?
The instruction from mpi4py import MPI provides basic MPI definitions and types; if this were C code, this would have been a preprocessor directive of the form #include "mpi.h".
MPI is initialized when the module is imported, and we get access to the default communicator with comm = MPI.COMM_WORLD.
For the Python code:
How many? is answered by a call to comm.Get_size(), which returns an int value,
Who am I? is answered by a call to comm.Get_rank(), which returns an int value that is conventionally called the rank and is a number between 0 and size-1.
!mpiexec -n 4 python hamlet.py
Hello world! I'm process 0 of 4
Hello world! I'm process 1 of 4
Hello world! I'm process 2 of 4
Hello world! I'm process 3 of 4
Every process answers the call,
but it answers as soon as it is done with its own computation! There is no synchronization.
Point-to-point communication¶
Sending and Receiving Messages. We have seen that each process within a communicator is identified by its rank; how can we exchange data between two processes?
We need several pieces of information to form a meaningful message:
Who is sending the data?
To whom is the data sent?
What type of data are we sending?
How can the receiver identify it?
The blocking send and receive¶
int MPI_Send(void *message, int count,
MPI_Datatype datatype, int dest, int tag,
MPI_Comm comm)
void *message: points to the message content itself; it can be a simple scalar or a group of data,
int count: specifies the number of data elements of which the message is composed,
MPI_Datatype datatype: indicates the data type of the elements that make up the message,
int dest: the rank of the destination process,
int tag: the user-defined tag field,
MPI_Comm comm: the communicator in which the source and destination processes reside and for which their respective ranks are defined.
int MPI_Recv (void *message, int count,
MPI_Datatype datatype, int source, int tag,
MPI_Comm comm, MPI_Status *status)
void *message: points to the message content itself; it can be a simple scalar or a group of data,
int count: specifies the number of data elements of which the message is composed,
MPI_Datatype datatype: indicates the data type of the elements that make up the message,
int source: the rank of the source process,
int tag: the user-defined tag field,
MPI_Comm comm: the communicator in which the source and destination processes reside,
MPI_Status *status: a structure that contains three fields named MPI_SOURCE, MPI_TAG, and MPI_ERROR.
Basic MPI Data Types. Of the inputs in the previous slides, the only one that is specific to MPI is the MPI_Datatype; each of these corresponds to a C data type:
MPI Data Types | C Type
---|---
MPI_CHAR | signed char
MPI_SHORT | signed short int
MPI_INT | signed int
MPI_LONG | signed long int
MPI_UNSIGNED_CHAR | unsigned char
MPI_UNSIGNED_SHORT | unsigned short int
MPI_UNSIGNED | unsigned int
MPI_UNSIGNED_LONG | unsigned long int
MPI_FLOAT | float
MPI_DOUBLE | double
MPI_LONG_DOUBLE | long double
Note: we will see in the following how to send/receive user-defined data structures.
Why “blocking” send and receive? For MPI_Send, being blocking means that it does not return until the message data and envelope have been safely stored away, so that the sender is free to modify the send buffer: it is a non-local operation.
Note: The message might be copied directly into the matching receive buffer (as in the first figure), or it might be copied into a temporary system buffer.
A simple send/receive example¶
If we want to test these two instructions we can write the following simple C program.
%%file ccode/easysendrecv.c
#include "mpi.h"
#include <string.h>
#include <stdio.h>
int main( int argc, char **argv){
char message[20];
int myrank;
MPI_Status status;
MPI_Init( &argc, &argv );
MPI_Comm_rank( MPI_COMM_WORLD, &myrank );
if (myrank == 0){ /* code for process zero */
strcpy(message,"Hello, there");
MPI_Send(message, strlen(message)+1, MPI_CHAR, 1, 99, MPI_COMM_WORLD);
}
else if (myrank == 1){ /* code for process one */
MPI_Recv(message, 20, MPI_CHAR, 0, 99, MPI_COMM_WORLD, &status);
printf("received :%s:\n", message);
}
MPI_Finalize();
return 0;
}
Overwriting ccode/easysendrecv.c
This could be recast in Python as
%%file easysendrecv.py
"""
A simple send/receive example
"""
from mpi4py import MPI
comm = MPI.COMM_WORLD
rank = comm.Get_rank()
size = comm.Get_size()
if rank == 0:
data = "Hello, there"
comm.send(data, dest=1, tag=99)
elif rank == 1:
data = comm.recv(source=0, tag=99)
print('received :',data)
Overwriting easysendrecv.py
which we can run, as we did for the simpler program, by doing:
!mpiexec -np 2 python easysendrecv.py
received : Hello, there
for the Python version, or the following for the C version
!(cd ccode && make easysendrecv)
!mpiexec -np 2 ./ccode/easysendrecv
make[1]: Entering directory "/home/cirdan/Documenti/RTDa-PISA/CorsoCalcoloParallelo2021/introtoparallelcomputing/intrompi/ccode"
mpicc -march=nocona -mtune=haswell -ftree-vectorize -fPIC -fstack-protector-strong -fno-plt -O2 -ffunction-sections -pipe -isystem /home/cirdan/anaconda3/envs/parallel/include -g -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,-rpath,/home/cirdan/anaconda3/envs/parallel/lib -Wl,-rpath-link,/home/cirdan/anaconda3/envs/parallel/lib -L/home/cirdan/anaconda3/envs/parallel/lib easysendrecv.c -lm -ldl -o easysendrecv
make[1]: Leaving directory "/home/cirdan/Documenti/RTDa-PISA/CorsoCalcoloParallelo2021/introtoparallelcomputing/intrompi/ccode"
received :Hello, there:
So, what have we done? Process \(0\) sends the content of the char array message[20], of size strlen(message)+1 elements of type char (MPI_CHAR), to process 1 with tag 99 on the communicator MPI_COMM_WORLD. On the other side, process \(1\) receives into the buffer message[20] at most 20 elements of type MPI_CHAR, from process 0 with tag 99 on the same communicator MPI_COMM_WORLD.
Observe that in the Python case we did not declare the size or the type of the object we were passing. The all-lowercase methods (of the Comm class), like send() and recv(), work by passing the object to be sent as a parameter to the communication call, and the received object is simply the return value. These variants can communicate general Python objects.
In MPI for Python, the MPI.Comm.Send() and MPI.Comm.Recv() methods of communicator objects provide support for blocking point-to-point communications and can be used to communicate memory buffers, as we do in the C variant. Consider the following example, which sends numpy arrays between two processes.
%%file easysendrecv2.py
"""
A (slightly less) simple send/receive example
In which we :
- pass MPI datatypes explicitly
- use the MPI datatype discovery
"""
from mpi4py import MPI
import numpy
comm = MPI.COMM_WORLD
rank = comm.Get_rank()
if rank == 0:
data = numpy.arange(1000, dtype='i')
comm.Send([data, MPI.INT], dest=1, tag=77)
elif rank == 1:
data = numpy.empty(1000, dtype='i')
comm.Recv([data, MPI.INT], source=0, tag=77)
if rank == 0:
data = numpy.arange(100, dtype=numpy.float64)
comm.Send(data, dest=1, tag=13)
elif rank == 1:
data = numpy.empty(100, dtype=numpy.float64)
comm.Recv(data, source=0, tag=13)
Overwriting easysendrecv2.py
In general, buffer arguments to these calls must be explicitly specified by using a 2/3-list/tuple like [data, MPI.DOUBLE], or [data, count, MPI.DOUBLE] (the former uses the byte-size of data and the extent of the MPI datatype to define count).
!mpiexec -np 2 python easysendrecv2.py
For the all-lowercase methods, which handle generic Python objects, mpi4py relies on the pickle module for serialization. This module implements binary protocols for serializing and de-serializing a Python object structure: “pickling” is the process of converting a Python object hierarchy into a byte stream, and “unpickling” is the inverse operation, converting a byte stream (from a binary file or bytes-like object) back into an object hierarchy.
A simple send/receive example : programmer smash!¶
It is a good exercise to try and mess things up, so let us see some damaging suggestions (test them with the previous C code):
What happens if we have a mismatch in the tags?
A: The process hangs there, waiting for a message with a tag that will never come… (a minimal sketch of this failure is given after this list).
What happens if we have a mismatch in the ranks of the sending and receiving processes?
A: The process hangs there, trying to match messages that will never come…
What happens if we use the wrong message size?
A: If the arriving message is longer than expected, we get an MPI_ERR_TRUNCATE: message truncated error; note that there are combinations of wrong sizes for which things still work.
What happens if we have a mismatch in the type?
A: There are combinations of instances in which things seem to work, but the code is erroneous, and the behavior is not deterministic.
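As a concrete instance of the first failure mode, here is a minimal sketch (not part of the course code; file and variable names are ours) in which the receiver asks for tag 98 while the sender uses tag 99, so process 1 hangs in MPI_Recv:

/* A minimal sketch (names ours) of the first "damaging suggestion":
   the tags do not match, so process 1 waits forever. Run it with two
   processes only to observe the hang, then terminate it with Ctrl-C. */
#include "mpi.h"
#include <stdio.h>

int main(int argc, char **argv){
    int myrank, value = 1;
    MPI_Status status;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &myrank);
    if (myrank == 0){
        MPI_Send(&value, 1, MPI_INT, 1, 99, MPI_COMM_WORLD);          /* tag 99 */
    } else if (myrank == 1){
        MPI_Recv(&value, 1, MPI_INT, 0, 98, MPI_COMM_WORLD, &status); /* tag 98: never matched */
        printf("This line is never reached\n");
    }
    MPI_Finalize();
    return 0;
}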
Deadlock¶
We now have two processes that need to exchange some data.
Solution 1:
MPI_Comm_rank(comm, &myrank);
if (myrank == 0){
MPI_Send(sendbuf, count, MPI_DOUBLE, 1, tag, comm);
MPI_Recv(recvbuf, count, MPI_DOUBLE, 1, tag, comm, status);
}else if(myrank == 1){
MPI_Send(sendbuf, count, MPI_DOUBLE, 0, tag, comm);
MPI_Recv(recvbuf, count, MPI_DOUBLE, 0, tag, comm, status);
}
Solution 2:
MPI_Comm_rank(comm, &myrank);
if (myrank == 0){
MPI_Recv(recvbuf, count, MPI_DOUBLE, 1, tag, comm, status);
MPI_Send(sendbuf, count, MPI_DOUBLE, 1, tag, comm);
}else if(myrank == 1){
MPI_Recv(recvbuf, count, MPI_DOUBLE, 0, tag, comm, status);
MPI_Send(sendbuf, count, MPI_DOUBLE, 0, tag, comm);
}
Solution 3:
MPI_Comm_rank(comm, &myrank);
if (myrank == 0){
MPI_Send(sendbuf, count, MPI_DOUBLE, 1, tag, comm);
MPI_Recv(recvbuf, count, MPI_DOUBLE, 1, tag, comm, status);
}else if(myrank == 1){
MPI_Recv(recvbuf, count, MPI_DOUBLE, 0, tag, comm, status);
MPI_Send(sendbuf, count, MPI_DOUBLE, 0, tag, comm);
}
In the case of Solution 1:
MPI_Comm_rank(comm, &myrank);
if (myrank == 0){
MPI_Send(...);
MPI_Recv(...);
}else if(myrank == 1){
MPI_Send(...);
MPI_Recv(...);
}
The call MPI_Send is blocking: the message sent by each process has to be copied out before the send operation returns and the receive operation starts. For the calls to complete successfully, it is then necessary that at least one of the two messages sent is buffered, otherwise…
a deadlock situation occurs: both processes are blocked since there is no buffer space available!
In the case of Solution 2:
MPI_Comm_rank(comm, &myrank);
if (myrank == 0){
MPI_Recv(...);
MPI_Send(...);
}else if(myrank == 1){
MPI_Recv(...);
MPI_Send(...);
}
The receive operation of process \(0\) must complete before its send. It can complete only if the matching send of process \(1\) is executed.
The receive operation of process \(1\) must complete before its send. It can complete only if the matching send of process \(0\) is executed.
This program will always deadlock.
In the case of Solution 3:
MPI_Comm_rank(comm, &myrank);
if (myrank == 0){
MPI_Send(...);
MPI_Recv(...);
}else if(myrank == 1){
MPI_Recv(...);
MPI_Send(...);
}
This program will succeed even if no buffer space for data is available.
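For completeness, here is a minimal self-contained version of Solution 3 that can be compiled and run with two processes; the file layout and exchanged values are ours, chosen just for illustration:

/* A minimal, self-contained version of Solution 3 (names ours). */
#include "mpi.h"
#include <stdio.h>

int main(int argc, char **argv){
    double sendbuf = 0.0, recvbuf = -1.0;
    int myrank, tag = 0;
    MPI_Status status;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &myrank);
    sendbuf = (double) myrank;           /* each process sends its own rank */
    if (myrank == 0){
        MPI_Send(&sendbuf, 1, MPI_DOUBLE, 1, tag, MPI_COMM_WORLD);
        MPI_Recv(&recvbuf, 1, MPI_DOUBLE, 1, tag, MPI_COMM_WORLD, &status);
    } else if (myrank == 1){
        MPI_Recv(&recvbuf, 1, MPI_DOUBLE, 0, tag, MPI_COMM_WORLD, &status);
        MPI_Send(&sendbuf, 1, MPI_DOUBLE, 0, tag, MPI_COMM_WORLD);
    }
    if (myrank < 2)
        printf("Process %d received %g\n", myrank, recvbuf);
    MPI_Finalize();
    return 0;
}

Saving it, e.g., as ccode/exchange.c, adding a Makefile target analogous to the ones above, and running it with mpiexec -np 2 prints the value received by each process.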
Deadlock Issues¶
We can try to salvage the situation in the case of Solution 1 by allocating buffer space for the send calls
if (myrank == 0){
MPI_Send(...);
MPI_Recv(...);
}else if(myrank == 1){
MPI_Send(...);
MPI_Recv(...);
}
We can substitute the MPI_Send operation with a send in buffered mode
int MPI_Bsend(const void* buf, int count,
MPI_Datatype datatype, int dest,
int tag, MPI_Comm comm)
A buffered mode send operation can be started whether or not a matching receive has been posted;
It may complete before a matching receive is posted;
This operation is local!
Allocating buffer space. To actually use MPI_Bsend we also need to allocate the space for the buffer; to this end we use the two functions MPI_Buffer_attach and MPI_Buffer_detach:
#define BUFFSIZE 10000
int size; char *buff;
// Buffer of 10000 bytes for MPI_Bsend
MPI_Buffer_attach( malloc(BUFFSIZE), BUFFSIZE);
// Buffer size reduced to zero
MPI_Buffer_detach( &buff, &size);
// Buffer of 10000 bytes available again
MPI_Buffer_attach( buff, size);
Warning
A pointer to the buffer is passed to MPI_Buffer_attach, while the address of the pointer is passed to MPI_Buffer_detach, and both arguments are typed void *.
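Putting the pieces together, the following is a minimal sketch (file and variable names, and the buffer size, are ours) of how Solution 1 can be made safe with buffered-mode sends, attaching just enough space for one double plus the MPI_BSEND_OVERHEAD required by the standard:

/* A minimal sketch: Solution 1 made safe with buffered-mode sends.
   Names are ours; error checking omitted. */
#include "mpi.h"
#include <stdio.h>
#include <stdlib.h>

int main(int argc, char **argv){
    double sendbuf = 0.0, recvbuf = -1.0;
    int myrank, tag = 0, bufsize;
    char *buff;
    MPI_Status status;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &myrank);
    sendbuf = (double) myrank;
    /* room for one double plus the per-message overhead required by MPI */
    bufsize = sizeof(double) + MPI_BSEND_OVERHEAD;
    MPI_Buffer_attach(malloc(bufsize), bufsize);
    if (myrank == 0){
        MPI_Bsend(&sendbuf, 1, MPI_DOUBLE, 1, tag, MPI_COMM_WORLD);
        MPI_Recv(&recvbuf, 1, MPI_DOUBLE, 1, tag, MPI_COMM_WORLD, &status);
    } else if (myrank == 1){
        MPI_Bsend(&sendbuf, 1, MPI_DOUBLE, 0, tag, MPI_COMM_WORLD);
        MPI_Recv(&recvbuf, 1, MPI_DOUBLE, 0, tag, MPI_COMM_WORLD, &status);
    }
    MPI_Buffer_detach(&buff, &bufsize);  /* completes all pending buffered sends */
    free(buff);
    if (myrank < 2)
        printf("Process %d received %g\n", myrank, recvbuf);
    MPI_Finalize();
    return 0;
}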
Nonblocking communications¶
As we have seen, the use of blocking communications ensures that
the send and receive buffers used as MPI_Send and MPI_Recv arguments are safe to use or reuse after the function call returns,
but it also means that, unless there is a simultaneously matching send for each receive, the code may deadlock.
There exists a version of the point-to-point communication calls that returns immediately, before confirming that the send or the receive has completed: these are the nonblocking send and receive functions.
To verify that the data has been copied out of the send buffer a separate call is needed,
To verify that the data has been received into the receive buffer a separate call is needed,
The sender should not modify any part of the send buffer after a nonblocking send operation is called, until the send completes.
The receiver should not access any part of the receive buffer after a nonblocking receive operation is called, until the receive completes.
Nonblocking communications: MPI_Isend and MPI_Irecv. The two nonblocking point-to-point communication calls are then
int MPI_Isend(void *message, int count,
MPI_Datatype datatype, int dest, int tag,
MPI_Comm comm, MPI_Request *send_request);
int MPI_Irecv(void *message, int count,
MPI_Datatype datatype, int source, int tag,
MPI_Comm comm, MPI_Request *recv_request);
The MPI_Request variables substitute the MPI_Status and store information about the status of the pending communication operation.
The way of specifying that such a communication must be completed is a call to MPI_Wait, to which the nonblocking request originating from MPI_Isend or MPI_Irecv is provided as an argument.
Nonblocking communications: an example
%%file ccode/nonblockingsendrecv.c
#include <stdio.h>
#include <mpi.h>
int main(int argc, char **argv) {
int a, b, size, rank, tag = 0;
MPI_Status status;
MPI_Request send_request, recv_request;
MPI_Init(&argc, &argv);
MPI_Comm_size(MPI_COMM_WORLD, &size);
MPI_Comm_rank(MPI_COMM_WORLD, &rank);
if (rank == 0) {
a = 314159;
MPI_Isend(&a, 1, MPI_INT, 1, tag, MPI_COMM_WORLD, &send_request);
MPI_Irecv (&b, 1, MPI_INT, 1, tag, MPI_COMM_WORLD, &recv_request);
MPI_Wait(&send_request, &status);
MPI_Wait(&recv_request, &status);
printf ("Process %d received value %d\n", rank, b);
} else {
a = 667;
MPI_Isend (&a, 1, MPI_INT, 0, tag, MPI_COMM_WORLD, &send_request);
MPI_Irecv (&b, 1, MPI_INT, 0, tag, MPI_COMM_WORLD, &recv_request);
MPI_Wait(&send_request, &status);
MPI_Wait(&recv_request, &status);
printf ("Process %d received value %d\n", rank, b);
}
MPI_Finalize();
return 0;
}
Overwriting ccode/nonblockingsendrecv.c
We can compile our code by simply adding to our Makefile the lines
nonblockingsendrecv: nonblockingsendrecv.c
$(MPICC) $(CFLAGS) $(LDFLAGS) $? $(LDLIBS) -o $@
then we type make nonblockingsendrecv, and we run our program, getting as answer:
!(cd ccode && make nonblockingsendrecv)
!mpiexec -np 2 ./ccode/nonblockingsendrecv
make[1]: Entering directory "/home/cirdan/Documenti/RTDa-PISA/CorsoCalcoloParallelo2021/introtoparallelcomputing/intrompi/ccode"
mpicc -march=nocona -mtune=haswell -ftree-vectorize -fPIC -fstack-protector-strong -fno-plt -O2 -ffunction-sections -pipe -isystem /home/cirdan/anaconda3/envs/parallel/include -g -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,-rpath,/home/cirdan/anaconda3/envs/parallel/lib -Wl,-rpath-link,/home/cirdan/anaconda3/envs/parallel/lib -L/home/cirdan/anaconda3/envs/parallel/lib nonblockingsendrecv.c -lm -ldl -o nonblockingsendrecv
make[1]: Leaving directory "/home/cirdan/Documenti/RTDa-PISA/CorsoCalcoloParallelo2021/introtoparallelcomputing/intrompi/ccode"
Process 0 received value 667
Process 1 received value 314159
Another useful routine for nonblocking communication is
int MPI_Test(MPI_Request *request, int *flag, MPI_Status *status);
A call to MPI_Test returns flag = true if the operation identified by request is complete. In such a case, the status object is set to contain information on the completed operation.
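For instance, a receiver can poll MPI_Test while doing other work; the following minimal sketch (file and variable names are ours) counts how many “work” iterations fit before the message arrives:

/* A minimal sketch (names ours): overlap "work" with a nonblocking receive
   by polling MPI_Test until the message has arrived. Run with 2 processes. */
#include "mpi.h"
#include <stdio.h>

int main(int argc, char **argv){
    int rank, flag = 0, data = 0, work = 0;
    MPI_Request request;
    MPI_Status status;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    if (rank == 0){
        data = 42;
        MPI_Send(&data, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);
    } else if (rank == 1){
        MPI_Irecv(&data, 1, MPI_INT, 0, 0, MPI_COMM_WORLD, &request);
        while (!flag){
            work++;                      /* do something useful meanwhile */
            MPI_Test(&request, &flag, &status);
        }
        printf("Process 1 received %d after %d work iterations\n", data, work);
    }
    MPI_Finalize();
    return 0;
}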
Sendreceive¶
Send-Receive. The send-receive operation combines in one call the sending of a message to one destination and the receiving of another message from another process.
Source and destination are possibly the same,
A send-receive operation is very useful for executing a shift across a chain of processes (see the sketch after the signature below),
A message sent by a send-receive operation can be received by a regular receive operation.
int MPI_Sendrecv(const void *sendbuf, int sendcount,
MPI_Datatype sendtype, int dest, int sendtag,
void *recvbuf, int recvcount, MPI_Datatype recvtype,
int source, int recvtag, MPI_Comm comm,
MPI_Status *status);
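The shift across a chain of processes mentioned above can be sketched as follows (a minimal example with file and variable names of our choosing): every process sends its rank to its right neighbour and receives from its left neighbour in a single call, with no risk of deadlock.

/* A minimal sketch (names ours): a circular shift across the ring of
   processes, each one sending its rank to the right neighbour and
   receiving from the left one with a single MPI_Sendrecv call. */
#include "mpi.h"
#include <stdio.h>

int main(int argc, char **argv){
    int rank, size, right, left, sendval, recvval;
    MPI_Status status;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);
    right = (rank + 1) % size;           /* neighbour to which we send   */
    left  = (rank - 1 + size) % size;    /* neighbour from which we read */
    sendval = rank;
    MPI_Sendrecv(&sendval, 1, MPI_INT, right, 0,
                 &recvval, 1, MPI_INT, left, 0,
                 MPI_COMM_WORLD, &status);
    printf("Process %d received %d from process %d\n", rank, recvval, left);
    MPI_Finalize();
    return 0;
}

Run with any number of processes, each one prints the rank of its left neighbour.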
Send-Receive-Replace. A slight variant of the MPI_Sendrecv operation is the MPI_Sendrecv_replace operation
int MPI_Sendrecv_replace(void* buf, int count,
MPI_Datatype datatype, int dest, int sendtag,
int source, int recvtag,
MPI_Comm comm, MPI_Status *status)
As the name suggests, the same buffer is used both for the send and for the receive, so that the message sent is replaced by the message received. Indeed, if you compare its arguments with those of MPI_Sendrecv, the arguments void *recvbuf and int recvcount are absent.
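The same ring shift can be sketched with MPI_Sendrecv_replace, using a single buffer whose content is overwritten by the received value (again, file and variable names are ours):

/* A minimal sketch (names ours): the same ring shift with a single buffer,
   using MPI_Sendrecv_replace so the value sent is overwritten by the one
   received. */
#include "mpi.h"
#include <stdio.h>

int main(int argc, char **argv){
    int rank, size, right, left, val;
    MPI_Status status;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);
    right = (rank + 1) % size;
    left  = (rank - 1 + size) % size;
    val = rank;                          /* sent to the right, then replaced */
    MPI_Sendrecv_replace(&val, 1, MPI_INT, right, 0, left, 0,
                         MPI_COMM_WORLD, &status);
    printf("Process %d now holds %d (from process %d)\n", rank, val, left);
    MPI_Finalize();
    return 0;
}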