MPI Part2 Updated

The document provides an overview of the Message Passing Interface (MPI), detailing its capabilities for both shared and distributed memory architectures, and its support for various programming languages. It explains the concept of MPI communicators, including the default communicator MPI_COMM_WORLD, and outlines important MPI calls for initiating and finalizing computations, as well as sending and receiving messages. Additionally, it discusses the differences between point-to-point and collective communications, highlighting the benefits of using collective operations for improved code readability and performance.

Distributed Memory Programming Model: MPI

MPI Overview

MPI (Message Passing Interface)

- Can be used for shared-memory as well as distributed-memory architectures (hybrid, if required)
- Supported by Fortran, C, and C++ (modules are also available for Python and Java)
- Hides hardware details of the underlying system, so programs are portable
- Many high-performance libraries have MPI versions of their API calls
- The MPI 3.0 specification has 400+ commands (function calls); knowledge of only 11-12 of them is enough to do the job in more than 90% of cases


MPI Communicators
MPI_COMM_WORLD: name of the default MPI communicator
- A communication universe (communication domain, communication group) for a group of processes
- Communicators are stored in variables of type MPI_Comm
- Communicators are used as arguments to all message-transfer MPI routines
- Each process within a communicator has a rank: a unique integer identifier in the range [0, #processes − 1]
- Multiple communicators can be established in a single MPI program (see the sketch below)
- Intra-communicator: used for communication within a single group
- Inter-communicator: used for communication between two disjoint groups
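A minimal sketch (my own illustration, not from the slides) of establishing an additional communicator: MPI_Comm_split divides MPI_COMM_WORLD into two intra-communicators, one for the even world ranks and one for the odd world ranks, and each process gets a new rank inside its sub-communicator.

#include <stdio.h>
#include <mpi.h>

int main(int argc, char **argv)
{
    int world_rank, world_size, sub_rank;
    MPI_Comm sub_comm;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &world_rank);
    MPI_Comm_size(MPI_COMM_WORLD, &world_size);

    /* "color" selects the group; "key" (here the world rank) orders ranks inside it */
    int color = world_rank % 2;
    MPI_Comm_split(MPI_COMM_WORLD, color, world_rank, &sub_comm);
    MPI_Comm_rank(sub_comm, &sub_rank);

    printf("World rank %d/%d has rank %d in sub-communicator %d\n",
           world_rank, world_size, sub_rank, color);

    MPI_Comm_free(&sub_comm);
    MPI_Finalize();
    return 0;
}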

MPI Communicators (cont.)

[Figure: MPI_COMM_WORLD containing processes P0-P3, with several sub-communicators (COMM 1 to COMM 5) grouping subsets of these processes]


First Look (hellompi.c)

#include <stdio.h>
#include <mpi.h>

int main(int argc, char **argv)
{
    int size, my_rank;

    MPI_Init(&argc, &argv);
    MPI_Comm_size(MPI_COMM_WORLD, &size);
    MPI_Comm_rank(MPI_COMM_WORLD, &my_rank);

    printf("Hello from %d out of %d\n", my_rank, size);

    MPI_Finalize();

    return 0;
}

mpicc hellompi.c                      # Compilation (mpiCC for C++; also gcc hellompi.c -lmpi)
mpirun -np 4 -hostfile filename a.out # Execution


Configuring a Simple MPI-based Distributed Computing Cluster

Requirements

SSH server:
apt-get install openssh-server
OpenMPI library:
apt-get install openmpi-bin openmpi-doc libopenmpi-dev
NFS (Network File System):
apt-get install nfs-server nfs-client


Configuring a Simple MPI-based Distributed Computing Cluster (cont.)

Transferring Files

There are many ways to transfer files. You can set up an NFS mount point, share files using Dropbox, or send files using scp. The scp method is shown below:
scp /location/of/a.out username@ipaddress:/home/username/a.out
Note: every cluster node must be able to find the executable file at the same location as every other cluster node.


Important MPI Calls

MPI_Init(int *argc, char ***argv);                    // Initiate an MPI computation
MPI_Finalize(void);                                   // Terminate an MPI computation
MPI_Comm_size(MPI_Comm comm, int *size);              // How many processes?
MPI_Comm_rank(MPI_Comm comm, int *rank);              // Who am I?
MPI_Get_processor_name(char *name, int *resultlen);   // What is the hostname?
MPI_Wtime(void);                                      // Elapsed wall-clock time in seconds
MPI_Abort(MPI_Comm comm, int errorcode);              // Terminate all processes
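A minimal sketch (my own illustration, not from the slides) exercising the calls above: it reports the host each rank runs on and times a dummy workload with MPI_Wtime.

#include <stdio.h>
#include <mpi.h>

int main(int argc, char **argv)
{
    int size, rank, namelen;
    char hostname[MPI_MAX_PROCESSOR_NAME];

    MPI_Init(&argc, &argv);
    MPI_Comm_size(MPI_COMM_WORLD, &size);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Get_processor_name(hostname, &namelen);

    double start = MPI_Wtime();
    /* ... some work here ... */
    double elapsed = MPI_Wtime() - start;

    printf("Rank %d of %d on %s took %f s\n", rank, size, hostname, elapsed);

    MPI_Finalize();
    return 0;
}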

Sending/Receiving

What may happen in the code for P0 (left) and P1 (right) below?

P0:                       P1:
int a = 100;              int a;
send(&a, P1);             receive(&a, P0);
a = 0;                    printf("%d\n", a);

(Depending on the send semantics discussed next, P1 may print 100 or, if the send returns before the data has actually been copied out and P0 overwrites a, 0.)


Approaches to Send/Receive

Blocking (Non-Buffered) Send/Receive
- Follows some form of "handshaking" protocol:
  Request to Send → Clear to Send → Send Data → Acknowledgement
- Problem 1: idling overhead (on both the sender and the receiver side)
- Problem 2: deadlock (both sides sending at the same time; see the sketch below)

Blocking (Buffered) Send/Receive
- The send copies the data to a designated buffer and returns after the "copy" operation is completed
- Problem 1: buffer size. If the producer below runs faster than the consumer, the buffer keeps filling up:

P0:                              P1:
for (i = 0; i < 1000; i++) {     for (i = 0; i < 1000; i++) {
    produce_data(&a);                receive(&a, P0);
    send(&a, P1);                    consume_data(&a);
}                                }

- Problem 2: deadlock (both sides sending at the same time)
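A minimal sketch (my own illustration, not from the slides) of the deadlock in Problem 2 with blocking, non-buffered sends: if both ranks call MPI_Send first, each blocks waiting for the matching receive and neither progresses. Reordering the calls on one rank (shown here) breaks the cycle; MPI_Sendrecv is another standard way out. Assumes exactly two processes.

#include <stdio.h>
#include <mpi.h>

int main(int argc, char **argv)
{
    int rank, other, a = 42, b;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    other = 1 - rank;                        /* assumes exactly 2 processes */

    /* Deadlock-prone version (both ranks send first):
     *   MPI_Send(&a, 1, MPI_INT, other, 0, MPI_COMM_WORLD);
     *   MPI_Recv(&b, 1, MPI_INT, other, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
     * Safe version: reverse the order on one of the ranks. */
    if (rank == 0) {
        MPI_Send(&a, 1, MPI_INT, other, 0, MPI_COMM_WORLD);
        MPI_Recv(&b, 1, MPI_INT, other, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
    } else {
        MPI_Recv(&b, 1, MPI_INT, other, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
        MPI_Send(&a, 1, MPI_INT, other, 0, MPI_COMM_WORLD);
    }

    printf("Rank %d received %d\n", rank, b);
    MPI_Finalize();
    return 0;
}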


Approaches to Send/Receive (cont.)

Non-Blocking Send/Receive
- Returns from the send/receive operation before it is semantically "safe" to do so
- It is the programmer's responsibility to ensure that the data being sent is not altered before the transfer completes

Blocking operations: safe and easy programming (at the cost of overhead and the risk of deadlocks)
Non-blocking operations: useful for performance optimization and for breaking deadlocks, but they bring in plenty of race conditions if the programmer is not careful (see the sketch below)
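A minimal sketch (my own illustration, not from the slides) of the non-blocking pattern: both ranks post MPI_Irecv and MPI_Isend, may do unrelated work, then call MPI_Waitall before touching the buffers. Because neither call blocks, the "both send first" deadlock above cannot occur. Assumes exactly two processes.

#include <stdio.h>
#include <mpi.h>

int main(int argc, char **argv)
{
    int rank, other, a, b;
    MPI_Request reqs[2];

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    other = 1 - rank;                        /* assumes exactly 2 processes */
    a = rank * 100;

    /* Post both operations; neither call blocks */
    MPI_Irecv(&b, 1, MPI_INT, other, 0, MPI_COMM_WORLD, &reqs[0]);
    MPI_Isend(&a, 1, MPI_INT, other, 0, MPI_COMM_WORLD, &reqs[1]);

    /* ... useful computation that touches neither a nor b ... */

    MPI_Waitall(2, reqs, MPI_STATUSES_IGNORE);   /* now a may be reused and b is valid */
    printf("Rank %d received %d\n", rank, b);

    MPI_Finalize();
    return 0;
}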

Collective Communication
Collective communication involves communication of data among all processes inside a given communicator; the default communicator that contains all available processes is called MPI_COMM_WORLD. Whenever a collective call is made, it must be called by all processes inside the communicator. Collective communications will not interfere with point-to-point communications, nor will point-to-point communications interfere with collective communications. Collective communications also do not need tags. The send and receive buffers used in collective communication calls must match for the call to work, and there is no guarantee that a function will be synchronizing (except for the barrier). Also, the collective communication operations discussed here are blocking. These are some things to keep in mind while using collective communication operations.

Point-to-Point Communication

MPI provides a set of send and receive functions that allow communication of typed data with an associated message tag. Typing of the message contents is necessary for heterogeneous support: the type information is needed so that correct data-representation conversions can be performed as data is sent from one architecture to another. The tag allows selectivity of messages at the receiving end: one can receive on a particular tag, or one can wild-card this quantity, allowing reception of messages with any tag. Message selectivity on the source process of the message is also provided.

Types of Point-to-Point Send/Receive Calls

Synchronous transfer: the send/receive routines return only when the message transfer has completed. This not only transfers data, it also synchronizes the processes.
MPI_Send()   // Blocking send
MPI_Recv()   // Blocking receive

Asynchronous transfer: the send/receive routines do not wait for the data transfer to complete and proceed with the next instruction. (Precaution: do not modify the send/receive buffers until the transfer has completed.)
MPI_Isend()  // Non-blocking send
MPI_Irecv()  // Non-blocking receive

Point-to-Point Communication (cont.)

Sending

int MPI_Send(void *buffer, int count, MPI_Datatype datatype,
             int destination, int tag, MPI_Comm comm);

- Send the data stored in buffer
- count is the number of entries in the buffer
- datatype is the type of the entries in the buffer (MPI_CHAR, MPI_INT, MPI_FLOAT, MPI_DOUBLE, MPI_LONG_DOUBLE, MPI_LONG, MPI_SHORT, MPI_UNSIGNED_CHAR, etc.)
- destination is the rank of the process, residing in the communication universe comm, to whom the buffer is to be sent
- tag is the tag of the message (to distinguish between different types of messages)


Point-to-Point Communication (cont.)

Receiving

int MPI_Recv(void *buffer, int count, MPI_Datatype datatype,
             int source, int tag, MPI_Comm comm,
             MPI_Status *status);

- Store the received message in buffer
- count is the number of entries the buffer can receive. If the incoming message is larger than the capacity of the buffer, an overflow error MPI_ERR_TRUNCATE is returned.
- datatype is the type of the data to be received
- source is the rank of the process, residing in the communication domain comm, from whom the buffer is received. source can be hard-set, or the wild-card MPI_ANY_SOURCE.
- To retrieve a message of a certain type, set the tag argument. If there are many messages with the same tag from the same process, any one of them may be retrieved. If a message of any tag is to be retrieved, use the wild-card MPI_ANY_TAG.
- Store the status of the received message in status (next slide). If it is not needed, use MPI_STATUS_IGNORE.
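A minimal sketch (my own illustration, not from the slides) combining the two calls above: rank 0 sends an array of doubles with a tag, and rank 1 receives it with the wild-cards and inspects the MPI_Status fields. MPI_Get_count (not covered on the slides) reads the number of received entries from the status object.

#include <stdio.h>
#include <mpi.h>

int main(int argc, char **argv)
{
    int rank, count;
    double data[4] = {1.0, 2.0, 3.0, 4.0};
    MPI_Status status;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    if (rank == 0) {
        MPI_Send(data, 4, MPI_DOUBLE, 1, 99, MPI_COMM_WORLD);   /* tag 99 */
    } else if (rank == 1) {
        MPI_Recv(data, 4, MPI_DOUBLE, MPI_ANY_SOURCE, MPI_ANY_TAG,
                 MPI_COMM_WORLD, &status);
        MPI_Get_count(&status, MPI_DOUBLE, &count);
        printf("Got %d doubles from rank %d with tag %d\n",
               count, status.MPI_SOURCE, status.MPI_TAG);
    }

    MPI_Finalize();
    return 0;
}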



Collective Communications

Point-to-point: it is the programmer's responsibility to ensure that all processes participate correctly in a given communication (the programmer's burden). MPI simplifies this using collective communication.

Collective communications transmit data among all processes in a group specified by an intra-communicator object. One function, the barrier function, serves to synchronize processes without passing data: no process returns from the barrier function until all processes in the group have called it. A barrier is a simple way of separating two phases of computation to ensure that the messages generated in the two phases do not intermingle.

Types are:
- Synchronization (barriers): MPI_Barrier()
- Moving data: broadcasting MPI_Bcast(), scattering MPI_Scatter(), gathering MPI_Gather()
- Collective computation: reduction MPI_Reduce()

Differences from point-to-point communications:
- No message tags
- Most calls/versions support blocking communication only

MPI provides the following collective communication functions:

- Barrier synchronization across all group members
- Global communication functions (data-movement routines):
  - Broadcast of the same data from one member to all members of a group
  - Gather data from all group members to one member
  - Scatter different data from one member to the other members of a group
  - A variation on gather where all members of the group receive the result
  - Scatter/gather data from all members to all members of a group (also called complete exchange or all-to-all)
- Global reduction operations, such as sum and product, max and min, bitwise and logical operations, or user-defined functions:
  - A reduction where the result is returned to all group members, and a variation where the result is returned to only one member
  - A combined reduction and scatter operation
Code readability and maintainability
- It is easier to read and maintain code written with collectives. For example, if every process needs to send something to every other process, this would require on the order of N^2 point-to-point communications; with a collective it is one simple call (a sketch follows the list below).

Performance
- MPI implementations provide algorithms that are optimized for collective communication. As mentioned above, we can also save a lot of time by having one call instead of several.
The five major ways of communication that MPI implements are:
- Barriers: wait for the others before proceeding (uses Barrier)
- All-to-one: all processes send data to one (uses Gather and Allgather)
- One-to-all: one process sends data to all processes (uses Broadcast and Scatter)
- All-to-all: all processes send data to all processes (uses Alltoall)
- Combining results: collect results from every process and do something with them (uses Reduce)
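A minimal sketch (my own illustration, not from the slides) of the collective calls named above: rank 0 broadcasts a parameter to every rank, each rank computes a partial value, and MPI_Reduce combines the partial values with a sum on rank 0, replacing what would otherwise be many point-to-point messages.

#include <stdio.h>
#include <mpi.h>

int main(int argc, char **argv)
{
    int rank, size, n = 0;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    if (rank == 0) n = 1000;                           /* parameter known only to rank 0 */
    MPI_Bcast(&n, 1, MPI_INT, 0, MPI_COMM_WORLD);      /* now every rank has n */

    int partial = rank * n;                            /* each rank's local contribution */
    int total = 0;
    MPI_Reduce(&partial, &total, 1, MPI_INT, MPI_SUM, 0, MPI_COMM_WORLD);

    if (rank == 0)
        printf("Sum of rank*n over %d ranks = %d\n", size, total);

    MPI_Finalize();
    return 0;
}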
Communication Domains

A communicator object specifies a communication domain which can be used for point-to-point communications.

An intra-communicator is used for communicating within a single group of processes. The intra-communicator has fixed attributes that, for example, describe the process group and the topology of the processes in the group. Intra-communicators are also used for collective operations within a group of processes.

An inter-communicator is used for point-to-point communication between two disjoint groups of processes. The fixed attributes of an inter-communicator are the two groups. No topology is associated with an inter-communicator.
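A minimal sketch (my own illustration, not from the slides) of an inter-communicator: MPI_Comm_split first forms two disjoint intra-communicators, MPI_Intercomm_create then joins them, and a point-to-point message is sent from the leader of one group to the leader of the other (on an inter-communicator, the destination/source rank refers to the remote group). Assumes at least two processes.

#include <stdio.h>
#include <mpi.h>

int main(int argc, char **argv)
{
    int world_rank, color, value;
    MPI_Comm local_comm, inter_comm;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &world_rank);

    /* Two disjoint groups: even world ranks (color 0) and odd world ranks (color 1) */
    color = world_rank % 2;
    MPI_Comm_split(MPI_COMM_WORLD, color, world_rank, &local_comm);

    /* Build an inter-communicator between the two groups.
     * Local leader: rank 0 of local_comm; remote leader: world rank 1 - color. */
    MPI_Intercomm_create(local_comm, 0, MPI_COMM_WORLD, 1 - color, 99, &inter_comm);

    /* Point-to-point across the groups: leader of group 0 sends to leader of group 1 */
    if (color == 0 && world_rank == 0) {
        value = 123;
        MPI_Send(&value, 1, MPI_INT, 0, 0, inter_comm);   /* rank 0 of the remote group */
    } else if (color == 1 && world_rank == 1) {
        MPI_Recv(&value, 1, MPI_INT, 0, 0, inter_comm, MPI_STATUS_IGNORE);
        printf("Remote leader received %d\n", value);
    }

    MPI_Comm_free(&inter_comm);
    MPI_Comm_free(&local_comm);
    MPI_Finalize();
    return 0;
}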
