0% found this document useful (0 votes)

20 views8 pages

Parallel Solutions for Poisson's Equation

(1) The document discusses solving partial differential equations using the finite difference method. It describes partitioning a region into a grid and using central difference approximations to derive the finite difference equations. (2) It then explains how to solve these equations in parallel using multiple processors. Each processor is assigned a portion of the grid and computes the solution for those points. Values at boundary points must be communicated between processors. (3) An example application to Poisson's equation is given. Testing on networks of workstations and PC clusters shows that total computation time decreases as more processors are used, but efficiency declines slowly with increasing numbers of processors.

Uploaded by

Ioan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views8 pages

Parallel Solutions for Poisson's Equation

Uploaded by

Ioan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

Finite Difference Method

We consider the elliptic partial differential equation, knows as Poisson’s equation,

where
To solve Poisson’s equation by difference method, the region is partitioned into a

grid consisting of n x m rectangles with sides h and k. The mesh point are given by
, , ,
By using central difference approximations for the special derivatives, the finite

difference equation is

Often it is desirable to set h=k, the equation becomes

(i+1,j)

(i,j-1) (i,j) (i,j+1)

(i-1,j)

We can also solve this equation by parallel computing. Suppose first that the parallel

system consists of a mesh connected array of p processors arranged in a two-

dimensional lattice. Suppose that . Then it is natural to assign m unknowns to

each processor. Each processor will proceed to compute iterates for the unknowns it

holds. On a local memory system, at the end of each iteration the new iterates at

certain grid points will need to be transmitted to adjacent processors.

28
We denote by ” internal boundary values “ those values of that are needed by other
processors at the next iteration and must be transmitted.

xxxxxxxxxxxxxxxxxxxxxxx
x x
x x
x x
x x
x x.
xxxxxxxxxxxxxxxxxxxxxxxx

For example, for the situation the computation in processor would be

compute ; send to
compute , send to

From the above, we know that there are different types of connection between the

processors. We choose the type of connection depend on the structure of the matrix.

Example:

29
A star A ring A cube

Example 9

We have poisson equation

where
The region is partitioned into a grid consisting of 240x240 rectangles.

Result:

No of processor 1 4 9 16
Time used in each processor calculating x 17.64 12.17 5.57 3.27
Total time used in this function 21.08 13.4 7.42 4.51

(1) In the Scientific Computing Lab, the result shows that the total time used in this

method is decreasing if more processors are used. E.g. When 9 computers are used,

the total time used is 7.42s. It accelerates about triple times comparing with that when

single processor is used. The time needed to calculate x becomes shortened

5.24+0.23=5.57s. It is because each computer just needs to calculate 1/9 part of x. The

time used in ‘Send’ and ‘Receive’ is very small and the time mainly used in

calculation. So we find that this method can apply parallel algorithm.

The speedup of parallel algorithm is

=execution time for a single processor / execution time using p processors

30
p 1 4 9 16
Speed up, 1 1.5731 2.841 4.6741

Efficiency, 1 0.3933 0.3157 0.2921

We find that the efficiency of the algorithm is decreasing slowly. Since the number of

processor is limited in our network of work-stations, we need to use PC cluster to see

the development of curve when more processors are used.

The upper line shows the ideal case where Sp=1.

(2) In TDG Cluster, the result is similar. The number of processor is increased and we
can use up to 100 CPUs.

No of processor, p 1 4 9 16 25 36 64 100
Total time used in this function 39.68 16.17 7.8 4.25 2.86 2.28 1.92 1.85

31
Speed up, 1 2.4539 5.0872 9.3365 13.874 17.4035 20.0667 21.4486

Efficiency, 1 0.6135 0.5652 0.5835 0.555 0.4834 0.3135 0.2145

From the table, we know that the efficiency is decreasing gradually. It means that the

development of Sp is increased slowly.

We notice that the curve of speed up becomes a horizontal line at the end. It is

because when we use more than 70 processors, the time used in calculation is also

around 2 seconds and the time used in message passing is no changed, so the total

time is almost the same.

Conclusion
In conclusion, we know that the jacobi method can apply parallel algorithm. We can

connect several processors to calculate the x at the same time, it will decrease the time

32
used. But the Gauss-Seidel method need the latest information as new as possible, so

the processors can not run the program at the same time and it can not apply parallel

algorithm.

For jacobi method, we need to notice the time used in message passing. It is because

the time need to broadcast or gather the x is very long. The solution is we can apply

the property of the matrix’s structure. Ex. For the sparse matrix, we just need to send

the necessary data to the adjacent processor, it will decrease the time used in

transferring data. In addition, we just use this parallel algorithm if the program need

extensive time to solve in one processor, i.e. the matrix’s size is very large or it need

many iterates to convergent, then the efficiency will be more evident.

The main factors that cause degradation from perfect speedup are:

(i) Lack of a perfect degree of parallelism in the algorithm and/or lack of perfect

load balance;

(ii) Communication, contention, and synchronization time

By load balancing we mean the assignment of tasks to the processors of the system so

as to keep each processor doing useful work as much as possible.

We expect to need a parallel machine only for those problems too large to run in a

feasible time on a single processor.

Bibliography

[1] K. A. Gallivan, Michael T. Heath, Esmond Ng, James M. Ortega, Barry W.

Peyton, R. J. Plemmons, Charles H. Romine, A. H. Sameh, Robert G. Voigt,

33
Parallel Algorithms for Matrix Computations, Society for Industrial and
Applied Mathematics, 1990

[2] U. Schendel, Introduction to Numerical Methods for Parallel Computers, Ellis

Horwood Limited, 1984

[3] Marc Snir, MPI--the complete reference, Cambridge, Mass, 1998

[4] C. T. Kelley, Iterative methods for linear and nonlinear equations, Society for
Industrial and Applied Mathematics, 1995

[5] Anne Greenbaum, Iterative Method for Solving Linear Systems, Society of
Industrial and Applied Mathematics, 1997

[6] Wolfgang Hackbusch, Iterative Solution of Large Sparse Systems of

Equations, Springer-Verlag, 1994

Appendix

Total time used in different method

34
Example 1 Jacobi method (matrix size:6x6)
Gauss-Seidel method (matrix size:6x6)

Example 2: Jacobi method (matrix size:6x6)

Comparing the cases when 1,3 clusters are used.

Example 3: Jacobi method (Sparse matrix:57600x57600)

Comparing the cases when 1,2,4,8 clusters are used.

Example 4: Jacobi method (Sparse matrix:921600x921600)

Comparing the cases when 1,2,4,8 clusters are used.

Example 5: Gauss-Seidel method (matrix size:6x6)

Comparing the cases when 1,3 clusters are used.

Example 6: Gauss-Seidel method (Sparse matrix:57600x57600)

Comparing the cases when 1,5 clusters are used.

Example 7: Conjugate gradient method (Sparse matrix) (diagonal=5)

Comparing the cases when 1,2,4 clusters are used.

Example 8: Conjugate gradient method (Sparse matrix) (diagonal=4)

Comparing the cases when 1,2,4,8 clusters are used.

Example 9: Finite different method (Poisson’s matrix)

Comparing the cases when 1,4,9,16,25,36,64,100 clusters are used.

Profile Report
(1) Network of work-stations
(2) PC Cluster

Content PDF
No ratings yet
Content PDF
14 pages
Parallel-Port-Example-Computer-Science-2004-7-7-The-Point-Jacobi-Iteration - PRG Örnekleri
No ratings yet
Parallel-Port-Example-Computer-Science-2004-7-7-The-Point-Jacobi-Iteration - PRG Örnekleri
60 pages
Mid Sem2
No ratings yet
Mid Sem2
2 pages
2022 Mid 1
No ratings yet
2022 Mid 1
4 pages
Thesis 1997 Abdullah
No ratings yet
Thesis 1997 Abdullah
259 pages
Parallel Computing
No ratings yet
Parallel Computing
30 pages
Numerical Linear Algebra
No ratings yet
Numerical Linear Algebra
45 pages
Numerical Methods For Partial Differential Algebraic Systems of Equations
No ratings yet
Numerical Methods For Partial Differential Algebraic Systems of Equations
61 pages
Lecture 4: Principles of Parallel Algorithm Design (Part 4)
No ratings yet
Lecture 4: Principles of Parallel Algorithm Design (Part 4)
27 pages
BL - En.u4ece22033 - M Bhanu Charan
No ratings yet
BL - En.u4ece22033 - M Bhanu Charan
24 pages
Compre 1
No ratings yet
Compre 1
2 pages
Ee8218 Lab2
No ratings yet
Ee8218 Lab2
7 pages
Mpi Course
No ratings yet
Mpi Course
202 pages
Domain Decomposition for Parallel Computing
No ratings yet
Domain Decomposition for Parallel Computing
17 pages
Parallel Computing in CFD: Milovan Perić
No ratings yet
Parallel Computing in CFD: Milovan Perić
25 pages
Parallel Computing Challenges & Trends
No ratings yet
Parallel Computing Challenges & Trends
81 pages
Parallel Numerical Methods Overview
No ratings yet
Parallel Numerical Methods Overview
46 pages
Quiz For Chapter 7 With Solutions
No ratings yet
Quiz For Chapter 7 With Solutions
8 pages
Notes About Numerical Methods With Matlab
No ratings yet
Notes About Numerical Methods With Matlab
50 pages
Lecture 1: Sci. Comp. For Dphil Stduents
No ratings yet
Lecture 1: Sci. Comp. For Dphil Stduents
5 pages
Matrix Multiplication Optimization
No ratings yet
Matrix Multiplication Optimization
32 pages
SET-02 - SOCS - ESE-DEC23 - B.Tech (CSE-H+NH) - All Spec. - 3 - CSEG2021 - Design and Analysis of Algorithm
No ratings yet
SET-02 - SOCS - ESE-DEC23 - B.Tech (CSE-H+NH) - All Spec. - 3 - CSEG2021 - Design and Analysis of Algorithm
4 pages
Exercise 9
No ratings yet
Exercise 9
5 pages
18-Assignment 1 - Solution
No ratings yet
18-Assignment 1 - Solution
12 pages
Using The Gaussian Elimination Method For Large Banded Matrix Equations
No ratings yet
Using The Gaussian Elimination Method For Large Banded Matrix Equations
75 pages
CS-218 Data Structures Final Exam 2020
100% (2)
CS-218 Data Structures Final Exam 2020
7 pages
Artikel Internasional
No ratings yet
Artikel Internasional
7 pages
Discrete-Time Signals and Systems: H. C. So Semester B, 2011-2012
No ratings yet
Discrete-Time Signals and Systems: H. C. So Semester B, 2011-2012
50 pages
Unit-3-Floyd Warshal Algorithm
No ratings yet
Unit-3-Floyd Warshal Algorithm
22 pages
++probleme Tot
No ratings yet
++probleme Tot
22 pages
Outline of Next 2 Lectures: Matrix Computations: Direct Methods I
No ratings yet
Outline of Next 2 Lectures: Matrix Computations: Direct Methods I
16 pages
CS Fundamentals Exam 2022-2023 Marking Schema
No ratings yet
CS Fundamentals Exam 2022-2023 Marking Schema
9 pages
Calculus and Vectors First Edition Chris Kirkpatrick PDF Available
No ratings yet
Calculus and Vectors First Edition Chris Kirkpatrick PDF Available
139 pages
Data Structures Unit - 1 1. Algorithm
No ratings yet
Data Structures Unit - 1 1. Algorithm
64 pages
Unit 3
No ratings yet
Unit 3
10 pages
Alg2025sp hw3
No ratings yet
Alg2025sp hw3
4 pages
Introduction To Matlab, Signal Processing & Speech Signal Processing
No ratings yet
Introduction To Matlab, Signal Processing & Speech Signal Processing
49 pages
Dis Top Tim Notes 1
No ratings yet
Dis Top Tim Notes 1
3 pages
Pram
No ratings yet
Pram
23 pages
Linear Algebra for STEM Students
No ratings yet
Linear Algebra for STEM Students
71 pages
Old Spec D1
No ratings yet
Old Spec D1
158 pages
DSP Midterm Exam for Undergrads
No ratings yet
DSP Midterm Exam for Undergrads
5 pages
Matrix Computation For Engineers and Scientist by Jennings
100% (1)
Matrix Computation For Engineers and Scientist by Jennings
348 pages
SEE 312 Assignment 3 Guidelines
No ratings yet
SEE 312 Assignment 3 Guidelines
3 pages
C++ Software for Solving Linear Equations
No ratings yet
C++ Software for Solving Linear Equations
60 pages
Solution of Linear Algebraic Equations
No ratings yet
Solution of Linear Algebraic Equations
6 pages
Performance Analysis of Different Iterative Solvers Parallelized On Gpu Architecture
No ratings yet
Performance Analysis of Different Iterative Solvers Parallelized On Gpu Architecture
8 pages
DSP Lab 1 Fall 20.PDF NEW
No ratings yet
DSP Lab 1 Fall 20.PDF NEW
12 pages
Parallel Computing Project Topics
No ratings yet
Parallel Computing Project Topics
2 pages
Parallel Algorithms Underlying MPI Implementations
No ratings yet
Parallel Algorithms Underlying MPI Implementations
55 pages
Assignment: Application of Graphs in Computer Programming
No ratings yet
Assignment: Application of Graphs in Computer Programming
11 pages
Iterative Methods for Sparse Systems
No ratings yet
Iterative Methods for Sparse Systems
24 pages
Assignment No. 2 PDC 21L-1786
No ratings yet
Assignment No. 2 PDC 21L-1786
6 pages
Lecture-1 New
No ratings yet
Lecture-1 New
71 pages
Scilab Basics: Matrices, Functions, Plots
No ratings yet
Scilab Basics: Matrices, Functions, Plots
32 pages
Mathematical Modeling for Experts
100% (1)
Mathematical Modeling for Experts
32 pages
CS500 Sheet2 Answer 2021
No ratings yet
CS500 Sheet2 Answer 2021
5 pages
COEP B.Tech Provisional Grade Report
No ratings yet
COEP B.Tech Provisional Grade Report
1 page
WWW - Manaresults.Co - In: - (5+5) 3.A) Obtain The Parallel Realization of The System Described by The Difference Equation
No ratings yet
WWW - Manaresults.Co - In: - (5+5) 3.A) Obtain The Parallel Realization of The System Described by The Difference Equation
2 pages
Tutorial 2 - Sol
No ratings yet
Tutorial 2 - Sol
2 pages
Jacobian and Newton's Method
No ratings yet
Jacobian and Newton's Method
5 pages
Introduction To Data Structures
No ratings yet
Introduction To Data Structures
39 pages
Course Title: Fundamentals of Deep Learning Lab: BTECH Programme: AI&DS
No ratings yet
Course Title: Fundamentals of Deep Learning Lab: BTECH Programme: AI&DS
81 pages
DSP 4
No ratings yet
DSP 4
4 pages
Lecture Slide 10
No ratings yet
Lecture Slide 10
48 pages
BCS Theory
No ratings yet
BCS Theory
14 pages
MPC for Autonomous Ground Vehicles
No ratings yet
MPC for Autonomous Ground Vehicles
12 pages
Understanding The RSA Algorithm
No ratings yet
Understanding The RSA Algorithm
7 pages
ADC Guide for Firebird-V Robot
No ratings yet
ADC Guide for Firebird-V Robot
116 pages
Linear Programming: Principles & Applications
No ratings yet
Linear Programming: Principles & Applications
3 pages
Module 4.1 Introduction To Signal Space
No ratings yet
Module 4.1 Introduction To Signal Space
52 pages
Newton's Divided Difference Interpolation
No ratings yet
Newton's Divided Difference Interpolation
23 pages
GR 11 L7 Linear Programming
No ratings yet
GR 11 L7 Linear Programming
18 pages
Self-Supervised Learning For Tool Wear Monitoring
No ratings yet
Self-Supervised Learning For Tool Wear Monitoring
30 pages
AI 3rd Assignment
No ratings yet
AI 3rd Assignment
7 pages
Eigenvalues in Classical Dynamics
No ratings yet
Eigenvalues in Classical Dynamics
27 pages
An Acoustic Approach To Drone Indentification Using Machine Learning
No ratings yet
An Acoustic Approach To Drone Indentification Using Machine Learning
129 pages
CIT756 July 2017
No ratings yet
CIT756 July 2017
2 pages
What Is AI Project Cycle
No ratings yet
What Is AI Project Cycle
6 pages
L22
No ratings yet
L22
11 pages
Frequency Synchronization Based Algorithmic Trading Using MATLAB
No ratings yet
Frequency Synchronization Based Algorithmic Trading Using MATLAB
10 pages
CS20A - Assignment 5
No ratings yet
CS20A - Assignment 5
4 pages
Igcse Computer Science - 15 Marker Answering Guide (Concise, Exam Ready)
No ratings yet
Igcse Computer Science - 15 Marker Answering Guide (Concise, Exam Ready)
5 pages
What Is Unit Root in A Time Series - Quora
No ratings yet
What Is Unit Root in A Time Series - Quora
2 pages
Semas
No ratings yet
Semas
116 pages
QM2 Assignment1
No ratings yet
QM2 Assignment1
2 pages

Parallel Solutions for Poisson's Equation

Uploaded by

Parallel Solutions for Poisson's Equation

Uploaded by

Finite Difference Method

We consider the elliptic partial differential equation, knows as Poisson’s equation,

Often it is desirable to set h=k, the equation becomes

(i,j-1) (i,j) (i,j+1)

system consists of a mesh connected array of p processors arranged in a two-

dimensional lattice. Suppose that . Then it is natural to assign m unknowns to

certain grid points will need to be transmitted to adjacent processors.

For example, for the situation the computation in processor would be

We have poisson equation

single processor is used. The time needed to calculate x becomes shortened

calculation. So we find that this method can apply parallel algorithm.

The speedup of parallel algorithm is

Efficiency, 1 0.3933 0.3157 0.2921

processor is limited in our network of work-stations, we need to use PC cluster to see

the development of curve when more processors are used.

The upper line shows the ideal case where Sp=1.

Efficiency, 1 0.6135 0.5652 0.5835 0.555 0.4834 0.3135 0.2145

development of Sp is increased slowly.

time is almost the same.

many iterates to convergent, then the efficiency will be more evident.

(ii) Communication, contention, and synchronization time

as to keep each processor doing useful work as much as possible.

feasible time on a single processor.

[1] K. A. Gallivan, Michael T. Heath, Esmond Ng, James M. Ortega, Barry W.

[2] U. Schendel, Introduction to Numerical Methods for Parallel Computers, Ellis

[3] Marc Snir, MPI--the complete reference, Cambridge, Mass, 1998

[6] Wolfgang Hackbusch, Iterative Solution of Large Sparse Systems of

Total time used in different method

Example 2: Jacobi method (matrix size:6x6)

Example 3: Jacobi method (Sparse matrix:57600x57600)

Example 4: Jacobi method (Sparse matrix:921600x921600)

Example 5: Gauss-Seidel method (matrix size:6x6)

Example 6: Gauss-Seidel method (Sparse matrix:57600x57600)

Example 7: Conjugate gradient method (Sparse matrix) (diagonal=5)

Example 8: Conjugate gradient method (Sparse matrix) (diagonal=4)

Example 9: Finite different method (Poisson’s matrix)

You might also like