HPC Lecture 2 Points
Agenda
- Shared Memory
- All processors access a single global address space.
- Fast data sharing.
- Advantages:
- Global address space provides a user-friendly programming perspective.
- Data sharing between tasks is fast and uniform due to proximity of memory to CPUs.
- Disadvantages:
- Lack of scalability between memory and CPUs.
- Programmer is responsible for synchronizing access to global memory.
- Expensive to design and produce shared memory machines with many processors.
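A small illustrative sketch of the shared-memory model (assuming C with OpenMP, compiled with -fopenmp; not from the lecture): all threads work on the same variables in one address space, and the critical section is the kind of programmer-managed synchronization noted above.

```c
#include <stdio.h>
#include <omp.h>

int main(void) {
    const int n = 1000000;
    double sum = 0.0;            /* shared: lives in the single global address space */

    #pragma omp parallel         /* team of threads, all sharing 'sum' */
    {
        double local = 0.0;
        #pragma omp for          /* loop iterations divided among the threads */
        for (int i = 0; i < n; i++)
            local += (double)i;

        #pragma omp critical     /* programmer-supplied synchronization on the shared sum */
        sum += local;
    }

    printf("sum = %.0f using up to %d threads\n", sum, omp_get_max_threads());
    return 0;
}
```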
- Distributed Memory
- Each processor has its own memory.
- Scalable; no overhead for cache coherency.
- Advantages:
- Memory is scalable with the number of processors.
- Each processor accesses its own memory without interference or cache coherency issues.
- Cost-effective, using off-the-shelf processors and networking.
- Disadvantages:
- Programmer responsible for communication between processors.
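By contrast, a minimal distributed-memory sketch (assuming MPI, built with mpicc and launched with mpirun; illustrative only): every rank owns a private copy of its data, and values move only through explicit messages, which is the communication burden placed on the programmer.

```c
#include <stdio.h>
#include <mpi.h>

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);

    int rank, size;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    int local = rank * 10;       /* each process has its own private memory */

    if (rank == 0) {
        for (int src = 1; src < size; src++) {
            int incoming;        /* data arrives only via explicit receives */
            MPI_Recv(&incoming, 1, MPI_INT, src, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
            printf("rank 0 received %d from rank %d\n", incoming, src);
        }
    } else {
        MPI_Send(&local, 1, MPI_INT, 0, 0, MPI_COMM_WORLD);  /* explicit send to rank 0 */
    }

    MPI_Finalize();
    return 0;
}
```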
Multithreading vs. Multiprocessing
- Threads
- Share the same memory space and global variables.
- Processes
- Separate program with its own variables, stack, and memory allocation.
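A small sketch of the difference (assuming a POSIX system with pthreads and fork; illustrative, not from the lecture): the thread's write to the shared global is visible to main, while the forked process modifies only its own copy.

```c
#include <stdio.h>
#include <unistd.h>
#include <sys/wait.h>
#include <pthread.h>

int counter = 0;                          /* global variable */

void *thread_body(void *arg) {
    (void)arg;
    counter = 42;                         /* threads share the same address space */
    return NULL;
}

int main(void) {
    pthread_t t;
    pthread_create(&t, NULL, thread_body, NULL);
    pthread_join(t, NULL);
    printf("after thread:  counter = %d\n", counter);   /* 42: write is visible */

    pid_t pid = fork();
    if (pid == 0) {                       /* child process: its own copy of 'counter' */
        counter = 99;
        _exit(0);
    }
    wait(NULL);
    printf("after process: counter = %d\n", counter);   /* still 42: child's write not visible */
    return 0;
}
```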
1. Understand the Problem
- Identify bottlenecks.
- Investigate other algorithms if possible. This may be the single most important consideration when designing a parallel application.
2. Partitioning
- Break the problem into chunks for distribution across tasks (decomposition).
- Types of Partitioning:
- Domain Decomposition: Split data for each parallel task.
- Functional Decomposition: Split based on the computation needed.
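A short sketch of domain decomposition (the helper block_range is hypothetical, not from the lecture): the data range is split into contiguous, nearly equal chunks, one per task.

```c
#include <stdio.h>

/* Hypothetical helper: the slice [lo, hi) owned by 'task' when n elements
   are split as evenly as possible across 'ntasks' workers.               */
static void block_range(int n, int ntasks, int task, int *lo, int *hi) {
    int base = n / ntasks;                /* minimum chunk size              */
    int rem  = n % ntasks;                /* first 'rem' tasks get one extra */
    *lo = task * base + (task < rem ? task : rem);
    *hi = *lo + base + (task < rem ? 1 : 0);
}

int main(void) {
    int n = 10, ntasks = 3;
    for (int t = 0; t < ntasks; t++) {
        int lo, hi;
        block_range(n, ntasks, t, &lo, &hi);
        printf("task %d owns elements [%d, %d)\n", t, lo, hi);
    }
    return 0;
}
```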
- Scope of Communication
- Point-to-Point: Two tasks communicate (producer and consumer).
- Collective: Multiple tasks share data in groups.
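To make the two scopes concrete, a hedged MPI sketch (illustrative values only): the MPI_Send/MPI_Recv pair is point-to-point between a producer and a consumer, while MPI_Reduce involves every task in the group.

```c
#include <stdio.h>
#include <mpi.h>

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);
    int rank, size;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    /* Point-to-point: rank 0 (producer) sends one value to rank 1 (consumer). */
    int token = 7;
    if (rank == 0 && size > 1)
        MPI_Send(&token, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);
    else if (rank == 1)
        MPI_Recv(&token, 1, MPI_INT, 0, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);

    /* Collective: all ranks contribute to a sum gathered on rank 0. */
    int contribution = rank + 1, total = 0;
    MPI_Reduce(&contribution, &total, 1, MPI_INT, MPI_SUM, 0, MPI_COMM_WORLD);

    if (rank == 0)
        printf("collective sum over %d ranks = %d\n", size, total);

    MPI_Finalize();
    return 0;
}
```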
- Data Dependencies
- Dependencies affect program order and inhibit parallelism.
- Handling Dependencies:
- Distributed Memory: Communicate data at synchronization points.
- Shared Memory: Synchronize read/write operations.
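A brief sketch of both cases (assuming OpenMP; illustrative): the first loop has a loop-carried dependency, so its iterations must stay in program order, while the second loop is independent and its shared read/write on the accumulator is synchronized by the reduction clause.

```c
#include <stdio.h>

#define N 1000

int main(void) {
    double a[N];
    a[0] = 1.0;

    /* Loop-carried dependency: a[i] needs a[i-1] from the previous
       iteration, which inhibits parallel execution of this loop.    */
    for (int i = 1; i < N; i++)
        a[i] = a[i - 1] * 1.001;

    /* No dependency between iterations; the shared update of 'sum'
       is synchronized by the OpenMP reduction clause.               */
    double sum = 0.0;
    #pragma omp parallel for reduction(+:sum)
    for (int i = 0; i < N; i++)
        sum += a[i];

    printf("sum = %f\n", sum);
    return 0;
}
```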
HPC Platforms
HPC Benchmarking
- LINPACK Benchmarks
- Measure floating-point computing power.
- Solves a dense system of linear equations to approximate real-world problem-solving performance.
- HPL (High-Performance Linpack)
- Portable Linpack implementation in C.
- Provides data for the TOP500 list.
- Metrics:
- Rmax: Achieved LINPACK performance.
- Rpeak: Theoretical peak performance.
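To relate the two metrics, a back-of-the-envelope sketch (all hardware numbers below are assumed for illustration, not from the lecture): Rpeak is the product of core count, clock rate, and floating-point operations per cycle, while Rmax is whatever HPL actually measures, shown here with an assumed efficiency.

```c
#include <stdio.h>

int main(void) {
    /* Assumed, illustrative hardware figures (not from the lecture). */
    double nodes           = 100;   /* nodes in the cluster           */
    double cores_per_node  = 64;    /* cores per node                 */
    double clock_ghz       = 2.5;   /* clock rate in GHz              */
    double flops_per_cycle = 16;    /* e.g. wide vector FMA units     */

    /* Rpeak: every core performing its maximum FLOPs every cycle. */
    double rpeak_gflops = nodes * cores_per_node * clock_ghz * flops_per_cycle;

    /* Rmax is measured by running HPL; here we only assume an efficiency. */
    double assumed_efficiency = 0.70;
    double rmax_gflops = rpeak_gflops * assumed_efficiency;

    printf("Rpeak ~ %.1f TFLOP/s\n", rpeak_gflops / 1000.0);
    printf("Rmax  ~ %.1f TFLOP/s (at %.0f%% assumed HPL efficiency)\n",
           rmax_gflops / 1000.0, assumed_efficiency * 100.0);
    return 0;
}
```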
- Cluster Components:
- Nodes: Individual computers in the cluster.
- Cores (Threads): Processing units within each node’s CPU.
- Shared Disk: Storage accessible by all nodes.