hpc part b
The inception of supercomputing is closely tied to early parallel and vector designs. The CDC 6600,
designed by Seymour Cray in 1964, is often considered the first supercomputer, introducing
the concept of parallel functional units and achieving performance of up to 3 megaFLOPS.
The Cray T3E, released in 1995, further advanced MPP by integrating over 2,000 processors
with a three-dimensional torus interconnect, enhancing scalability and performance.
The 2000s witnessed the advent of petascale computing, approaching the barrier of 10^15
FLOPS. IBM's Blue Gene/L, first operational in 2004, eventually scaled to over 65,000
compute nodes and achieved roughly 280 teraFLOPS. Its successor, Blue Gene/P, introduced
in 2007, was designed to scale toward 1 petaFLOPS.
These systems emphasized energy efficiency and scalability, setting the stage for future
supercomputers.
The pursuit of exascale computing, achieving 10^18 FLOPS, has been a significant focus in
recent years. The Frontier supercomputer, developed by Hewlett Packard Enterprise and
operational at Oak Ridge National Laboratory since 2022, became the first to surpass the
exascale threshold, achieving 1.1 exaFLOPS.
Frontier combines AMD EPYC CPUs with AMD Instinct GPUs, interconnected through HPE's
high-speed Slingshot network, to deliver unprecedented performance for complex simulations
and AI workloads.
Conclusion
The evolution from vector processors to exascale computing reflects the relentless pursuit of
higher performance and efficiency in supercomputing. Each milestone has not only enhanced
computational power but also expanded the horizons of scientific discovery and technological
advancement.
12. With a neat diagram, explain the different levels of memory hierarchy and their impact on
data locality in HPC
In High-Performance Computing (HPC), the memory hierarchy plays a crucial role in determining the
performance of applications. As processor speeds have outpaced memory speeds, the memory
hierarchy bridges the gap through layers of memory with different speeds, sizes, and costs.

        +---------------------+
        |      Registers      |   fastest, smallest, costliest
        +---------------------+
        |  L1 / L2 / L3 Cache |
        +---------------------+
        |  Main Memory (RAM)  |
        +---------------------+
        |  Secondary Storage  |   slowest, largest, cheapest
        |     (SSD / HDD)     |
        +---------------------+
   (Speed decreases and capacity increases from top to bottom.)
1. Registers
o Located inside the CPU.
o Fastest memory, smallest in size.
o Holds operands for immediate processing.
2. L1, L2, L3 Caches
o L1 Cache: Closest to the CPU core, smallest (~32KB), fastest cache.
o L2 Cache: Larger (~256KB to 1MB), usually private to each core (shared in some designs).
o L3 Cache: Shared among cores, bigger (~4MB–64MB), slower than L1/L2.
o These are hardware-managed caches, crucial for exploiting temporal and spatial locality.
3. Main Memory (RAM)
o Larger capacity (~GBs), slower than cache.
o Accessed when data is not found in the cache (cache miss).
o Hardware prefetchers load contiguous blocks from RAM into cache, exploiting spatial locality.
4. Secondary Storage (Disk)
o Includes SSDs and HDDs.
o Much larger (TBs), but very high latency.
o Data is paged into main memory when needed.
o Not ideal for frequent data access in HPC.
Impact on Data Locality
1. Temporal Locality
o Recently accessed data is likely to be accessed again soon.
o Caches exploit this by keeping hot data close to the CPU, so repeated accesses hit fast memory instead of RAM.
2. Spatial Locality
o If data at one memory location is accessed, nearby data is likely to be accessed soon.
o Caches fetch whole lines, and RAM prefetching loads adjacent blocks of data, exploiting spatial locality.
3. Importance in HPC
o Many HPC kernels are memory-bound, so arranging loops and data layouts to match the hierarchy reduces cache misses and memory stalls, directly improving sustained performance and parallel scalability.
Conclusion
Understanding and optimizing for the memory hierarchy is essential in HPC to enhance performance.
By maximizing data locality, applications can reduce costly memory accesses, utilize faster memory
levels, and achieve better parallel efficiency.