0% found this document useful (0 votes)

13 views

chapter 2

The document discusses advancements in microprocessor speed and performance, highlighting techniques such as pipelining, superscalar execution, branch prediction, and speculative execution. It emphasizes the importance of balancing performance across different components, particularly between memory and processors, and outlines approaches to improve efficiency. Additionally, it introduces new performance improvement strategies like multicore processors and Graphics Processing Units (GPUs).

Uploaded by

ahmed.waasel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views

chapter 2

Uploaded by

ahmed.waasel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 14

Performance

Microprocessor Speed
The development of computers continues. Due to the
application of Moore's Law, chip makers can release a new
generation of chips every three years - with four times the
number of transistors. This leads to an increase in speed.

Techniques built into contemporary processors to

increase performance include

Superscalar Branch Speculative Data flow

Pipelining
execution prediction execution analysis
Microprocessor Speed

Pipelining
• Pipelining is the process of sending multiple data
packets serially without waiting for the previous
acknowledgment.
• This technique is beneficial when the amount of data
to be transferred is very large, and we send the data by
dividing them into various parts.
• It facilitates parallelism in execution at the hardware
level.
• “Common” instructions (arithmetic, load/store,
conditional branch) can be executed independently.
• Pipelining does not reduce the execution time of
individual instructions but reduces the overall
execution time required for a program.
Microprocessor Speed

Pipelining

The functionalities of pipelining in

the computer networks:
•High Performance
•Efficient use of resources
•Time Efficiency
•Fast Data Delivery
•Reduces the process waiting-time
Microprocessor Speed

Superscalar execution

•The ability to issue multiple

independent instructions in
parallel in every processor clock
cycle.
•Multiple parallel pipelines are
used.
Microprocessor Speed

Branch prediction
• The processor looks ahead in the instruction code
fetched from memory and predicts which branches, or
groups of instructions, are likely to be processed next.
• The purpose of the branch predictor is to improve the
flow in the instruction pipeline.
• The prediction is executed and the results are kept
temporarily, and if it is later detected that the guess
was wrong, the speculatively executed or partially
executed instructions are discarded .The pipeline
starts over with the correct branch, causing a delay.
Microprocessor Speed

Speculative execution
Using branch prediction and data flow
analysis, some processors speculatively
execute instructions before their actual
appearance in the program execution, holding
the results in temporary locations, and keeping
execution engines as busy as possible.

Data flow analysis

The processor analyzes which instructions are
dependent on each other’s results, or data, to
create an optimized schedule of instructions.
Performance

Performance Balance
One difficulty in designing
an efficient system is that It is necessary to
different components adjust the
operate at different speeds. organization and
➢ For example, DRAM is architecture to
generally much slower than
the processor compensate for this
mismatch.

This is why CPU The overall balance

computer in the system is
benchmarks are more important
used to compare than the raw
system performance of any
one component.
performance.
Performance

Performance Balance
To overcome the imbalance between memory and processor
speeds there are several approaches

Increase the number of bits that Change the DRAM interface to

are retrieved at one time by make it more efficient by
making DRAMs “wider” rather including a cache or other
than “deeper” and by using buffering scheme on the DRAM
wide bus data paths – 8, 16, 32, chip.
and 64-bit systems. Increase the interconnect
Reduce the frequency of bandwidth between processors
memory access by and memory by using higher-
incorporating increasingly speed buses and a hierarchy of
complex and efficient cache buses to buffer and structure
structures between the data flow.
processor and main
memory(memory hierarchy).
Performance

Improvements in Chip Organization and

Architecture
• Increase hardware speed of processor
• Fundamentally due to shrinking logic gate size
• More gates, packed more tightly, increasing
clock rate
• Propagation time for signals reduced
• Increase size and speed of caches
• Dedicating part of processor chip
• Cache access times drop significantly
• Change processor organization and architecture
• Increase effective speed of instruction execution
• Parallelism
Problems with Clock Speed and Login
Density
•Power
•RC delay
•Memory latency
New approach to improving performance

•Multicore: multiple processors on the

same chip, with a large shared cache.
•Many Integrated Core (MIC)
•Graphics Processing Unit (GPU)
Many Integrated Core (MIC)
Graphics Processing Unit (GPU)

MIC GPU
• A large number of cores per • A chip with multiple general-
chip. purpose processors plus graphics
• Leap in performance as well processing units (GPUs) and
as the challenges in specialized cores for video
developing software to processing and other tasks.
exploit such a large number • Traditionally found on a plug-in
of cores. graphics card, it is used to
• The multicore and MIC encode and render 2D and 3D
strategy involves a graphics as well as a process
homogeneous collection of video.
general purpose processors • Used as vector processors for a
on a single chip. variety of applications that
require repetitive computations.
Basic Measures of Computer Performance
• Performance is one of the key parameters to consider,
along with cost, size, security, reliability, and, in some
cases, power consumption.
• Traditional measures of processor speed:
➢Clock Speed:
oThe speed of a processor is dictated by the pulse frequency
produced by a system clock.
oClock speed is measured in cycles per second (Hertz)
➢Instruction Execution Rate:
oThe processor will have many different instructions it can
perform and each will take a fixed number of cycles.

Python Django Developer Resume: Career Goal
100% (1)
Python Django Developer Resume: Career Goal
2 pages
William Stallings Computer Organization and Architecture 10 Edition
No ratings yet
William Stallings Computer Organization and Architecture 10 Edition
33 pages
Audiomoth Dev Datasheet: Open Acoustic Devices
100% (1)
Audiomoth Dev Datasheet: Open Acoustic Devices
7 pages
Migration Strategy: Google Suite To Office365 v0
No ratings yet
Migration Strategy: Google Suite To Office365 v0
8 pages
CH02-COA10e Spring 2025
No ratings yet
CH02-COA10e Spring 2025
24 pages
التحليل
No ratings yet
التحليل
32 pages
L5-L6-Performance Issues
No ratings yet
L5-L6-Performance Issues
47 pages
CH02-COA10e Spring 2025
No ratings yet
CH02-COA10e Spring 2025
24 pages
4 - Performance Issues
No ratings yet
4 - Performance Issues
48 pages
Chapter 2
No ratings yet
Chapter 2
34 pages
CH02 COA10e
No ratings yet
CH02 COA10e
67 pages
Chapter 2
No ratings yet
Chapter 2
15 pages
2.Week
No ratings yet
2.Week
35 pages
CH02 COA10e.performance Issues
No ratings yet
CH02 COA10e.performance Issues
19 pages
Chapter Two
No ratings yet
Chapter Two
33 pages
2. ünite
No ratings yet
2. ünite
33 pages
Chapter 2
No ratings yet
Chapter 2
34 pages
Performance Issues
No ratings yet
Performance Issues
19 pages
Chapter 1 Solution
No ratings yet
Chapter 1 Solution
35 pages
LEC 2
No ratings yet
LEC 2
31 pages
Chapter 2 V
No ratings yet
Chapter 2 V
24 pages
Introduction To High Performance Computing: Unit-I
No ratings yet
Introduction To High Performance Computing: Unit-I
70 pages
IAS & MIPS Rate
No ratings yet
IAS & MIPS Rate
42 pages
FIT9134_week11
No ratings yet
FIT9134_week11
21 pages
Week2 - 1
No ratings yet
Week2 - 1
64 pages
Mod6 2 PDF
No ratings yet
Mod6 2 PDF
15 pages
LEC 2
No ratings yet
LEC 2
31 pages
Multicore Processor Report
100% (1)
Multicore Processor Report
19 pages
Chapter 11
No ratings yet
Chapter 11
33 pages
Performance of Computers: Factors Affecting Computer Performance
No ratings yet
Performance of Computers: Factors Affecting Computer Performance
4 pages
CA01_2024S2
No ratings yet
CA01_2024S2
30 pages
Assgniment 3rd Year 2nd Semester
No ratings yet
Assgniment 3rd Year 2nd Semester
5 pages
HPC -1
No ratings yet
HPC -1
40 pages
Ca02 2014 PDF
No ratings yet
Ca02 2014 PDF
79 pages
SP23 CS 212 Week 2
No ratings yet
SP23 CS 212 Week 2
23 pages
Unit 1 Modern Processors
No ratings yet
Unit 1 Modern Processors
52 pages
CMP2008 L1
No ratings yet
CMP2008 L1
47 pages
Seminar Report
50% (4)
Seminar Report
30 pages
Modle 01 - HPC Introduction To Pipeline
No ratings yet
Modle 01 - HPC Introduction To Pipeline
124 pages
Unit I-Basic Structure of A Computer: System
No ratings yet
Unit I-Basic Structure of A Computer: System
64 pages
Parallelism and Multicores
No ratings yet
Parallelism and Multicores
54 pages
Unit 7 - Parallel Processing Paradigm
No ratings yet
Unit 7 - Parallel Processing Paradigm
26 pages
L1.0 HPC Overview
No ratings yet
L1.0 HPC Overview
58 pages
Mod 7
No ratings yet
Mod 7
56 pages
Intro
No ratings yet
Intro
14 pages
1.1 Processor Micro Architecture
No ratings yet
1.1 Processor Micro Architecture
21 pages
Hyper-Threading Technology: Processor Microarchitecture
No ratings yet
Hyper-Threading Technology: Processor Microarchitecture
18 pages
Parallel Computing Platforms-Dr Nausheen
No ratings yet
Parallel Computing Platforms-Dr Nausheen
47 pages
CSC232 - Chp1 (Compatibility Mode)
No ratings yet
CSC232 - Chp1 (Compatibility Mode)
50 pages
CCS 1202 Lecture 2_Computer Evolution and Performance
No ratings yet
CCS 1202 Lecture 2_Computer Evolution and Performance
32 pages
Aula Ch1
No ratings yet
Aula Ch1
40 pages
Multicore Processor
100% (1)
Multicore Processor
23 pages
COMP-unit-1
No ratings yet
COMP-unit-1
52 pages
Hyper-Threading Technology: Shaik Mastanvali (06951A0541)
No ratings yet
Hyper-Threading Technology: Shaik Mastanvali (06951A0541)
23 pages
20BCE2351 Micro Assignment-02
No ratings yet
20BCE2351 Micro Assignment-02
5 pages
Mastering System Programming with C: Files, Processes, and IPC
From Everand
Mastering System Programming with C: Files, Processes, and IPC
Larry Jones
No ratings yet
SAS Programming Guidelines Interview Questions You'll Most Likely Be Asked
From Everand
SAS Programming Guidelines Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
Fundamentals of Modern Computer Architecture: From Logic Gates to Parallel Processing
From Everand
Fundamentals of Modern Computer Architecture: From Logic Gates to Parallel Processing
Sam Steed
No ratings yet
Embedded Systems Programming with C: Writing Code for Microcontrollers
From Everand
Embedded Systems Programming with C: Writing Code for Microcontrollers
Larry Jones
No ratings yet
LPIC-3 Exam 306-300 Mastery: 500 Practice Questions on High Availability & Storage Clusters
From Everand
LPIC-3 Exam 306-300 Mastery: 500 Practice Questions on High Availability & Storage Clusters
Steve Brown
No ratings yet
OpenACC Programming Essentials: Definitive Reference for Developers and Engineers
From Everand
OpenACC Programming Essentials: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Oracle 11g Streams Implementer's Guide
From Everand
Oracle 11g Streams Implementer's Guide
Ann L. R. McKinnell
No ratings yet
Advanced Backend Code Optimization
From Everand
Advanced Backend Code Optimization
Sid Touati
No ratings yet
Priority&Round Robin Algorithm
No ratings yet
Priority&Round Robin Algorithm
7 pages
نماذج اختبار نصفي نظم تشغيل عملي للأستاذة أبرار الإدريسي Cs24
No ratings yet
نماذج اختبار نصفي نظم تشغيل عملي للأستاذة أبرار الإدريسي Cs24
4 pages
قالب الاسئلة نموذج انجليزي - نظم التشغيل Principles of Operating System-2025
No ratings yet
قالب الاسئلة نموذج انجليزي - نظم التشغيل Principles of Operating System-2025
40 pages
CS-2
No ratings yet
CS-2
2 pages
Lecture 1 - 2024
No ratings yet
Lecture 1 - 2024
15 pages
Adgrants
No ratings yet
Adgrants
13 pages
How To Get To Windows 10's Advanced Startup Options Menu: Method 1: Hit F11
No ratings yet
How To Get To Windows 10's Advanced Startup Options Menu: Method 1: Hit F11
3 pages
mp3 Thesister
100% (3)
mp3 Thesister
6 pages
Ug586 7series MIS
No ratings yet
Ug586 7series MIS
164 pages
Microsoft Certified Data Analyst Associate Skills Measured
No ratings yet
Microsoft Certified Data Analyst Associate Skills Measured
4 pages
Procedure Text: English Grade XI
No ratings yet
Procedure Text: English Grade XI
14 pages
Hardware Software Co-Design: BITS Pilani
No ratings yet
Hardware Software Co-Design: BITS Pilani
26 pages
Affairscloud Computer Part 1 of 10
No ratings yet
Affairscloud Computer Part 1 of 10
21 pages
PM Chapter-4
No ratings yet
PM Chapter-4
4 pages
Post Processing Framework
No ratings yet
Post Processing Framework
2 pages
Desktop Compute - How Each Part Works
No ratings yet
Desktop Compute - How Each Part Works
26 pages
Advancing 5G Connectivity A Comprehensive Review o
No ratings yet
Advancing 5G Connectivity A Comprehensive Review o
19 pages
CHE/ CPE 612/ CPE 613: Aspen Plus: Introduction To Chemical Engineering Simulation Note From: Dr. Zainal Ahmad
No ratings yet
CHE/ CPE 612/ CPE 613: Aspen Plus: Introduction To Chemical Engineering Simulation Note From: Dr. Zainal Ahmad
33 pages
JWT Spring Boot Example
100% (1)
JWT Spring Boot Example
9 pages
Subscription Management SaaS-based System
No ratings yet
Subscription Management SaaS-based System
11 pages
Casa Cable Cmcpe Mib
No ratings yet
Casa Cable Cmcpe Mib
27 pages
Practice Test 3 Udemy
No ratings yet
Practice Test 3 Udemy
68 pages
Kites - The Erp Made For This Decade
No ratings yet
Kites - The Erp Made For This Decade
2 pages
Atlantis User Manual A08-LN1200-W & A08-LS1500-W - V3.2 A
No ratings yet
Atlantis User Manual A08-LN1200-W & A08-LS1500-W - V3.2 A
54 pages
2022 - 002075129500003955742022 - CityEdge VAR Service Desk Go Live
No ratings yet
2022 - 002075129500003955742022 - CityEdge VAR Service Desk Go Live
5 pages
Artificial Intelligence
No ratings yet
Artificial Intelligence
22 pages
PC App
No ratings yet
PC App
5 pages
Fast - Lane F5 NETWORKS - CONFIGURING BIG IP ADVANCED WAF
No ratings yet
Fast - Lane F5 NETWORKS - CONFIGURING BIG IP ADVANCED WAF
5 pages
12 Transaction Processing PDF
No ratings yet
12 Transaction Processing PDF
50 pages
HP PageWide Pro Series Firmware Readme VR 1.9.1 001.2132A
No ratings yet
HP PageWide Pro Series Firmware Readme VR 1.9.1 001.2132A
8 pages
BILL MANAGEMENT SYSTEM WITH TITLE Budget Planner Abstract
No ratings yet
BILL MANAGEMENT SYSTEM WITH TITLE Budget Planner Abstract
4 pages
Salesforce Developer Interview Questions
No ratings yet
Salesforce Developer Interview Questions
60 pages

chapter 2

Uploaded by

chapter 2

Uploaded by

Performance

Techniques built into contemporary processors to

Superscalar Branch Speculative Data flow

The functionalities of pipelining in

•The ability to issue multiple

Data flow analysis

This is why CPU The overall balance

Increase the number of bits that Change the DRAM interface to

Improvements in Chip Organization and

•Multicore: multiple processors on the

You might also like