Parallel Chapter3

This document discusses pipelining in computer processors. It defines key concepts such as stages, tasks, and stalling. It describes how pipelining reduces the effective processing time per task from T_seq to about t_cyc = T_seq/k in the ideal case. It also discusses different types of pipelines, such as instruction and arithmetic pipelines. It covers control hazards, which are resolved through branch prediction, and structural hazards, which are addressed by adding hardware. It describes using collision vectors to control initiation in pipelines and minimize average latency.

Uploaded by

aletharee


Chapter 3 Pipelining

3.1 Pipeline Model

Terminology
task
subtask
stage
staging register
Total processing time for each task in the pipeline:
$T_{pl} = \sum_{i=1}^{k} (t_i + d_i)$,
where $t_i$ is the processing time of stage i, $d_i$ is the delay of its staging register, and k is the number of stages.
3.1 Pipeline Model
(continued)
Total processing time for each task (sequential): $T_{seq} = \sum_{i=1}^{k} t_i$
Pipeline cycle time: $t_{max} = \max(t_i + d_i)$, $1 \le i \le k$
Clock frequency = $1 / t_{max}$
The pipeline cycle time $t_{cyc}$ can be denoted by $T_{seq}/k + d$
Speedup: $S = \frac{N \cdot T_{seq}}{(k + N - 1) \cdot t_{cyc}}$, where N is the number of tasks.
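The timing quantities of the pipeline model can be evaluated numerically. The following is a minimal sketch, not part of the original slides: the function name `pipeline_timing` and the stage times and register delays are made-up illustration values (think of them as nanoseconds), and the cycle time is taken to be the slowest stage plus its register delay.

```python
def pipeline_timing(t, d, n_tasks):
    """Return (T_pl, T_seq, t_cyc, S) for stage times t and register delays d."""
    k = len(t)                                    # number of stages
    t_pl = sum(ti + di for ti, di in zip(t, d))   # time for one task through the pipeline
    t_seq = sum(t)                                # time for one task without staging registers
    t_cyc = max(ti + di for ti, di in zip(t, d))  # cycle time set by the slowest stage
    speedup = (n_tasks * t_seq) / ((k + n_tasks - 1) * t_cyc)
    return t_pl, t_seq, t_cyc, speedup


# Hypothetical 4-stage pipeline processing 100 tasks
stage_times = [10, 12, 8, 10]
register_delays = [1, 1, 1, 1]
t_pl, t_seq, t_cyc, s = pipeline_timing(stage_times, register_delays, n_tasks=100)
print(t_pl, t_seq, t_cyc, round(s, 2))  # 44 40 13 2.99
```

With these numbers the speedup stays below the stage count of 4 because the stages are unbalanced and the register delay adds to every cycle.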
3.1 Pipeline Model
(continued)
If the staging register delay is ignored and the
processing times of the stages are the same,
$t_{cyc} = T_{seq} / k$.
Therefore, $S_{ideal}$ becomes $\frac{kN}{k + N - 1}$.
If $N \gg k$, $S_{ideal} \approx k$.
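As a worked step, the ideal speedup follows from substituting the balanced cycle time into the general speedup expression. The numeric case (k = 5, N = 100) is an illustration value, not taken from the slides.

```latex
\begin{aligned}
S_{ideal} &= \frac{N \cdot T_{seq}}{(k + N - 1) \cdot T_{seq}/k}
           = \frac{kN}{k + N - 1}, \\
\lim_{N \to \infty} S_{ideal} &= \lim_{N \to \infty} \frac{k}{1 + (k - 1)/N} = k, \\
k = 5,\; N = 100: \quad S_{ideal} &= \frac{500}{104} \approx 4.81 .
\end{aligned}
```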
3.1 Pipeline Model
(continued)
The total cost of the pipeline is given by
$C = L \cdot k + C_p$, where $C_p = \sum_{i=1}^{k} c_i$ and L is
the cost of each staging register.
To minimize the composite cost per
computation rate, $k = \sqrt{\frac{T_{seq} \cdot C_p}{d \cdot L}}$
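The optimum stage count can be reproduced with a short derivation. This sketch assumes that the composite cost per computation rate is the product of the cost C = L k + C_p and the cycle time t_cyc = T_seq/k + d (the computation rate being 1/t_cyc); the slides do not spell out this measure.

```latex
\begin{aligned}
C \cdot t_{cyc} &= (L k + C_p)\left(\frac{T_{seq}}{k} + d\right)
  = L\,T_{seq} + L\,d\,k + \frac{C_p\,T_{seq}}{k} + C_p\,d, \\
\frac{\partial}{\partial k}\left(C \cdot t_{cyc}\right)
  &= L\,d - \frac{C_p\,T_{seq}}{k^{2}} = 0
  \quad\Longrightarrow\quad
  k = \sqrt{\frac{T_{seq} \cdot C_p}{d \cdot L}} .
\end{aligned}
```

The second derivative, $2\,C_p\,T_{seq}/k^{3}$, is positive for k > 0, so this stationary point is a minimum.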
3.1 Pipeline Model
(continued)
In practice, making the delays of the pipeline stages
equal is a complicated and time-consuming process.
Keeping the stages close to balanced is essential for
maximum performance.
It is done for commercial processors, although it is neither easy
nor cheap to do.
Another problem with pipelines is the overhead of
handling exceptions and interrupts.
A deep pipeline increases the interrupt handling overhead.
Pipeline Types (Handler's classification)
Instruction pipelines
FI (fetch instruction), DI (decode instruction), CA (calculate address), FO (fetch operands), EX (execute), ST (store results)
Arithmetic pipelines
Processor pipelines: a cascade of
processors, each executing a specific
module of the application program.
Instruction pipeline

reservation table
Row : stages
Column : pipeline cycles
The cycle time of instruction pipelines is
often determined by the stages
requiring memory access.
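To make the row/column layout of the reservation table concrete, the sketch below builds the overlapped table (rows are stages, columns are pipeline cycles) for a linear six-stage instruction pipeline using the stage names FI, DI, CA, FO, EX, ST. The function name `space_time` and the choice of four instructions are illustration assumptions, and the model ignores stalls.

```python
STAGES = ["FI", "DI", "CA", "FO", "EX", "ST"]


def space_time(n_instructions, stages=STAGES):
    """Rows are stages, columns are cycles; instruction i enters stage s in cycle i + s."""
    k = len(stages)
    n_cycles = k + n_instructions - 1      # cycles needed to drain the pipeline
    table = [["--"] * n_cycles for _ in stages]
    for instr in range(n_instructions):
        for s in range(k):
            table[s][instr + s] = f"I{instr + 1}"
    return table


# Print one row per stage for four instructions
for name, row in zip(STAGES, space_time(4)):
    print(name, " ".join(row))
```

Four instructions finish in 6 + 4 - 1 = 9 cycles, and once the pipeline is full every column holds a different instruction in each stage.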

Control Hazard
Conditional branch instructions
The target address of branch will be known only
after the evaluation of the condition.
Ways to resolve control hazards
Freeze the pipeline until the branch outcome is known.
Predict that the branch will not be taken and
continue with the sequential instructions.
Start fetching the target instruction
sequence into a buffer while the nonbranch
sequence is being fed into the pipeline.
Arithmetic pipelines
Floating point addition
Consider S = A + B, where A=(Ea,Ma), B=(Eb,
Mb), and S=(Es,Ms)
Addition steps (Figure 3.5)
Equalize the exponents
Add mantissas
Normalize Ms and adjust Es for the sum normalization
Round Ms
Renormalize Ms and adjust Es
Modified floating point add pipeline (Figure 3.6 &
3.7)
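The five addition steps can be sketched as straight-line code, one statement group per pipeline stage. This is a simplified illustration, not the circuit of Figure 3.5: it assumes base 2, mantissas normalized to the range [0.5, 1), Python floats standing in for mantissa registers, and no handling of exponent bias, overflow, or special values. The names `normalize`, `fp_add`, and `bits` are hypothetical.

```python
def normalize(e, m):
    """Scale mantissa m into [0.5, 1) and adjust exponent e to compensate."""
    if m == 0:
        return 0, 0.0
    while abs(m) >= 1.0:
        m, e = m / 2, e + 1
    while abs(m) < 0.5:
        m, e = m * 2, e - 1
    return e, m


def fp_add(a, b, bits=8):
    """Add a = (Ea, Ma) and b = (Eb, Mb); each value is M * 2**E."""
    (ea, ma), (eb, mb) = a, b
    # Stage 1: equalize exponents by shifting the mantissa with the smaller exponent right
    if ea < eb:
        ma, ea = ma / 2 ** (eb - ea), eb
    else:
        mb, eb = mb / 2 ** (ea - eb), ea
    # Stage 2: add mantissas
    es, ms = ea, ma + mb
    # Stage 3: normalize Ms and adjust Es
    es, ms = normalize(es, ms)
    # Stage 4: round Ms to `bits` fraction bits
    ms = round(ms * 2 ** bits) / 2 ** bits
    # Stage 5: renormalize, since rounding can push Ms up to 1.0
    return normalize(es, ms)


print(fp_add((3, 0.75), (1, 0.5)))   # (3, 0.875), i.e. 6.0 + 1.0 = 7.0
print(fp_add((1, 0.75), (1, 0.75)))  # (2, 0.75), i.e. 1.5 + 1.5 = 3.0
```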
Arithmetic pipelines(cont.)
floating point multiplication
Consider P= A x B, where A=(Ea,Ma), B=(Eb,
Mb), and P=(Ep,Mp)
Multiplication steps (Figure 3.8)
Add exponents
Multiply mantissas
Normalize Mp and adjust Ep
Round Mp
Renormalize Mp and adjust Ep
Floating point multiplication pipeline (Figure 3.9)
Arithmetic pipelines(cont.)
Multifunction pipeline
To perform more than one operation
A control input is needed for proper
operation of the multifunction pipeline.
Figure 3.10 : floating point add/multiplier
Classification scheme by
Ramamoorthy and Li
Functionality
unifunctional
multifunctional
Configuration
static
dynamic
Mode of operation:
scalar
vector
3.2 Pipeline control and
Performance
To provide the maximum possible throughput, the pipeline
must be kept full and flowing smoothly.
Two conditions affect the smooth flow of a pipeline:
the rate of input of data
data interlocks between the stages
Example 3.1 : the pipeline completes one
operation per cycle(once it is full)
Example 3.2 : non-linear pipeline
Structural hazard
Due to the non-availability of
appropriate hardware
One obvious way of avoiding structural
hazard is to insert additional hardware
into the pipeline.

Example 3.3
Figure 3.12 depicts the operation of the
pipeline
In cycles 3, 4, 5, and 6, simultaneous accesses are
needed.
If we assume that the machine has separate data
and instruction caches, the problems in cycles 5 and 6
are solved.
One way to solve the problem in cycle 4 is to stall
the ADD instruction (Figure 3.13).
Stalling degrades pipeline
performance.

Collision vectors
Initiation : launching of an operation into the
pipeline
Latency: the number of cycles that elapse
between two initiations.
Latency sequence: the sequence of latencies between
successive initiations
Collision: occurs if a stage in the pipeline is
required to perform more than one task at
any time.
Collision vectors(cont.)
Forbidden set: the set of all possible column
distances between two entries on the same row
of the reservation table (RT), taken over all rows.
The collision vector can be derived from
the forbidden set F and can be used to
control the initiation of operations in the
pipeline.
$CV = (v_{n-1}, v_{n-2}, \ldots, v_2, v_1)$
$v_i = 1$ if i is in the forbidden set, and $v_i = 0$ otherwise
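The forbidden set and collision vector defined above can be computed directly from a reservation table. The sketch below uses a made-up three-stage, five-cycle table (it is not one of the figures in the chapter), takes n to be the number of columns, and the function names are hypothetical.

```python
from itertools import combinations


def forbidden_set(table):
    """Column distances between every pair of marks on the same row."""
    forbidden = set()
    for row in table:
        cols = [c for c, used in enumerate(row) if used]
        for a, b in combinations(cols, 2):
            forbidden.add(b - a)
    return forbidden


def collision_vector(table):
    """Bit string (v_{n-1}, ..., v_1) with v_i = 1 when latency i is forbidden."""
    n = len(table[0])  # number of columns in the reservation table
    f = forbidden_set(table)
    return "".join("1" if i in f else "0" for i in range(n - 1, 0, -1))


# Hypothetical reservation table: rows are stages, columns are cycles
table = [
    [1, 0, 0, 0, 1],  # stage 1 used in cycles 0 and 4 -> distance 4
    [0, 1, 0, 1, 0],  # stage 2 used in cycles 1 and 3 -> distance 2
    [0, 0, 1, 0, 0],  # stage 3 used once -> no distance
]
print(sorted(forbidden_set(table)))  # [2, 4]
print(collision_vector(table))       # 1010
```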
Examples
Example 3.4
(a) Overlapped RT
(b) Collision Vector(CV)
Example 3.5 & 3.6
Collision case and no collision case

Control
How to control the initiation of operations in a pipeline using the
CV:
Place the CV in a shift register.
If the LSB of the shift register is 1, do not initiate an
operation at that cycle; shift the register right once,
inserting 0 at the vacant MSB position.
If the LSB of the shift register is 0, initiate a new
operation at that cycle; shift the register right once,
inserting 0 at the vacant MSB position. To
reflect the superposition of the new
initiation on the earlier ones, perform a bit-by-bit
OR of the original CV with the content of the shift
register.
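The shift-register control rule can be simulated cycle by cycle. This is a sketch under two assumptions that the slides do not state: the first operation is initiated at cycle 0, and a new operation is always waiting, so one is started whenever the LSB allows it. The collision vector (00111) is the one quoted in this chapter for Figure 3.11; `initiation_cycles` is a hypothetical name.

```python
def initiation_cycles(cv_bits, n_cycles):
    """Return the cycles at which operations are initiated under CV control."""
    cv = int(cv_bits, 2)
    reg = cv        # shift register holds the CV right after the initiation at cycle 0
    started = [0]
    for cycle in range(1, n_cycles):
        if reg & 1:
            # LSB is 1: initiating now would collide, so only shift right
            reg >>= 1
        else:
            # LSB is 0: initiate, shift right, then OR in the original CV
            started.append(cycle)
            reg = (reg >> 1) | cv
    return started


print(initiation_cycles("00111", 13))  # [0, 4, 8, 12]
```

Python's `>>` on a non-negative integer fills the vacated high bit with 0, which matches inserting 0 at the MSB position.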
3.2.3 Performance
Figure 3.15(a)
The CV of Figure 3.11 : (00111)
Figure 3.15(a) shows the state transitions.
3.2.3 Performance
Average latency
simple cycle
greedy cycle
MAL (minimum average latency)
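A greedy cycle can be found by walking the state diagram and always taking the smallest permissible latency until a state repeats. The sketch below does only that; it does not search all simple cycles, so the average it reports is the greedy cycle's average latency and is not guaranteed to equal the MAL. The function names are hypothetical, and any latency longer than the CV is represented by `width + 1`, since all such latencies lead back to the initial state.

```python
def permissible(state, width):
    """Latencies whose bit in the state is 0, plus width + 1 (always allowed)."""
    free = [i for i in range(1, width + 1) if not (state >> (i - 1)) & 1]
    return free + [width + 1]


def greedy_cycle(cv_bits):
    """Return (latency cycle, average latency) for the greedy strategy."""
    width = len(cv_bits)
    cv = int(cv_bits, 2)
    state = cv
    seen = {}        # state -> position in `latencies` where it was first entered
    latencies = []
    while state not in seen:
        seen[state] = len(latencies)
        lat = permissible(state, width)[0]  # smallest permissible latency
        latencies.append(lat)
        state = (state >> lat) | cv         # next state after initiating
    cycle = latencies[seen[state]:]
    return cycle, sum(cycle) / len(cycle)


print(greedy_cycle("00111"))  # ([4], 4.0)
print(greedy_cycle("1010"))   # ([1, 5], 3.0)
```

For the CV (00111) the state diagram has a single state, so the greedy cycle is the constant latency 4; the second call uses a made-up CV to show a two-state greedy cycle.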
3.2.4 Multifunction Pipelines
Figure 3.17
Vxx, Vxy, Vyx, Vyy
3.3 Other Pipeline Problems
Data interlock (data hazard): due to the sharing of
resources
data forwarding
internal forwarding
write-read forwarding
read-read forwarding
write-write forwarding
load/store architectures versus
memory/memory architectures
3.3 Other Pipeline Problems
(continued)
Conditional Branches
branch prediction
delayed branch
branch-prediction buffer
branch history
multiple instruction buffers
Interrupts
precise interrupt scheme
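For the branch-prediction buffer listed under conditional branches, one common form is a small table of 2-bit saturating counters indexed by low-order bits of the branch address. The slides do not say which scheme is meant, so the sketch below is an assumption-laden illustration: the class name, the 16-entry size, the initial counter value, and the loop-branch outcome pattern are all made up.

```python
class BranchPredictionBuffer:
    """Table of 2-bit saturating counters; values 2 and 3 predict taken."""

    def __init__(self, entries=16):
        self.entries = entries
        self.counters = [1] * entries  # start in the weakly not-taken state

    def _index(self, pc):
        return pc % self.entries       # low-order address bits select the entry

    def predict(self, pc):
        return self.counters[self._index(pc)] >= 2

    def update(self, pc, taken):
        i = self._index(pc)
        if taken:
            self.counters[i] = min(3, self.counters[i] + 1)
        else:
            self.counters[i] = max(0, self.counters[i] - 1)


def accuracy(outcomes, pc=0x40):
    """Fraction of correct predictions for one branch with the given history."""
    bpb = BranchPredictionBuffer()
    hits = 0
    for taken in outcomes:
        hits += bpb.predict(pc) == taken
        bpb.update(pc, taken)
    return hits / len(outcomes)


# A loop branch taken 9 times, then not taken, executed twice
loop_history = ([True] * 9 + [False]) * 2
print(accuracy(loop_history))  # 0.85
```

The counter mispredicts the first iteration and each loop exit, giving 17 correct predictions out of 20 for this history.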
3.4 Dynamic Pipelines
Instruction deferral
scoreboard
Tomasulo's algorithm
Performance evaluation
maximizing the total number of initiations
per unit time
minimizing the total time required to handle
a specific sequence of initiation table
types
3.5 Example systems
CDC Star-100
CDC 6600
MIPS R-4000

3.6 Summaries
Three approaches have been tried to
improve the performance beyond the
ideal CPI case:
superpipeline
superscalar
VLIW(Very Long Instruction Word)
End of Chapter 3
