Slides Chapter 5 Basic Processing Unit

The document describes the basic processing unit of a processor. It discusses how instructions are executed in multiple stages, including fetching instructions from memory, decoding, executing operations, accessing memory if needed, and writing results back to registers. It provides details on the typical components involved in each stage, such as the program counter, instruction register, register file, ALU, and control circuitry. The document uses a 5-stage RISC processor as an example to illustrate how instruction execution can be divided into separate hardware stages in a pipelined fashion.

Uploaded by

Win War

Basic Processing Unit

Credits: Federico Baronti

Processing Unit
• A processor reads program instructions from
the computer’s memory and executes them.
This includes the following basic phases:
– Fetching and decoding the instruction
– Executing the instruction, which includes:
1. Reading one or more registers (in the register file)
2. Doing some computation (in the ALU)
3. Accessing the memory
4. Writing a register (in the register file)
Processor’s building blocks

• PC provides instruction
address
• Instruction is fetched into
IR
• Instruction address
generator updates PC
• ALU performs some
computation during
execution
• Control circuitry interprets
instruction and generates
control signals to perform
the actions needed.
A digital processing system (figure: datapath)
A multi-stage digital processing system (figure: datapath)
Why multi-stage?
• Processing moves from one stage to the next in
each clock cycle
• Such a multi-stage system is the basis for
pipelined operation
– High-performance processors have a pipelined
organization
– Pipelining enables the execution of successive
instructions to be overlapped
• We will get back to pipelining later. Let’s now
focus on the basics of the multi-stage
architecture of a RISC-style processor
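The benefit of overlapping successive instructions can be estimated with a small calculation. The sketch below assumes an ideal pipeline: every stage takes exactly one clock cycle and there are no stalls. The function names are illustrative and do not come from the slides.

```python
def cycles_unpipelined(n_instructions: int, n_stages: int) -> int:
    """Each instruction goes through every stage before the next one starts."""
    return n_instructions * n_stages


def cycles_pipelined(n_instructions: int, n_stages: int) -> int:
    """Ideal pipeline: the first instruction needs n_stages cycles,
    then one further instruction completes in every following cycle."""
    if n_instructions == 0:
        return 0
    return n_stages + (n_instructions - 1)


if __name__ == "__main__":
    for n in (1, 5, 100):
        print(n, cycles_unpipelined(n, 5), cycles_pipelined(n, 5))
```

For long instruction sequences the ratio between the two counts approaches the number of stages, which is why a uniform five-step execution is attractive.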
Instruction execution
• Pipelined organization is most effective if all
instructions can be executed in the same number of
steps.
• Each step is carried out in a separate hardware
stage.
• Processor design will be illustrated using five
hardware stages.
• How can instruction execution be divided into five
steps?
– Let’s start from some representative RISC instructions
A memory access instruction:
Load R5, X(R7)
1. Fetch the instruction and increment the
program counter.
2. Decode the instruction and read the contents
of register R7 in the register file.
3. Compute the effective address = X + [R7].
4. Read the memory source operand.
5. Load the operand into the destination
register, R5.
A computational instruction:
Add R3, R4, R5
1. Fetch the instruction and increment the program
counter.
2. Decode the instruction and read registers
R4 and R5.
3. Compute the sum [R4] + [R5].
4. No action.
5. Load the result into the destination register, R3.

• Stage 4 (memory access) is not involved in this
instruction.
5-stage Architecture of a
RISC Processor
1. Fetch an instruction and increment the program
counter.
2. Decode the instruction and read registers from the
register file.
3. Perform an ALU operation.
4. Read or write memory data if the instruction involves a
memory operand.
5. Write the result into the destination register.

• This sequence determines the hardware stages needed.
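As a compact summary, the two example instructions can be written down as one action per stage, with None marking a stage in which nothing happens. This is only a bookkeeping sketch; the table and the helper idle_stages are illustrative names, not part of the slides.

```python
# Stage-by-stage actions for the two example instructions discussed above.
STAGE_ACTIONS = {
    "Load R5, X(R7)": [
        "Fetch the instruction and increment the program counter",
        "Decode the instruction and read register R7",
        "Compute the effective address X + [R7]",
        "Read the memory source operand",
        "Load the operand into R5",
    ],
    "Add R3, R4, R5": [
        "Fetch the instruction and increment the program counter",
        "Decode the instruction and read registers R4 and R5",
        "Compute the sum [R4] + [R5]",
        None,  # stage 4 (memory access): no action
        "Load the result into R3",
    ],
}


def idle_stages(instruction: str) -> list[int]:
    """Return the 1-based numbers of the stages with no action."""
    actions = STAGE_ACTIONS[instruction]
    return [i for i, action in enumerate(actions, start=1) if action is None]


if __name__ == "__main__":
    for name in STAGE_ACTIONS:
        print(name, "idle stages:", idle_stages(name))
```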

Hardware components: Register file

• A 2-port register file is needed to read the
two source registers at the same time.
• It may be implemented using a 2-port memory.
Hardware components: ALU (1)
• Both source operands and the destination
location are in the register file.
• [RA] and [RB] denote the values of the registers
that are identified by addresses A and B.
• new [RC] denotes the result that is stored to
the register identified by address C.
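A behavioural sketch of this arrangement: a register file with two read ports (addresses A and B) and one write port (address C), feeding an ALU. The number of registers (32) and the 32-bit word width are assumptions made for illustration, as are the class and function names.

```python
WORD_MASK = 0xFFFFFFFF  # assumed 32-bit registers


class RegisterFile:
    """Two read ports (A, B) and one write port (C)."""

    def __init__(self, n_registers: int = 32):  # register count is an assumption
        self._regs = [0] * n_registers

    def read(self, addr_a: int, addr_b: int) -> tuple[int, int]:
        # Both source registers are read at the same time.
        return self._regs[addr_a], self._regs[addr_b]

    def write(self, addr_c: int, value: int) -> None:
        self._regs[addr_c] = value & WORD_MASK


def alu_add(in_a: int, in_b: int) -> int:
    """One ALU function; a real ALU selects among several operations."""
    return (in_a + in_b) & WORD_MASK


if __name__ == "__main__":
    rf = RegisterFile()
    rf.write(4, 10)
    rf.write(5, 32)
    ra, rb = rf.read(4, 5)      # [RA], [RB]
    rf.write(3, alu_add(ra, rb))  # new [RC]
    print(rf.read(3, 3)[0])
```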
Hardware components: ALU (2)
• In this case, one of
the source
operands is the
immediate value
in the IR.
A 5-stage implementation of
a RISC processor
• Instruction processing
moves from stage to stage
in every clock cycle,
starting with fetch.
• The instruction is decoded
and the source registers
are read in stage 2.
• Computation takes place
in the ALU in stage 3.
A 5-stage implementation of
a RISC processor

• …

• If a memory operation is
involved, it takes place in
stage 4.
• The result of the
instruction is written in
the destination register in
stage 5.
The datapath – Stages 2 to 5

• Register file,
used in stages 2 and 5
– (Inter-stage registers RA, RB, RZ, RM, RY
needed to carry data from one stage to
the next)
• ALU stage
• Memory stage
• Final stage to store result
to the register file
Memory stage
• For a calculation
instruction:
– MuxY selects [RZ] to be
placed in RY.
• For a memory instruction:
– RZ provides memory address,
and MuxY selects read data
to be placed in RY.
– RM provides data for a
memory write operation.
• In subroutine calls or
exception handling:
– Input 2 of MuxY is used
(return address stored in the
register file)
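The selection performed by MuxY can be sketched as a three-input multiplexer. The slide only states that input 2 carries the return address; the numbering of the other two inputs (0 for [RZ], 1 for the memory read data) is an assumption made here for illustration.

```python
def mux_y(select: int, rz: int, memory_data: int, return_address: int) -> int:
    """Return the value that is placed in RY.

    select = 0: [RZ], for a calculation instruction (assumed numbering)
    select = 1: memory read data, for a load (assumed numbering)
    select = 2: return address, for subroutine calls or exceptions
    """
    inputs = (rz, memory_data, return_address)
    if not 0 <= select < len(inputs):
        raise ValueError(f"invalid MuxY select value: {select}")
    return inputs[select]


if __name__ == "__main__":
    print(mux_y(0, rz=15, memory_data=99, return_address=0x104))
    print(mux_y(1, rz=0x1008, memory_data=99, return_address=0x104))
    print(hex(mux_y(2, rz=0, memory_data=0, return_address=0x104)))
```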
Instruction Fetch Stage (1)
• MuxMA selects the PC as the
memory address when fetching
instructions (and RZ in the memory
stage; we are assuming a single
memory for instructions and data,
not a Harvard architecture)
• The Instruction address
generator increments the
PC after fetching an
instruction
– It also generates branch
and subroutine addresses.
Instruction Fetch Stage (2)
• When an instruction is
read, it is placed in IR.
• The control circuitry
decodes the instruction.
– It generates the control
signals that drive all
units.
• The Immediate block
extends the immediate
operand to 32 bits,
according to the type of
instruction.
Instruction address generator
• Connections to
registers RY and RA
are used to support
subroutine call and
return instructions
Example: Add R3, R4, R5
1. Memory address ←[PC],
Read memory,
IR←Memory data,
PC ← [PC] + 4
2. Decode instruction,
RA ← [R4], RB ← [R5]
3. RZ ← [RA] + [RB]
4. RY ← [RZ]
5. R3 ← [RY]
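The five register transfers above can be traced in a few lines of Python. This is a sketch under stated assumptions: the instruction is stored as a tuple ("Add", destination, source 1, source 2) rather than a real binary encoding, registers and memory are plain dictionaries, and values are 32 bits wide.

```python
WORD_MASK = 0xFFFFFFFF  # assumed 32-bit datapath


def run_add(regs: dict, memory: dict, pc: int) -> int:
    """Execute one Add instruction step by step; return the updated PC."""
    # Step 1: fetch the instruction and increment the PC.
    ir = memory[pc]
    pc = pc + 4
    # Step 2: decode and read the two source registers.
    opcode, dst, src1, src2 = ir  # hypothetical tuple encoding
    if opcode != "Add":
        raise ValueError("this sketch only handles Add")
    ra, rb = regs[src1], regs[src2]
    # Step 3: ALU operation.
    rz = (ra + rb) & WORD_MASK
    # Step 4: no memory access, the result is just passed along.
    ry = rz
    # Step 5: write the result to the destination register.
    regs[dst] = ry
    return pc


if __name__ == "__main__":
    regs = {3: 0, 4: 7, 5: 8}
    memory = {0x100: ("Add", 3, 4, 5)}
    new_pc = run_add(regs, memory, 0x100)
    print(regs[3], hex(new_pc))
```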
Example: Load R5, X(R7)
1. Memory address ← [PC],
Read memory,
IR ← Memory data,
PC ← [PC] + 4
2. Decode instruction,
RA ← [R7]
3. RZ ← [RA] + Immediate value X
4. Memory address ← [RZ], Read
memory,
RY ← Memory data
5. R5 ← [RY]
Example: Store R6, X(R8)
1. Memory address ← [PC],
Read memory,
IR ← Memory data,
PC ← [PC] + 4
2. Decode instruction,
RA ← [R8], RB ← [R6]
3. RZ ← [RA] + Immediate value
X, RM ← [RB]
4. Memory address ←[RZ],
Memory data ← [RM], Write
memory
5. No action
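Steps 2 to 5 of the Load and Store examples can be compared side by side in a short sketch. The instruction fetch (step 1) is left out, registers and memory are modelled as dictionaries, and the function names and argument layout are illustrative assumptions.

```python
WORD_MASK = 0xFFFFFFFF  # assumed 32-bit addresses and data


def run_load(regs: dict, memory: dict, dst: str, base: str, x: int) -> None:
    """Load dst, x(base): steps 2 to 5."""
    ra = regs[base]              # step 2: RA <- [base]
    rz = (ra + x) & WORD_MASK    # step 3: RZ <- [RA] + immediate X
    ry = memory[rz]              # step 4: read memory at [RZ] into RY
    regs[dst] = ry               # step 5: dst <- [RY]


def run_store(regs: dict, memory: dict, src: str, base: str, x: int) -> None:
    """Store src, x(base): steps 2 to 5."""
    ra, rb = regs[base], regs[src]  # step 2: RA <- [base], RB <- [src]
    rz = (ra + x) & WORD_MASK       # step 3: RZ <- [RA] + immediate X
    rm = rb                         #         RM <- [RB]
    memory[rz] = rm                 # step 4: write [RM] to memory at [RZ]
    # step 5: no action


if __name__ == "__main__":
    regs = {"R5": 0, "R6": 99, "R7": 0x1000, "R8": 0x1000}
    memory = {}
    run_store(regs, memory, "R6", "R8", 8)   # Store R6, 8(R8)
    run_load(regs, memory, "R5", "R7", 8)    # Load  R5, 8(R7)
    print(hex(0x1008), memory[0x1008], regs["R5"])
```

The only structural difference is the direction of the transfer in step 4 and the missing write-back in step 5 of the Store.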
Unconditional branch
1. Memory address ←[PC], Read memory,
IR ← Memory data, PC ←[PC] + 4
2. Decode instruction
3. PC ← [PC] + Branch offset
4. No action
5. No action
Conditional branch: Branch_if_[R5]=[R6] LOOP
1. Memory address ← [PC], Read memory,
IR ← Memory data, PC ←[PC] + 4
2. Decode instruction, RA ← [R5], RB ←[R6]
3. Compare [RA] to [RB],
If [RA] = [RB], then
PC ← [PC] + Branch offset
4. No action
5. No action
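The effect of the conditional branch on the PC can be sketched as follows. Note that the branch offset is added to the PC that has already been incremented in step 1, as the steps above show. The function name and the 32-bit wrap-around are assumptions for illustration.

```python
WORD_MASK = 0xFFFFFFFF  # assumed 32-bit program counter


def conditional_branch(pc: int, ra: int, rb: int, branch_offset: int) -> int:
    """Return the PC value after a Branch_if_[RA]=[RB] instruction."""
    pc = (pc + 4) & WORD_MASK                  # step 1: PC <- [PC] + 4
    if ra == rb:                               # step 3: compare [RA] to [RB]
        pc = (pc + branch_offset) & WORD_MASK  # taken: PC <- [PC] + offset
    return pc                                  # steps 4 and 5: no action


if __name__ == "__main__":
    print(hex(conditional_branch(0x100, 5, 5, -16)))  # branch taken
    print(hex(conditional_branch(0x100, 5, 6, -16)))  # branch not taken
```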
Subroutine call with indirection: Call_register R9
1. Memory address ← [PC], Read memory,
IR ← Memory data, PC ←[PC] + 4
2. Decode instruction, RA ← [R9]
3. PC-Temp ← [PC],
PC ← [RA]
4. RY ← [PC-Temp]
5. Register LINK ← [RY]
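The call sequence can be traced in the same style. The sketch keeps the names used in the steps (PC-Temp, RY, LINK); the dictionary-based register model and the function name are illustrative assumptions.

```python
def call_register(pc: int, regs: dict, target_reg: str) -> int:
    """Trace Call_register; return the new PC and update regs['LINK']."""
    pc = pc + 4              # step 1: PC <- [PC] + 4
    ra = regs[target_reg]    # step 2: RA <- [target register]
    pc_temp = pc             # step 3: PC-Temp <- [PC]
    pc = ra                  #         PC <- [RA]
    ry = pc_temp             # step 4: RY <- [PC-Temp]
    regs["LINK"] = ry        # step 5: LINK <- [RY]
    return pc


if __name__ == "__main__":
    regs = {"R9": 0x2000, "LINK": 0}
    new_pc = call_register(0x100, regs, "R9")
    print(hex(new_pc), hex(regs["LINK"]))
```

The saved value is the address of the instruction that follows the call, which is where a later return resumes execution.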
Control signals
• Select multiplexer inputs to route the flow of
data
• Set the function performed by the ALU
• Determine when data are written into the PC,
the IR, the register file, and the memory
Register file control signals

Generated by decoding
the OPCODE field of the
instruction held in the
IR register
Instruction formats: R and I
ALU control signals

Generated by decoding
the OPCODE field of the
instruction held in the
IR register.
The condition signals output by the ALU
are analyzed by the control circuitry
during the execution of a branch
instruction.
Result selection

Generated by decoding
the OPCODE field of the
instruction held in the
IR register
Memory access
• When data are found in the cache, access to
memory can be completed in one clock cycle.
• Otherwise, read and write operations may require
several clock cycles to load data from main memory
into the cache.
• A control signal is needed to indicate that memory
function has been completed (MFC). E.g., for step 1:
1. Memory address ← [PC], Read memory,
Wait for MFC,
IR ← Memory data, PC ← [PC] + 4
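The role of MFC in step 1 can be illustrated with a toy memory that needs a configurable number of clock cycles per read. The class SlowMemory, its methods, and the latency parameter are hypothetical and only model the waiting behaviour, not a real cache.

```python
class SlowMemory:
    """Toy memory: a read completes after `latency` clock cycles (>= 1)."""

    def __init__(self, contents: dict, latency: int):
        if latency < 1:
            raise ValueError("latency must be at least one cycle")
        self.contents = contents
        self.latency = latency
        self._remaining = 0
        self._address = None

    def start_read(self, address: int) -> None:
        self._address = address
        self._remaining = self.latency

    def tick(self) -> None:
        """Advance the memory by one clock cycle."""
        if self._remaining > 0:
            self._remaining -= 1

    @property
    def mfc(self) -> bool:
        """Memory Function Completed."""
        return self._address is not None and self._remaining == 0

    def data(self):
        return self.contents[self._address]


def fetch(memory: SlowMemory, pc: int):
    """Step 1 with 'Wait for MFC'; returns (IR, new PC, cycles spent)."""
    memory.start_read(pc)       # Memory address <- [PC], Read memory
    cycles = 0
    while True:                 # Wait for MFC
        memory.tick()
        cycles += 1
        if memory.mfc:
            break
    ir = memory.data()          # IR <- Memory data
    return ir, pc + 4, cycles   # PC <- [PC] + 4


if __name__ == "__main__":
    print(fetch(SlowMemory({0x100: "Add R3, R4, R5"}, latency=1), 0x100))
    print(fetch(SlowMemory({0x100: "Add R3, R4, R5"}, latency=4), 0x100))
```

A latency of 1 corresponds to a cache hit; larger values model a miss that has to be served from main memory.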
Memory and IR control signals
Immediate extension options:
1. Imm 16-bit, sign extended
2. Imm 16-bit, unsigned extended
3. Imm 16-bit, “high” extended
4. Imm 26-bit in the CALL instruction,
which is special extended
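The three 16-bit extension modes can be sketched as below. Two points are assumptions: "unsigned extended" is read as zero extension, and "high extended" is read as placing the 16 immediate bits in the upper half of the 32-bit word with zeros below. The 26-bit CALL case is left out because the slide does not say how it is extended.

```python
def extend_immediate(imm16: int, mode: str) -> int:
    """Extend a 16-bit immediate to a 32-bit value.

    mode: "sign", "zero" (unsigned) or "high" (interpretation assumed).
    """
    imm16 &= 0xFFFF
    if mode == "sign":
        # Replicate bit 15 into the upper 16 bits.
        return (imm16 | 0xFFFF0000) if (imm16 & 0x8000) else imm16
    if mode == "zero":
        # Upper 16 bits are filled with zeros.
        return imm16
    if mode == "high":
        # Assumed: immediate goes to bits 31..16, low half is zero.
        return imm16 << 16
    raise ValueError(f"unknown extension mode: {mode}")


if __name__ == "__main__":
    for mode in ("sign", "zero", "high"):
        print(mode, hex(extend_immediate(0x8001, mode)))
```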
Control signals of instruction address generator
Control signal generation
• Circuitry must be implemented to generate control
signals so actions take place in correct sequence and at
correct time.
• There are two basic approaches:
hardwired control and microprogramming
• Hardwired control involves implementing circuitry that
considers step (ring) counter, IR, ALU result, and external
inputs.
• Step (Ring) counter keeps track of execution progress,
one clock cycle for each of the five steps described
(unless a memory access takes longer than one cycle).
Hardwired generation of control signals

E.g.
RF_write = T5&(ALU | Load | Call);
PC_enable = T1&MFC | T3&(BR | Ret | Call);
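The two equations can be checked with a small Python model. The step counter is represented as a one-hot dictionary T1..T5, and the instruction-class signals (ALU, Load, Call, BR, Ret) are passed as booleans; the function names and this representation are illustrative assumptions.

```python
def step_signals(step: int) -> dict:
    """One-hot outputs T1..T5 of the step (ring) counter."""
    if not 1 <= step <= 5:
        raise ValueError("step must be between 1 and 5")
    return {f"T{i}": i == step for i in range(1, 6)}


def rf_write(t: dict, alu: bool = False, load: bool = False,
             call: bool = False) -> bool:
    """RF_write = T5 & (ALU | Load | Call)"""
    return t["T5"] and (alu or load or call)


def pc_enable(t: dict, mfc: bool = False, br: bool = False,
              ret: bool = False, call: bool = False) -> bool:
    """PC_enable = T1 & MFC | T3 & (BR | Ret | Call)"""
    return (t["T1"] and mfc) or (t["T3"] and (br or ret or call))


if __name__ == "__main__":
    for step in range(1, 6):
        t = step_signals(step)
        print(step, rf_write(t, alu=True), pc_enable(t, mfc=True, br=True))
```

For an Add instruction, rf_write is asserted only in step 5; for a branch, pc_enable is asserted in step 1 (once MFC arrives) and again in step 3.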
CISC processors
• CISC-style processors have more complex
instructions.
• The full set of instructions cannot all be
implemented in a fixed number of steps.
• Execution steps for different instructions do not
all follow a prescribed sequence of actions.
• Hardware organization should therefore enable
a flexible flow of data and actions to
accommodate CISC.
Hardware organization for a CISC computer
• Main difference between the
5-stage RISC organization
and the CISC organization:
in the latter, a datapath cannot
be identified easily
• Temporary registers hold temporary
results during instruction
execution
Bus
• An example of an interconnection network.
• When functional units are connected to a
common bus, tri-state drivers are needed.

Register Enable
A 3-bus interconnection network
Example 1: Add R5, R6
1. Memory address ← [PC],
Read memory, Wait for
MFC, IR ← Memory data,
PC ← [PC] + 4
2. Decode instruction
3. R5 ← [R5] + [R6]
A 3-bus interconnection network
Example 2: And X(R7), R9
1. Memory address ← [PC], Read memory,
Wait for MFC,
IR ← Memory data,
PC ← [PC] + 4
2. Decode instruction
3. Memory address ← [PC], Read memory,
Wait for MFC,
Temp1 ← Memory data,
PC ← [PC] + 4
4. Temp2 ← [Temp1] + [R7]
5. Memory address ← [Temp2], Read
memory, Wait for MFC, Temp1 ← Memory
data
6. Temp1 ←[Temp1] AND [R9]
7. Memory address ← [Temp2], Memory data
← [Temp1], Write memory, Wait for MFC

X is stored as a second word of the
instruction
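The seven steps of Example 2 can be traced as follows. The sketch assumes a single dictionary-based memory holding both instruction words and data, a placeholder string for the first instruction word, and 32-bit values; the function name is illustrative.

```python
WORD_MASK = 0xFFFFFFFF  # assumed 32-bit values


def run_and_memory(regs: dict, memory: dict, pc: int) -> int:
    """Trace And X(R7), R9; return the updated PC."""
    # Steps 1-2: fetch the first instruction word and decode it.
    ir = memory[pc]  # contents unused here; decoding is not modelled
    pc = pc + 4
    # Step 3: fetch X, stored as the second word of the instruction.
    temp1 = memory[pc]
    pc = pc + 4
    # Step 4: effective address.
    temp2 = (temp1 + regs["R7"]) & WORD_MASK
    # Step 5: read the memory operand.
    temp1 = memory[temp2]
    # Step 6: perform the AND with [R9].
    temp1 = temp1 & regs["R9"]
    # Step 7: write the result back to the same memory location.
    memory[temp2] = temp1
    return pc


if __name__ == "__main__":
    regs = {"R7": 0x1000, "R9": 0b1010}
    memory = {0x100: "And X(R7), R9", 0x104: 0x20, 0x1020: 0b1100}
    new_pc = run_and_memory(regs, memory, 0x100)
    print(hex(new_pc), bin(memory[0x1020]))
```

Unlike the RISC examples, the number of steps depends on the instruction, and the PC advances by two words because of the extra word holding X.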
References
• C. Hamacher, Z. Vranesic, S. Zaky, N. Manjikian,
"Computer Organization and Embedded Systems,"
McGraw-Hill International Edition
– Chapter V: Basic Processing Unit
