Custom Single Purpose Processor Design

The document describes the design of a custom single-purpose processor. It begins by discussing general versus single-purpose processors, and some advantages of single-purpose processors like higher performance, smaller size, and lower power consumption. It then covers combinational logic design, register-transfer level components, sequential logic design using state machines and state tables, and splitting the design into a controller and datapath. The document provides an example of designing a processor to compute the greatest common divisor algorithm. It discusses creating state diagrams, logic gates, and state tables for the controller, and registers and functional units for the datapath. Overall, the document outlines the basic process for designing a custom processor for a specific application from an algorithm.

Uploaded by

Aar Kay Gautam

CUSTOM SINGLE PURPOSE PROCESSOR DESIGN

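General vs. single-purpose processors

Compared with a general-purpose processor, a custom single-purpose processor offers:

  Higher performance
    Due to fewer clock cycles
    Shorter clock cycle
  Smaller size
  Less power consumption

at the cost of:

  High NRE (non-recurring engineering) cost
  Longer time-to-market
  Less flexible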

Combinational logic design

A) Problem description

  y is 1 if a is 1, or if b and c are both 1.
  z is 1 if b or c is 1, but not both, or if a, b, and c are all 1.

B) Truth table

  Inputs      Outputs
  a  b  c     y  z
  0  0  0     0  0
  0  0  1     0  1
  0  1  0     0  1
  0  1  1     1  0
  1  0  0     1  0
  1  0  1     1  1
  1  1  0     1  1
  1  1  1     1  1

C) Output equations

  y = a'bc + ab'c' + ab'c + abc' + abc
  z = a'b'c + a'bc' + ab'c + abc' + abc

D) Minimized output equations

  y         bc                  z         bc
         00 01 11 10                   00 01 11 10
  a=0     0  0  1  0            a=0     0  1  0  1
  a=1     1  1  1  1            a=1     0  1  1  1

  y = a + bc
  z = ab + b'c + bc'

E) Logic gates

  (Figure: gate-level circuit with inputs a, b, c implementing the minimized equations.)
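The minimization step can be checked by brute force over all eight input rows. The following is a minimal sketch in C, not part of the original slides; the function names are hypothetical, and the minimized form of z is taken as ab + b'c + bc'.

```c
#include <stdbool.h>

/* Minimized equations from step D: y = a + bc, z = ab + b'c + bc'. */
bool out_y(bool a, bool b, bool c) { return a || (b && c); }
bool out_z(bool a, bool b, bool c) { return (a && b) || (!b && c) || (b && !c); }

/* Sum-of-minterms equations from step C, used as the reference. */
bool ref_y(bool a, bool b, bool c) {
    return (!a && b && c) || (a && !b && !c) || (a && !b && c) ||
           (a && b && !c) || (a && b && c);
}
bool ref_z(bool a, bool b, bool c) {
    return (!a && !b && c) || (!a && b && !c) || (a && !b && c) ||
           (a && b && !c) || (a && b && c);
}

/* Returns the number of truth-table rows (out of 8) on which the
 * minimized and unminimized forms disagree. */
int count_mismatches(void) {
    int bad = 0;
    for (int row = 0; row < 8; row++) {
        bool a = (row >> 2) & 1, b = (row >> 1) & 1, c = row & 1;
        if (out_y(a, b, c) != ref_y(a, b, c)) bad++;
        if (out_z(a, b, c) != ref_z(a, b, c)) bad++;
    }
    return bad;
}
```

A mismatch count of zero means the minimized equations describe the same truth table as the sum-of-minterms forms.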

RT level Combinational components

n-bit, m x 1 Multiplexor
  Inputs: I(m-1) ... I1 I0 (n bits each), select lines S (log m bits, base 2)
  Output: O (n bits)
  O = I0     if S = 0..00
      I1     if S = 0..01
      ...
      I(m-1) if S = 1..11

log n x n Decoder
  Inputs: I(log n - 1) ... I0
  Outputs: O(n-1) ... O1 O0
  O0     = 1 if I = 0..00
  O1     = 1 if I = 0..01
  ...
  O(n-1) = 1 if I = 1..11
  With enable input e: all O's are 0 if e = 0

n-bit Adder
  Inputs: A, B (n bits each)
  Outputs: sum (n bits), carry
  sum = A + B (first n bits)
  carry = (n+1)th bit of A + B
  With carry-in input Ci: sum = A + B + Ci

n-bit Comparator
  Inputs: A, B (n bits each)
  Outputs: less, equal, greater
  less = 1 if A < B
  equal = 1 if A = B
  greater = 1 if A > B

n-bit, m-function ALU
  Inputs: A, B (n bits each), select lines S (log m bits, base 2)
  Output: O (n bits)
  O = A op B, where op is determined by S
  May have status outputs carry, zero, etc.
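The behavioral definitions above translate almost directly into code. The sketch below is illustrative only and fixes the sizes as assumptions: n = 8 bits, a 4 x 1 multiplexor, and a 3 x 8 decoder. All names are hypothetical.

```c
#include <stdint.h>

/* 8-bit, 4x1 multiplexor: O = I[S]; only the low 2 bits of s are used. */
uint8_t mux4(const uint8_t in[4], unsigned s) { return in[s & 3u]; }

/* 3x8 decoder with enable: one output bit is 1 when e != 0,
 * and all outputs are 0 when e == 0. */
uint8_t decode3(unsigned i, int e) {
    return e ? (uint8_t)(1u << (i & 7u)) : (uint8_t)0;
}

/* 8-bit adder with carry-in: sum is the low 8 bits, carry is the 9th bit. */
typedef struct { uint8_t sum; int carry; } add_result;

add_result add8(uint8_t a, uint8_t b, int ci) {
    unsigned wide = (unsigned)a + (unsigned)b + (ci ? 1u : 0u);
    add_result r = { (uint8_t)(wide & 0xFFu), (int)((wide >> 8) & 1u) };
    return r;
}

/* 8-bit comparator: exactly one of the three outputs is 1. */
typedef struct { int less, equal, greater; } cmp_result;

cmp_result compare8(uint8_t a, uint8_t b) {
    cmp_result r = { a < b, a == b, a > b };
    return r;
}
```

For example, `add8(200, 100, 0)` yields sum 44 with carry 1, since 300 does not fit in 8 bits.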

RT level Sequential components

n-bit Register
  Inputs: I (n bits), load, clear
  Output: Q (n bits)
  Q = 0           if clear = 1,
      I           if load = 1 and clock = 1,
      Q(previous) otherwise.

n-bit Shift register
  Inputs: I, shift
  Output: Q
  Q = lsb
  - Content shifted
  - I stored in msb

n-bit Counter
  Inputs: count, clear
  Output: Q (n bits)
  Q = 0         if clear = 1,
      Q(prev)+1 if count = 1 and clock = 1.
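As a rough illustration of these three components, each can be modeled as a function that returns the stored value after one clock edge. This is a sketch under assumptions not stated in the slides: n = 8, clear is sampled at the clock edge and takes priority, and the shift register moves its content toward the lsb. The function names are hypothetical.

```c
#include <stdint.h>

/* 8-bit register: clear has priority, then load; otherwise hold. */
uint8_t reg_next(uint8_t q, uint8_t i, int load, int clear) {
    if (clear) return 0;
    if (load)  return i;
    return q;
}

/* 8-bit shift register: on shift, the content moves one position toward
 * the lsb and the 1-bit input i is stored in the msb.
 * The serial output is the lsb of the stored value. */
uint8_t shift_next(uint8_t q, int i, int shift) {
    if (!shift) return q;
    return (uint8_t)((q >> 1) | (i ? 0x80u : 0u));
}

/* 8-bit counter: clear has priority, then count; wraps around at 255. */
uint8_t count_next(uint8_t q, int count, int clear) {
    if (clear) return 0;
    if (count) return (uint8_t)(q + 1u);
    return q;
}
```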

Sequential logic design

A) Problem description

  You want to construct a clock divider. Slow down your preexisting clock so that you output a 1 for every four clock cycles.

B) State diagram

  Four states, 0 through 3. In every state, a=0 keeps the machine in the same state and a=1 advances it to the next state; state 3 wraps around to state 0. The output is x=0 in states 0, 1, and 2, and x=1 in state 3.

C) Implementation model

  A state register holds Q1 Q0. Combinational logic takes a, Q1, and Q0 as inputs and produces the output x and the next-state values I1 I0, which are loaded into the state register.

D) State table (Moore-type)

  Inputs         Outputs
  Q1 Q0  a       I1 I0  x
  0  0   0       0  0   0
  0  0   1       0  1   0
  0  1   0       0  1   0
  0  1   1       1  0   0
  1  0   0       1  0   0
  1  0   1       1  1   0
  1  1   0       1  1   1
  1  1   1       0  0   1

Given this implementation model, sequential logic design quickly reduces to combinational logic design.

Sequential logic design (cont.)

E) Minimized output equations

  I1       Q1Q0              I0       Q1Q0              x        Q1Q0
        00 01 11 10                00 01 11 10                00 01 11 10
  a=0    0  0  1  1          a=0    0  1  1  0          a=0    0  0  1  0
  a=1    0  1  0  1          a=1    1  0  0  1          a=1    0  0  1  0

  I1 = Q1'Q0a + Q1a' + Q1Q0'
  I0 = Q0a' + Q0'a
  x  = Q1Q0

F) Combinational logic

  (Figure: gate-level circuit computing I1, I0, and x from a, Q1, and Q0.)
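The clock divider can be simulated by evaluating the minimized equations once per clock tick, taking them as I1 = Q1'Q0a + Q1a' + Q1Q0', I0 = Q0a' + Q0'a, and x = Q1Q0. The following is a minimal sketch; the function names and the packing of Q1 Q0 into one integer are assumptions made for illustration.

```c
/* One clock tick of the divider. *q holds the state (bit 1 = Q1, bit 0 = Q0).
 * Returns the Moore output x of the state before the tick and updates *q
 * with the next state computed from the minimized equations. */
int divider_tick(unsigned *q, int a) {
    int q1 = (int)((*q >> 1) & 1u);
    int q0 = (int)(*q & 1u);
    a = a ? 1 : 0;
    int x  = q1 & q0;
    int i1 = (!q1 & q0 & a) | (q1 & !a) | (q1 & !q0);
    int i0 = (q0 & !a) | (!q0 & a);
    *q = (unsigned)((i1 << 1) | i0);
    return x;
}

/* Starting in state 0 with a held at 1, counts the ticks on which x = 1. */
int count_pulses(int ticks) {
    unsigned q = 0;
    int pulses = 0;
    for (int t = 0; t < ticks; t++)
        pulses += divider_tick(&q, 1);
    return pulses;
}
```

With a held at 1, the sketch produces one x = 1 pulse in every four ticks, which is the behavior the problem description asks for.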

Custom single-purpose processor basic model

Controller and datapath

  Controller
    Inputs: external control inputs, datapath control outputs
    Outputs: external control outputs, datapath control inputs
  Datapath
    Inputs: external data inputs, datapath control inputs
    Outputs: external data outputs, datapath control outputs

A view inside the controller and datapath

  Controller: next-state and control logic, state register
  Datapath: registers, functional units

Example: greatest common divisor

First create algorithm
Convert algorithm to "complex" state machine
  Known as FSMD: finite-state machine with datapath
  Can use templates to perform such conversion

(a) Black-box view

  Inputs: go_i, x_i, y_i
  Output: d_o

(b) Desired functionality

  0: int x, y;
  1: while (1) {
  2:   while (!go_i);
  3:   x = x_i;
  4:   y = y_i;
  5:   while (x != y) {
  6:     if (x < y)
  7:       y = y - x;
         else
  8:       x = x - y;
       }
  9:   d_o = x;
     }

(c) State diagram

  State   Action       Transitions
  1:                   1: go to 2  (!1 is never taken)
  2:                   !go_i: go to 2-J;  !(!go_i): go to 3
  2-J:                 go to 2
  3:      x = x_i      go to 4
  4:      y = y_i      go to 5
  5:                   x!=y: go to 6;  !(x!=y): go to 9
  6:                   x<y: go to 7;  !(x<y): go to 8
  7:      y = y - x    go to 6-J
  8:      x = x - y    go to 6-J
  6-J:                 go to 5-J
  5-J:                 go to 5
  9:      d_o = x      go to 1-J
  1-J:                 go to 1
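One way to see how the FSMD executes is to simulate it with one `switch` case per state. The sketch below is an illustration rather than the deck's own implementation. It assumes that go_i is held at 1, that one state visit stands for one clock cycle, and that both inputs are positive so the subtraction loop terminates. The enum and function names are hypothetical.

```c
#include <stddef.h>

/* One enumerator per FSMD state (S2J stands for state 2-J, and so on). */
typedef enum { S1, S2, S2J, S3, S4, S5, S6, S7, S8, S6J, S5J, S9, S1J } gcd_state;

/* Runs the FSMD for one GCD computation and returns d_o.
 * If cycles is not NULL, it receives the number of state visits. */
unsigned gcd_fsmd(unsigned x_i, unsigned y_i, unsigned *cycles) {
    unsigned x = 0, y = 0, d_o = 0, n = 0;
    const int go_i = 1;               /* assumption: go_i already asserted */
    gcd_state s = S1;
    for (;;) {
        n++;
        switch (s) {
        case S1:  s = S2; break;                     /* while (1) */
        case S2:  s = go_i ? S3 : S2J; break;        /* while (!go_i) */
        case S2J: s = S2; break;
        case S3:  x = x_i; s = S4; break;
        case S4:  y = y_i; s = S5; break;
        case S5:  s = (x != y) ? S6 : S9; break;     /* while (x != y) */
        case S6:  s = (x < y) ? S7 : S8; break;      /* if (x < y) */
        case S7:  y = y - x; s = S6J; break;
        case S8:  x = x - y; s = S6J; break;
        case S6J: s = S5J; break;
        case S5J: s = S5; break;
        case S9:  d_o = x; s = S1J; break;
        case S1J:                                    /* hardware returns to state 1 */
            if (cycles != NULL) *cycles = n;
            return d_o;
        }
    }
}
```

Calling `gcd_fsmd(42, 8, NULL)` follows the same sequence of x and y values as the C program in (b).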

State diagram templates

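Assignment statement

  a = b
  next statement

  Becomes a state that performs a = b, followed by the state for the next statement.

Loop statement

  while (cond) {
    loop-body-statements
  }
  next statement

  Becomes a condition state C: with two transitions. On cond, control enters the loop-body statements, which end in a join state J: that returns to C:. On !cond, control goes to the next statement.

Branch statement

  if (c1)
    c1 stmts
  else if c2
    c2 stmts
  else
    other stmts
  next statement

  Becomes a condition state C: with three transitions: c1 leads to the c1 stmts, !c1*c2 leads to the c2 stmts, and !c1*!c2 leads to the other stmts. All branches end in a join state J:, which leads to the next statement.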

Creating the datapath

Create a register for any declared variable
Create a functional unit for each arithmetic operation
Connect the ports, registers and functional units
  Based on reads and writes
  Use multiplexors for multiple sources

Datapath for the GCD example (labels give the program line each component comes from):

  Component               Label       Control or status signal
  n-bit 2x1 multiplexor   (for x)     x_sel (selects x_i or x - y)
  n-bit 2x1 multiplexor   (for y)     y_sel (selects y_i or y - x)
  register x              0: x        x_ld
  register y              0: y        y_ld
  != comparator           5: x!=y     x_neq_y
  < comparator            6: x<y      x_lt_y
  subtractor              8: x-y
  subtractor              7: y-x
  register d              9: d        d_ld (register output drives d_o)

  Data inputs: x_i, y_i. Data output: d_o.

Creating the controller's FSM

Same structure as FSMD
Replace complex actions/conditions with datapath configurations

  Code   State   Control outputs         Next state
  0000   1:                              2:
  0001   2:                              2-J: if !go_i;  3: if go_i
  0010   2-J:                            2:
  0011   3:      x_sel = 0, x_ld = 1     4:
  0100   4:      y_sel = 0, y_ld = 1     5:
  0101   5:                              6: if x_neq_y;  9: if !x_neq_y
  0110   6:                              7: if x_lt_y;  8: if !x_lt_y
  0111   7:      y_sel = 1, y_ld = 1     6-J:
  1000   8:      x_sel = 1, x_ld = 1     6-J:
  1001   6-J:                            5-J:
  1010   5-J:                            5:
  1011   9:      d_ld = 1                1-J:
  1100   1-J:                            1:

The datapath is unchanged: multiplexors driven by x_sel and y_sel, registers x, y, and d loaded by x_ld, y_ld, and d_ld, the != and < comparators producing x_neq_y and x_lt_y, and the two subtractors.

Splitting into a controller and datapath

Controller implementation model

  Combinational logic plus a 4-bit state register (outputs Q3 Q2 Q1 Q0, inputs I3 I2 I1 I0).
  Inputs to the combinational logic: go_i, x_neq_y, x_lt_y, Q3 Q2 Q1 Q0
  Outputs of the combinational logic: x_sel, y_sel, x_ld, y_ld, d_ld, I3 I2 I1 I0

(a) Controller

  The controller FSM for the GCD example, with state codes 0000 (1:) through 1100 (1-J:).

(b) Datapath

  The GCD datapath: multiplexors, registers x, y, and d, the != and < comparators, and the two subtractors. It receives x_sel, y_sel, x_ld, y_ld, and d_ld from the controller and returns x_neq_y and x_lt_y.

Controller state table for the GCD example

  Inputs                                 Outputs
  Q3 Q2 Q1 Q0  x_neq_y  x_lt_y  go_i     I3 I2 I1 I0  x_sel  y_sel  x_ld  y_ld  d_ld
  0  0  0  0   *        *       *        0  0  0  1   X      X      0     0     0
  0  0  0  1   *        *       0        0  0  1  0   X      X      0     0     0
  0  0  0  1   *        *       1        0  0  1  1   X      X      0     0     0
  0  0  1  0   *        *       *        0  0  0  1   X      X      0     0     0
  0  0  1  1   *        *       *        0  1  0  0   0      X      1     0     0
  0  1  0  0   *        *       *        0  1  0  1   X      0      0     1     0
  0  1  0  1   0        *       *        1  0  1  1   X      X      0     0     0
  0  1  0  1   1        *       *        0  1  1  0   X      X      0     0     0
  0  1  1  0   *        0       *        1  0  0  0   X      X      0     0     0
  0  1  1  0   *        1       *        0  1  1  1   X      X      0     0     0
  0  1  1  1   *        *       *        1  0  0  1   X      1      0     1     0
  1  0  0  0   *        *       *        1  0  0  1   1      X      1     0     0
  1  0  0  1   *        *       *        1  0  1  0   X      X      0     0     0
  1  0  1  0   *        *       *        0  1  0  1   X      X      0     0     0
  1  0  1  1   *        *       *        1  1  0  0   X      X      0     0     1
  1  1  0  0   *        *       *        0  0  0  0   X      X      0     0     0
  1  1  0  1   *        *       *        0  0  0  0   X      X      0     0     0
  1  1  1  0   *        *       *        0  0  0  0   X      X      0     0     0
  1  1  1  1   *        *       *        0  0  0  0   X      X      0     0     0

  (* marks a don't-care input, X a don't-care output.)
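The split into controller and datapath can be sketched as two functions that are clocked together: one maps the current state and status inputs to a next state and control outputs, the other updates the registers under those control outputs. This is an illustrative model only. It drives don't-care outputs to 0, holds go_i at 1, and assumes positive inputs; the type and function names are hypothetical.

```c
/* Datapath control inputs produced by the controller. */
typedef struct { unsigned x_sel, y_sel, x_ld, y_ld, d_ld; } ctrl_t;
/* Datapath registers. */
typedef struct { unsigned x, y, d; } regs_t;

/* Controller: returns the next 4-bit state and fills in the control
 * outputs for the current state q, following the state table. */
unsigned controller(unsigned q, int x_neq_y, int x_lt_y, int go_i, ctrl_t *c) {
    ctrl_t none = {0, 0, 0, 0, 0};
    *c = none;                                   /* don't-cares driven to 0 */
    switch (q) {
    case 0x0: return 0x1;                        /* 1:   */
    case 0x1: return go_i ? 0x3 : 0x2;           /* 2:   */
    case 0x2: return 0x1;                        /* 2-J: */
    case 0x3: c->x_ld = 1; return 0x4;           /* 3: x_sel = 0 */
    case 0x4: c->y_ld = 1; return 0x5;           /* 4: y_sel = 0 */
    case 0x5: return x_neq_y ? 0x6 : 0xB;        /* 5:   */
    case 0x6: return x_lt_y ? 0x7 : 0x8;         /* 6:   */
    case 0x7: c->y_sel = 1; c->y_ld = 1; return 0x9;
    case 0x8: c->x_sel = 1; c->x_ld = 1; return 0x9;
    case 0x9: return 0xA;                        /* 6-J: */
    case 0xA: return 0x5;                        /* 5-J: */
    case 0xB: c->d_ld = 1; return 0xC;           /* 9:   */
    default:  return 0x0;                        /* 1-J: and unused codes */
    }
}

/* Datapath: one clock edge. Multiplexor outputs are computed from the
 * old register values before any register is loaded. */
void datapath(regs_t *r, const ctrl_t *c, unsigned x_i, unsigned y_i) {
    unsigned x_next = c->x_sel ? r->x - r->y : x_i;
    unsigned y_next = c->y_sel ? r->y - r->x : y_i;
    unsigned d_next = r->x;
    if (c->x_ld) r->x = x_next;
    if (c->y_ld) r->y = y_next;
    if (c->d_ld) r->d = d_next;
}

/* Clocks both parts until state 1100 (1-J) is reached and returns d_o. */
unsigned gcd_split(unsigned x_i, unsigned y_i) {
    regs_t r = {0, 0, 0};
    ctrl_t c;
    unsigned q = 0x0;
    while (q != 0xC) {
        unsigned next = controller(q, r.x != r.y, r.x < r.y, 1, &c);
        datapath(&r, &c, x_i, y_i);
        q = next;
    }
    return r.d;
}
```

The controller never touches data values, and the datapath never decides which state comes next; the only coupling is through the five control signals and the two status signals.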

Design a custom single-purpose processor for Fibonacci numbers up to n

  int i, j, k, n, outp;
  while (1) {
    while (!go_i);
    n = n_i;
    i = 0; j = 1; k = 0;
    outp = i;
    outp = j;
    while (k <= n) {
      k = i + j;
      i = j;
      j = k;
      outp = k;
    }
  }

RT-level custom single-purpose processor design

We often start with a state machine
  Rather than algorithm
  Cycle timing often too central to functionality

Example
  Bus bridge that converts 4-bit bus to 8-bit bus
  Start with FSMD
  Known as register-transfer (RT) level

Problem specification

  Sender -> Bridge: rdy_in, data_in(4)
  Bridge -> Receiver: rdy_out, data_out(8)
  The bridge also has a clock input.

  Bridge: a single-purpose processor that converts two 4-bit inputs, arriving one at a time over data_in along with a rdy_in pulse, into one 8-bit output on data_out along with a rdy_out pulse.

FSMD

  Inputs     rdy_in: bit; data_in: bit[4];
  Outputs    rdy_out: bit; data_out: bit[8];
  Variables  data_lo, data_hi: bit[4];

  State             Action                                     Transitions
  WaitFirst4                                                   rdy_in=0: stay;  rdy_in=1: RecFirst4Start
  RecFirst4Start    data_lo=data_in                            RecFirst4End
  RecFirst4End                                                 rdy_in=1: stay;  rdy_in=0: WaitSecond4
  WaitSecond4                                                  rdy_in=0: stay;  rdy_in=1: RecSecond4Start
  RecSecond4Start   data_hi=data_in                            RecSecond4End
  RecSecond4End                                                rdy_in=1: stay;  rdy_in=0: Send8Start
  Send8Start        data_out=data_hi & data_lo, rdy_out=1      Send8End
  Send8End          rdy_out=0                                  WaitFirst4

  (Here & joins the two 4-bit values into one 8-bit value, with data_hi as the upper half.)

RT-level custom single-purpose processor design (cont)

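Bridge

(a) Controller

  Same states and transitions as the FSMD, with register-load signals in place of the data operations:

  State             Control outputs
  RecFirst4Start    data_lo_ld=1
  RecSecond4Start   data_hi_ld=1
  Send8Start        data_out_ld=1, rdy_out=1
  Send8End          rdy_out=0

  The states WaitFirst4, RecFirst4End, WaitSecond4, and RecSecond4End only wait on rdy_in.
  Controller inputs: rdy_in, clk. Controller outputs: rdy_out, data_out_ld, data_hi_ld, data_lo_ld.

(b) Datapath

  Registers data_hi and data_lo are loaded from data_in(4) under data_hi_ld and data_lo_ld. Register data_out is loaded from data_hi and data_lo under data_out_ld. clk goes to all registers.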
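The bridge's handshake is easier to follow when the eight states are stepped through in code. The sketch below models the FSMD with one function call per clock cycle. It is illustrative only: the struct layout, the helper `send_nibble`, and the choice to hold rdy_in high for two cycles per nibble are assumptions, and the `&` in the FSMD is treated as concatenation with data_hi as the upper four bits.

```c
#include <stdint.h>

typedef enum { WaitFirst4, RecFirst4Start, RecFirst4End, WaitSecond4,
               RecSecond4Start, RecSecond4End, Send8Start, Send8End } bridge_state;

typedef struct {
    bridge_state s;
    uint8_t data_lo, data_hi;   /* 4-bit variables, stored in the low bits */
    uint8_t data_out;           /* 8-bit output */
    int rdy_out;
} bridge_t;

/* One clock cycle: perform the current state's action, then move on. */
void bridge_tick(bridge_t *b, int rdy_in, uint8_t data_in) {
    switch (b->s) {
    case WaitFirst4:      if (rdy_in)  b->s = RecFirst4Start;  break;
    case RecFirst4Start:  b->data_lo = (uint8_t)(data_in & 0xFu);
                          b->s = RecFirst4End;                 break;
    case RecFirst4End:    if (!rdy_in) b->s = WaitSecond4;     break;
    case WaitSecond4:     if (rdy_in)  b->s = RecSecond4Start; break;
    case RecSecond4Start: b->data_hi = (uint8_t)(data_in & 0xFu);
                          b->s = RecSecond4End;                break;
    case RecSecond4End:   if (!rdy_in) b->s = Send8Start;      break;
    case Send8Start:      b->data_out = (uint8_t)((b->data_hi << 4) | b->data_lo);
                          b->rdy_out = 1;
                          b->s = Send8End;                     break;
    case Send8End:        b->rdy_out = 0;
                          b->s = WaitFirst4;                   break;
    }
}

/* Hypothetical sender: raises rdy_in for two cycles with the nibble on
 * data_in, then lowers it for one cycle. */
static void send_nibble(bridge_t *b, uint8_t nibble) {
    bridge_tick(b, 1, nibble);   /* Wait...    -> Rec...Start */
    bridge_tick(b, 1, nibble);   /* Rec...Start latches the nibble */
    bridge_tick(b, 0, 0);        /* Rec...End  -> next state */
}

/* Sends the low nibble, then the high nibble, and returns the byte that
 * is on data_out while rdy_out is 1 (0 if rdy_out never rose). */
uint8_t bridge_convert(uint8_t lo, uint8_t hi) {
    bridge_t b = { WaitFirst4, 0, 0, 0, 0 };
    send_nibble(&b, lo);
    send_nibble(&b, hi);
    bridge_tick(&b, 0, 0);       /* Send8Start */
    return b.rdy_out ? b.data_out : (uint8_t)0;
}
```

The separate Start and End states for each nibble are what make the bridge wait for rdy_in to drop before it looks for the next pulse, so one long pulse is not read as two nibbles.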

Optimizing single-purpose processors

Optimization is the task of making design metric values the best possible
Optimization opportunities
  original program
  FSMD
  datapath
  FSM

Optimizing the original program

Analyze program attributes and look for areas of possible improvement
  number of computations
  size of variable
  time and space complexity
  operations used
    multiplication and division very expensive

Optimizing the original program (cont)

original program

  0: int x, y;
  1: while (1) {
  2:   while (!go_i);
  3:   x = x_i;
  4:   y = y_i;
  5:   while (x != y) {
  6:     if (x < y)
  7:       y = y - x;
         else
  8:       x = x - y;
       }
  9:   d_o = x;
     }

  GCD(42, 8): 9 iterations to complete the loop, counting each evaluation of the loop condition. x and y values evaluated as follows: (42, 8), (34, 8), (26, 8), (18, 8), (10, 8), (2, 8), (2, 6), (2, 4), (2, 2).

Replace the subtraction operation(s) with a modulo operation in order to speed up the program.

optimized program

  int x, y, r;
  while (1) {
    while (!go_i);
    // x must be the larger number
    if (x_i >= y_i) {
      x = x_i;
      y = y_i;
    }
    else {
      x = y_i;
      y = x_i;
    }
    while (y != 0) {
      r = x % y;
      x = y;
      y = r;
    }
    d_o = x;
  }

  GCD(42, 8): 3 iterations to complete the loop, counted the same way. x and y values evaluated as follows: (42, 8), (8, 2), (2, 0).
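The difference between the two programs can be measured by counting loop-body executions. The sketch below wraps each loop in a function; the function names and the `steps` out-parameter are hypothetical, and both inputs are assumed to be positive.

```c
/* Subtraction-based loop from the original program.
 * *steps receives the number of times the loop body ran. */
unsigned gcd_sub(unsigned x, unsigned y, unsigned *steps) {
    unsigned n = 0;
    while (x != y) {
        if (x < y)
            y = y - x;
        else
            x = x - y;
        n++;
    }
    *steps = n;
    return x;
}

/* Modulo-based loop from the optimized program. */
unsigned gcd_mod(unsigned x_i, unsigned y_i, unsigned *steps) {
    unsigned x, y, r, n = 0;
    if (x_i >= y_i) { x = x_i; y = y_i; }   /* x must be the larger number */
    else            { x = y_i; y = x_i; }
    while (y != 0) {
        r = x % y;
        x = y;
        y = r;
        n++;
    }
    *steps = n;
    return x;
}
```

For GCD(42, 8) the subtraction loop body runs 8 times and the modulo loop body runs 2 times, one fewer in each case than the number of loop-condition evaluations. The saving is not free in hardware: the modulo version needs a divider-like functional unit, and division is one of the expensive operations noted earlier.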

Optimizing the FSMD

Areas of possible improvements
  merge states
    states with constants on transitions can be eliminated, transition taken is already known
    states with independent operations can be merged
  separate states
    states which require complex operations (a*b*c*d) can be broken into smaller states to reduce hardware size
  scheduling

Optimizing the FSMD (cont.)

original FSMD

  int x, y;
  States 1:, 2:, 2-J:, 3:, 4:, 5:, 6:, 7:, 8:, 6-J:, 5-J:, 9:, and 1-J:, as in the GCD state diagram.

Optimization steps

  eliminate state 1: transitions have constant values
  merge state 2 and state 2-J: no loop operation in between them
  merge state 3 and state 4: assignment operations are independent of one another
  merge state 5 and state 6: transitions from state 6 can be done in state 5
  eliminate state 5-J and 6-J: transitions from each state can be done from state 7 and state 8, respectively
  eliminate state 1-J: transition from state 1-J can be done directly from state 9

optimized FSMD

  int x, y;

  State   Action                 Transitions
  2:                             !go_i: stay in 2;  go_i: go to 3
  3:      x = x_i, y = y_i       go to 5
  5:                             x<y: go to 7;  x>y: go to 8;  otherwise (x equals y): go to 9
  7:      y = y - x              go to 5
  8:      x = x - y              go to 5
  9:      d_o = x                go to 2
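The effect of the merged and eliminated states shows up as a shorter state sequence. The sketch below simulates the optimized FSMD with six states. As before, it is only an illustration: go_i is held at 1, one state visit stands for one clock cycle, inputs are assumed positive, and the names are hypothetical.

```c
/* States of the optimized FSMD. */
typedef enum { S2, S3, S5, S7, S8, S9 } opt_state;

/* Runs one GCD computation, returns d_o, and stores the number of
 * state visits in *cycles. */
unsigned gcd_fsmd_opt(unsigned x_i, unsigned y_i, unsigned *cycles) {
    unsigned x = 0, y = 0, n = 0;
    const int go_i = 1;               /* assumption: go_i already asserted */
    opt_state s = S2;
    for (;;) {
        n++;
        switch (s) {
        case S2: if (go_i) s = S3; break;
        case S3: x = x_i; y = y_i; s = S5; break;      /* merged 3 and 4 */
        case S5: s = (x < y) ? S7 : (x > y) ? S8 : S9; /* merged 5 and 6 */
                 break;
        case S7: y = y - x; s = S5; break;
        case S8: x = x - y; s = S5; break;
        case S9: *cycles = n;                          /* d_o = x */
                 return x;
        }
    }
}
```

Under these assumptions GCD(42, 8) takes 20 state visits: two to load the inputs, two per subtraction for eight subtractions, and two to detect equality and write the output.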

Optimizing the datapath

Sharing of functional units
  one-to-one mapping, as done previously, is not necessary
  if the same operation occurs in different states, those states can share a single functional unit
Multi-functional units
  an ALU supports a variety of operations, so it can be shared among operations occurring in different states

Optimizing the FSM

State encoding
  task of assigning a unique bit pattern to each state in an FSM
  size of state register and combinational logic vary
  can be treated as an ordering problem
State minimization
  task of merging equivalent states into a single state
  two states are equivalent if, for all possible input combinations, they generate the same outputs and transition to the same next state