0% found this document useful (0 votes)

231 views34 pages

System Architecture

system architecture

Uploaded by

suman

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

231 views34 pages

System Architecture

system architecture

Uploaded by

suman

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

Systems Architecture

Lecture 14: Floating Point Arithmetic

Jeremy R. Johnson
Anatole D. Ruslanov
William M. Mongan
Some or all figures from Computer Organization and Design: The
Hardware/Software Approach, Third Edition, by David Patterson and
John Hennessy, are copyrighted material (COPYRIGHT 2004
MORGAN KAUFMANN PUBLISHERS, INC. ALL RIGHTS RESERVED).
Lec 14

Systems Architecture

Introduction
Objective: To provide hardware support for floating point
arithmetic. To understand how to represent floating point
numbers in the computer and how to perform arithmetic with
them. Also to learn how to use floating point arithmetic in
MIPS.
Approximate arithmetic
Finite Range
Limited Precision

Topics
IEEE format for single and double precision floating point numbers
Floating point addition and multiplication
Support for floating point computation in MIPS
Lec 14

Systems Architecture

Distribution of Floating Point Numbers

3 bit mantissa
exponent {-1,0,1}

0
Lec 14

e = -1
1.00 X 2^(-1) =
1.01 X 2^(-1) =
1.10 X 2^(-1) =
1.11 X 2^(-1) =

1/2
5/8
3/4
7/8

e=0
1.00 X 2^0 =
1.01 X 2^0 =
1.10 X 2^0 =
1.11 X 2^0 =

2
Systems Architecture

1
5/4
3/2
7/4

e=1
1.00 X 2^1 = 2
1.01 X 2^1 = 5/2
1.10 X 2^1= 3
1.11 X 2^1 = 7/2

3
3

Floating Point
An IEEE floating point representation consists of
A Sign Bit (no surprise)
An Exponent (times 2 to the what?)
Mantissa (Significand), which is assumed to be 1.xxxxx (thus, one
bit of the mantissa is implied as 1)
This is called a normalized representation

So a mantissa = 0 really is interpreted to be 1.0, and a

mantissa of all 1111 is interpreted to be 1.1111
Special cases are used to represent denormalized
mantissas (true mantissa = 0), NaN, etc., as will be
discussed.

Lec 14

Systems Architecture

Floating Point Standard

Defined by IEEE Std 754-1985
Developed in response to divergence of representations
Portability issues for scientific code

Now almost universally adopted

Two representations
Single precision (32-bit)
Double precision (64-bit)

Lec 14

Systems Architecture

IEEE Floating-Point Format

single: 8 bits
double: 11 bits

S Exponent

single: 23 bits
double: 52 bits

Fraction

x ( 1)S (1 Fraction) 2(Exponent Bias)

S: sign bit (0 non-negative, 1 negative)
Normalize significand: 1.0 |significand| < 2.0
Always has a leading pre-binary-point 1 bit, so no need to
represent it explicitly (hidden bit)
Significand is Fraction with the 1. restored
Exponent: excess representation: actual exponent + Bias
Ensures exponent is unsigned
Single: Bias = 127; Double: Bias = 1203
Lec 14

Systems Architecture

Single-Precision Range
Exponents 00000000 and 11111111 reserved
Smallest value
Exponent: 00000001
actual exponent = 1 127 = 126
Fraction: 00000 significand = 1.0
1.0 2126 1.2 1038

Largest value
exponent: 11111110
actual exponent = 254 127 = +127
Fraction: 11111 significand 2.0
2.0 2+127 3.4 10+38
Lec 14

Systems Architecture

Double-Precision Range
Exponents 000000 and 111111 reserved
Smallest value
Exponent: 00000000001
actual exponent = 1 1023 = 1022
Fraction: 00000 significand = 1.0
1.0 21022 2.2 10308

Largest value
Exponent: 11111111110
actual exponent = 2046 1023 = +1023
Fraction: 11111 significand 2.0
2.0 2+1023 1.8 10+308
Lec 14

Systems Architecture

Representation of Floating Point

Numbers
IEEE 754 single precision
31 30

Sign

23 22

Biased exponent

Normalized Mantissa (implicit 24th bit = 1)

(-1)s F 2E-127

Lec 14

Exponent Mantissa Object Represented

0
0
0
0
non-zero
denormalized
1-254
anything
FP number
255
0
pm infinity
255
non-zero
NaN

Systems Architecture

Why biased exponent?

For faster comparisons (for sorting, etc.), allow integer
comparisons of floating point numbers:
Unbiased exponent:

1/2 0 1111 1111 000 0000 0000 0000 0000 0000

2 0 0000 0001 000 0000 0000 0000 0000 0000
Biased exponent:

1/2 0 0111 1110 000 0000 0000 0000 0000 0000

2 0 1000 0000 000 0000 0000 0000 0000 0000
Lec 14

Systems Architecture

Basic Technique
Represent the decimal in the form +/- 1.xxxb x 2y
And fill in the fields
Remember biased exponent and implicit 1. mantissa!

Examples:

Lec 14

0.0: 0 00000000 00000000000000000000000

1.0 (1.0 x 2^0): 0 01111111 00000000000000000000000
0.5 (0.1 binary = 1.0 x 2^-1): 0 01111110 00000000000000000000000
0.75 (0.11 binary = 1.1 x 2^-1): 0 01111110 10000000000000000000000
3.0 (11 binary = 1.1*2^1): 0 10000000 10000000000000000000000
-0.375 (-0.011 binary = -1.1*2^-2): 1 01111101 10000000000000000000000
1 10000011 01000000000000000000000 = - 1.01 * 2^4 = -20.0

Systems Architecture
http://www.math-cs.gordon.edu/courses/cs311/lectures-2003/binary.html
Copyright 2003 - Russell C. Bjork

Basic Technique
One can compute the mantissa just similar to the way one would
convert decimal whole numbers to binary.
Take the decimal and repeatedly multiply the fractional
component by 2. The whole number portion is the next binary
bit.
For whole numbers, append the binary whole number to the
mantissa and shift the exponent until the mantissa is in
normalized form.

Lec 14

Systems Architecture
http://www.newton.dep.anl.gov/newton/askasci/1995/math/MATH065.HTM

Floating-Point Example
Represent 0.75
0.75 = (1)1 1.12 21
S=1
Fraction = 1000002
Exponent = 1 + Bias
Single: 1 + 127 = 126 = 011111102
Double: 1 + 1023 = 1022 = 011111111102

Single: 101111110100000
Double: 101111111110100000

Lec 14

Systems Architecture

Floating-Point Example
What number is represented by the single-precision float
1100000010100000
S=1
Fraction = 01000002
Fxponent = 100000012 = 129

x = (1)1 (1 + 012) 2(129 127)

= (1) 1.25 22
= 5.0

Lec 14

Systems Architecture

Representation of Floating Point

Numbers
IEEE 754 double precision
31 30

Sign

20 19

Biased exponent

Normalized Mantissa (implicit 53rd bit)

(-1)s F 2E-1023
Lec 14

Exponent Mantissa Object Represented

0
0
0
0
non-zero
denormalized
1-2046
anything
FP number
2047
0
pm infinity
2047
non-zero
NaN

Systems Architecture

Floating Point Arithmetic

fl(x) = nearest floating point number to x
Relative error (precision = s digits)
|x - fl(x)|/|x| 1/2 1-s for = 2, 2-s

Arithmetic
x y = fl(x+y) = (x + y)(1 + )
x y = fl(x y)(1 + )

for < u
for < u

ULPUnit in the Last Place is the smallest possible increment

or decrement that can be made using the machine's FP
arithmetic.
Lec 14

Systems Architecture

Floating-Point Precision
Relative precision
all fraction bits are significant
Single: approx 223
Equivalent to 23 log102 23 0.3 6 decimal digits of precision

Double: approx 252

Equivalent to 52 log102 52 0.3 16 decimal digits of precision

Lec 14

Systems Architecture

Is FP addition associative?
Associativity law for addition: a + (b + c) = (a + b) + c
Let a = 2.7 x 1023, b = 2.7 x 1023, and c = 1.0
a + (b + c) = 2.7 x 1023 + ( 2.7 x 1023 + 1.0 ) = 2.7 x 1023 + 2.7
x 1023 = 0.0
(a + b) + c = ( 2.7 x 1023 + 2.7 x 1023 ) + 1.0 = 0.0 + 1.0 = 1.0
Beware Floating Point addition not associative!
The result is approximate
Why the smaller number disappeared?
Lec 14

Systems Architecture

Floating point addition

Start

Sign Exponent

Fraction

Sign Exponent

Fraction
1. Compare the exponents of the two numbers.
Shift the smaller number to the right until its
exponent would match the larger exponent

Small ALU
2. Add the significands

Exponent
difference
0

3. Normalize the sum, either shifting right and

incrementing the exponent or shifting left
and decrementing the exponent

Shift right

Control

Overflow or
underflow?

Big ALU

Yes

Increment or
decrement

Exception

4. Round the significand to the appropriate

number of bits

Shift left or right

Rounding hardware

Still normalized?

Yes

Sign Exponent

Fraction
Done

Lec 14

Systems Architecture

Floating-Point Addition
Consider a 4-digit decimal example
9.999 101 + 1.610 101

1. Align decimal points

Shift number with smaller exponent
9.999 101 + 0.016 101

2. Add significands
9.999 101 + 0.016 101 = 10.015 101

3. Normalize result & check for over/underflow

1.0015 102

4. Round and renormalize if necessary

1.002 102
Lec 14

Systems Architecture

Floating-Point Addition
Now consider a 4-digit binary example
1.0002 21 + 1.1102 22 (0.5 + 0.4375)

1. Align binary points

Shift number with smaller exponent
1.0002 21 + 0.1112 21

2. Add significands
1.0002 21 + 0.1112 21 = 0.0012 21

3. Normalize result & check for over/underflow

1.0002 24, with no over/underflow

4. Round and renormalize if necessary

1.0002 24 (no change) = 0.0625
Lec 14

Systems Architecture

FP Adder Hardware
Much more complex than integer adder
Doing it in one clock cycle would take too long
Much longer than integer operations
Slower clock would penalize all instructions

FP adder usually takes several cycles

Can be pipelined

Lec 14

Systems Architecture

FP Adder Hardware

Step 1

Step 2

Step 3
Step 4
Lec 14

Systems Architecture

Floating Point Multiplication Algorithm

Lec 14

Systems Architecture

FP Arithmetic Hardware
FP multiplier is of similar complexity to FP adder
But uses a multiplier for significands instead of an adder

FP arithmetic hardware usually does

Addition, subtraction, multiplication, division, reciprocal, square-root
FP integer conversion

Operations usually takes several cycles

Can be pipelined

Lec 14

Systems Architecture

FP Instructions in MIPS
FP hardware is coprocessor 1
Adjunct processor that extends the ISA

Separate FP registers
32 single-precision: $f0, $f1, $f31
Paired for double-precision: $f0/$f1, $f2/$f3,
Release 2 of MIPs ISA supports 32 64-bit FP regs

FP instructions operate only on FP registers

Programs generally dont do integer ops on FP
data, or vice versa
More registers with minimal code-size impact

FP load and store instructions

lwc1, ldc1, swc1, sdc1
e.g., ldc1 $f8, 32($sp)
Lec 14

Systems Architecture

FP Instructions in MIPS
Single-precision arithmetic
add.s, sub.s, mul.s, div.s
e.g., add.s $f0, $f1, $f6

Double-precision arithmetic
add.d, sub.d, mul.d, div.d
e.g., mul.d $f4, $f4, $f6

Single- and double-precision comparison

c.xx.s, c.xx.d (xx is eq, lt, le, )
Sets or clears FP condition-code bit
e.g. c.lt.s $f3, $f4

Branch on FP condition code true or false

bc1t, bc1f
e.g., bc1t TargetLabel
Lec 14

Systems Architecture

FP Example: F to C
C code:
float f2c (float fahr) {
return ((5.0/9.0)*(fahr - 32.0));
}
fahr in $f12, result in $f0, literals in global memory
space

Compiled MIPS code:

f2c: lwc1
lwc2
div.s
lwc1
sub.s
mul.s
jr
Lec 14

$f16,
$f18,
$f16,
$f18,
$f18,
$f0,
$ra

const5($gp)
const9($gp)
$f16, $f18
const32($gp)
$f12, $f18
$f16, $f18

Systems Architecture

Rounding
Guard and round digits and sticky bit
When computing result, assume there are several extra digits available
for shifting and computation. This improves accuracy of computation.
Guard digit: first extra digit/bit to the right of mantissa -- used for
rounding addition results
Round digit: second extra digit/bit to the right of mantissa -- used for
rounding multiplication results
Sticky bit: third extra digit/bit to the right of mantissa used for
resolving ties such as 0.50...00 vs. 0.50...01

Lec 14

Systems Architecture

Rounding
examples
An example without guard and round digits
Add 9.76 x 1025 and 2.59 x 1024 assuming 3 digit mantissa
Shift mantissa of the smaller number to the right: 0.25 x 10 25
Add mantissas: 10.01x 1025
Check and normalize mantissa if necessary: 1.00x 10 26

An example with guard and round digits

Add 9.76 x 1025 and 2.59 x 1024 assuming 3 digit mantissa

Lec 14

Internal registers have extra two digits: 9.7600 x 10 25 and 2.5900 x 1024
Shift mantissa of the smaller number to the right: 0.2590 x 10 25
Add mantissas: 10.0190 x 1025
Check and normalize mantissa if necessary: 1.0019 x 10 26
Round the result: 1.00 x 1026

Systems Architecture

Rounding
examples
An example without guard and round digits
Add 9.78 x 1025 and 8.79 x 1024 assuming 3 digit mantissa
Shift mantissa of the smaller number to the right: 0.87 x 10 25
Add mantissas: 10.65 x 1025
Normalize mantissa if necessary: 1.06 x 10 26

An example with guard and round digits

Add 9.78 x 1025 and 8.79 x 1024 assuming 3 digit mantissa

Lec 14

Internal registers have extra two digits: 9.7800 x 10 25 and 8.7900 x 1024
Shift mantissa of the smaller number to the right: 0.8790 x 10 25
Add mantissas (note extra digit on the left): 10.6590 x 10 25
Check and normalize mantissa if necessary: 1.0659 x 10 26
Round the result: 1.07 x 1026

Systems Architecture

IEEE Rounding
Modes
Round toward Infinity: always
round toward the smaller number

1.
2. Round toward + Infinity: always round toward the larger number
3. Round to Zero: always round toward the smallest absolute (truncate)
4. Round toward Nearest Even: always round so that least significant bit
(lsb) is zero
1.40
1.60
1.50
2.50
1.50

Zero 1.00
2.00
1.00

1.00
2.00
1.00

1.00
2.00
2.00
Nearest Even (default)
1.00
2.00
2.00

2.00
2.00
3.00

1.00
2.00
1.00

2.00

When rounding a binary fraction, the least significant digit of rounded

result will be either 1 or 0. Nearest even mode always rounds the number
so that the lsb is 0. Hence, the name. (If we omit the binary point, the
rounded number would be even.)
It can be shown that if we assume uniform distribution of digits, rounding
to nearest mode tends to have mean error = 0.

Lec 14

Systems Architecture

FP Instructions in
MIPS
Floating point operations are
slower than integer operations

Data is rarely converted from integers to float within the same

procedure

1980s solution place FP processing unit in a separate chip

Todays solution imbed FP processing unit in processor chip

Co-processor 1 features:

Contains 32 single precision floating point registers: $f0, $f1, $f31

These registers can also act as 16 double precision registers:

$f0/$f1, $f2/$f3, , $f30/$f31 (only the first one is specified in the instructions)

Uses special floating point instructions, which are similar (in format) to integer
instructions but have .s or .d attached to signify that they work on fp numbers

Several special instructions to move between regular registers and the coprocessor registers

Lec 14

Systems Architecture

FP Instructions in
MIPS
lwc1 / swc1 load/store word coprocessor 1

Move instructions (between processors)

mfc1 rt, rd

Move floating point register rd to CPU register rt

mtc1 rd, rt

Move CPU register rt to floating point register rd

mfc1.d rdest, frsrc1

Move frsrc1 & frsrc1 + 1 to regs rdest & rdest +
1
Single and double precision arithmetic instructions
Single add.s, sub.s, mul.s, div.s, c.lt.s
Double add.d, sub.d, mul.d, div.d, c.lt.d

Examples:

Lec 14

add.s $f0, $f1, $f2

Systems Architecture

sub.d $f0, $f2, $f4

Module 2 - Part 2
No ratings yet
Module 2 - Part 2
24 pages
16-Algorithms For Floating Point Arithmetic Operations and Numericals-01-02-2024
No ratings yet
16-Algorithms For Floating Point Arithmetic Operations and Numericals-01-02-2024
21 pages
IEEE Floating Point Representation Explained
No ratings yet
IEEE Floating Point Representation Explained
31 pages
Floating-Point Representation in Computing
No ratings yet
Floating-Point Representation in Computing
6 pages
Digital Circuit Design Basics
No ratings yet
Digital Circuit Design Basics
28 pages
VHDL Floating-Point Design Lab
100% (1)
VHDL Floating-Point Design Lab
10 pages
FPGA-Based 64-Bit Floating Point Adder
No ratings yet
FPGA-Based 64-Bit Floating Point Adder
11 pages
IEEE Floating Point Representation
No ratings yet
IEEE Floating Point Representation
8 pages
Understanding Floating Point Representation
No ratings yet
Understanding Floating Point Representation
49 pages
Understanding IEEE 754 Floating Point
No ratings yet
Understanding IEEE 754 Floating Point
34 pages
SOC 2040 System Programming Slides
No ratings yet
SOC 2040 System Programming Slides
46 pages
Floating Point Arithmetic Guide
No ratings yet
Floating Point Arithmetic Guide
42 pages
IEEE 754 Floating Point Overview
No ratings yet
IEEE 754 Floating Point Overview
9 pages
Floating Point 6up
No ratings yet
Floating Point 6up
7 pages
IEEE 754 Floating Point Guide
No ratings yet
IEEE 754 Floating Point Guide
38 pages
IEEE 754 Floating Point Representation
No ratings yet
IEEE 754 Floating Point Representation
6 pages
Floating Point & Fixed Point Representation - BCA II
No ratings yet
Floating Point & Fixed Point Representation - BCA II
24 pages
Floating Point Numbers Guide
No ratings yet
Floating Point Numbers Guide
7 pages
Floating Point Numbers
No ratings yet
Floating Point Numbers
27 pages
MIPS Computer Arithmetic Overview
No ratings yet
MIPS Computer Arithmetic Overview
55 pages
Floating Point
No ratings yet
Floating Point
26 pages
Lecture 14 - Arithmetic Subsystems - Numbering Systems and Floating Point Unit (FPU)
No ratings yet
Lecture 14 - Arithmetic Subsystems - Numbering Systems and Floating Point Unit (FPU)
32 pages
IEEE 754: Floating Point Guide
No ratings yet
IEEE 754: Floating Point Guide
10 pages
Floating Point Numbers 237045407 237045407
No ratings yet
Floating Point Numbers 237045407 237045407
20 pages
Computer Arithmetic Basics
No ratings yet
Computer Arithmetic Basics
18 pages
Floating Point Representation
No ratings yet
Floating Point Representation
3 pages
32-Bit Floating Point ALU Design
No ratings yet
32-Bit Floating Point ALU Design
4 pages
Understanding IEEE 754 Number Formats
No ratings yet
Understanding IEEE 754 Number Formats
42 pages
4.4 - 1 New Floating Point
No ratings yet
4.4 - 1 New Floating Point
22 pages
MIPS Floating Point Architecture Overview
No ratings yet
MIPS Floating Point Architecture Overview
34 pages
Module2.1 of Nothing
No ratings yet
Module2.1 of Nothing
7 pages
LECTURE NOTE Fixed and Floating Point Representation
No ratings yet
LECTURE NOTE Fixed and Floating Point Representation
3 pages
Floating-Point Arithmetic Operations (Aligning The Mantissas - Biased Exponent - Overflow)
No ratings yet
Floating-Point Arithmetic Operations (Aligning The Mantissas - Biased Exponent - Overflow)
18 pages
IEEE 754 Floating Point Representation Guide
No ratings yet
IEEE 754 Floating Point Representation Guide
31 pages
Real Number Representation and Floating Point Arithmetic
No ratings yet
Real Number Representation and Floating Point Arithmetic
12 pages
Floating Point Representation
No ratings yet
Floating Point Representation
3 pages
IEEE 754 Floating Point Guide
No ratings yet
IEEE 754 Floating Point Guide
26 pages
DSP Arithmetic
No ratings yet
DSP Arithmetic
33 pages
IEEE 754 Floating Point Guide
No ratings yet
IEEE 754 Floating Point Guide
28 pages
IEEE 754 Floating Point Arithmetic Guide
No ratings yet
IEEE 754 Floating Point Arithmetic Guide
16 pages
Understanding Floating Point Representation
No ratings yet
Understanding Floating Point Representation
21 pages
Understanding Floating Point Numbers
No ratings yet
Understanding Floating Point Numbers
23 pages
Understanding Floating-Point Representation
No ratings yet
Understanding Floating-Point Representation
17 pages
Computer Architecture for Competitive Exams
100% (1)
Computer Architecture for Competitive Exams
108 pages
Floating Point
No ratings yet
Floating Point
33 pages
Fixed Point and Floating Point Number Representations
No ratings yet
Fixed Point and Floating Point Number Representations
7 pages
Part 5 Floating Point Add Sub Mul
No ratings yet
Part 5 Floating Point Add Sub Mul
20 pages
ENSC254 - Floating Point Computation
No ratings yet
ENSC254 - Floating Point Computation
29 pages
Optimizing Floating Point Units for Speed
No ratings yet
Optimizing Floating Point Units for Speed
12 pages
IEEE Floating-Point Representation Guide
No ratings yet
IEEE Floating-Point Representation Guide
26 pages
Floating-Point Representation Guide
No ratings yet
Floating-Point Representation Guide
14 pages
Floating-Point Representation Explained
No ratings yet
Floating-Point Representation Explained
9 pages
COA
No ratings yet
COA
14 pages
Week 5: IEEE Floating Point Revision Guide For Phase Test
No ratings yet
Week 5: IEEE Floating Point Revision Guide For Phase Test
23 pages
Computer Arithmetic and Number Representation
No ratings yet
Computer Arithmetic and Number Representation
24 pages
Understanding Floating-Point Representation
No ratings yet
Understanding Floating-Point Representation
16 pages
Data Representation in Computer Architecture
No ratings yet
Data Representation in Computer Architecture
64 pages
A Handbook On CSIT Engineering
21% (28)
A Handbook On CSIT Engineering
6 pages
Engineering Mathematics For Gate
0% (1)
Engineering Mathematics For Gate
9 pages
Graph Theory
No ratings yet
Graph Theory
17 pages
Web Technologies: HTTP, GET, POST, Cookies
No ratings yet
Web Technologies: HTTP, GET, POST, Cookies
35 pages
UPTU B.Tech Exam Schedule 2014
No ratings yet
UPTU B.Tech Exam Schedule 2014
67 pages
Form 4
No ratings yet
Form 4
2 pages
Understanding Dengue Fever Symptoms
No ratings yet
Understanding Dengue Fever Symptoms
2 pages
M.Tech Admissions 2012: Computational Science
No ratings yet
M.Tech Admissions 2012: Computational Science
3 pages
IEC College Campus Recruitment Bio-Data
No ratings yet
IEC College Campus Recruitment Bio-Data
3 pages
BDA Unit 5 Notes: Big Data Analytics (Anna University)
No ratings yet
BDA Unit 5 Notes: Big Data Analytics (Anna University)
20 pages
Michelangelo: Long Context Evaluations Beyond Haystacks Via Latent Structure Queries
No ratings yet
Michelangelo: Long Context Evaluations Beyond Haystacks Via Latent Structure Queries
37 pages
Data Representation Solutions for 2024-25
No ratings yet
Data Representation Solutions for 2024-25
5 pages
Data Structures & Algorithms Course
No ratings yet
Data Structures & Algorithms Course
31 pages
Cs25co2 Lab Record Mech
No ratings yet
Cs25co2 Lab Record Mech
100 pages
Intel x86 Instruction Set Overview
0% (1)
Intel x86 Instruction Set Overview
44 pages
Python Expression Evaluation Guide
No ratings yet
Python Expression Evaluation Guide
37 pages
Curse NG
No ratings yet
Curse NG
464 pages
Fronius Solar API V1 Operating Guide
No ratings yet
Fronius Solar API V1 Operating Guide
62 pages
Series PM135 Powermeters PM135P/PM135E/PM135EH: Modbus Communications Protocol
No ratings yet
Series PM135 Powermeters PM135P/PM135E/PM135EH: Modbus Communications Protocol
77 pages
FE Exam Preparation Book VOL1 LimitedDisclosureVer
87% (39)
FE Exam Preparation Book VOL1 LimitedDisclosureVer
448 pages
Python Grade Calculation Program
No ratings yet
Python Grade Calculation Program
5 pages
LabVIEW Program Components Explained
No ratings yet
LabVIEW Program Components Explained
531 pages
Java
No ratings yet
Java
129 pages
Time Complexity of Integer Cube Root
No ratings yet
Time Complexity of Integer Cube Root
12 pages
Qbasic Programming Without Stress
88% (8)
Qbasic Programming Without Stress
199 pages
Python Essentials 1
No ratings yet
Python Essentials 1
128 pages
MODCELL Modrtu 1
No ratings yet
MODCELL Modrtu 1
32 pages
Area-Efficient Iterative Logarithmic Approximate Multipliers For IEEE 754 and Posit Numbers
No ratings yet
Area-Efficient Iterative Logarithmic Approximate Multipliers For IEEE 754 and Posit Numbers
13 pages
Java Primitive Data Types Overview
No ratings yet
Java Primitive Data Types Overview
36 pages
PWP Practical No. 4
No ratings yet
PWP Practical No. 4
2 pages
C Programming: Input and Output Basics
No ratings yet
C Programming: Input and Output Basics
10 pages
Python Lexical Structure and Tokens
No ratings yet
Python Lexical Structure and Tokens
15 pages
Unit 1 and Unit 2 Introduction To Microprocessor and Architecture
No ratings yet
Unit 1 and Unit 2 Introduction To Microprocessor and Architecture
27 pages
Introduction to C Programming Basics
No ratings yet
Introduction to C Programming Basics
42 pages
24-bit Vedic Multiplier for FP Division
No ratings yet
24-bit Vedic Multiplier for FP Division
5 pages
Computer Architecture QBank
No ratings yet
Computer Architecture QBank
12 pages
Benjamin - Cummings.c.by - Dissection.1987.SCAN DARKCROWN
No ratings yet
Benjamin - Cummings.c.by - Dissection.1987.SCAN DARKCROWN
484 pages
C++ Cheat Sheet.
100% (1)
C++ Cheat Sheet.
14 pages
Intermediate Report - (Darshan J Ronad-027)
No ratings yet
Intermediate Report - (Darshan J Ronad-027)
21 pages

System Architecture

Uploaded by

System Architecture

Uploaded by

Systems Architecture

Lecture 14: Floating Point Arithmetic

Distribution of Floating Point Numbers

So a mantissa = 0 really is interpreted to be 1.0, and a

Floating Point Standard

Now almost universally adopted

IEEE Floating-Point Format

x ( 1)S (1 Fraction) 2(Exponent Bias)

Representation of Floating Point

Normalized Mantissa (implicit 24th bit = 1)

Exponent Mantissa Object Represented

Why biased exponent?

1/2 0 1111 1111 000 0000 0000 0000 0000 0000

1/2 0 0111 1110 000 0000 0000 0000 0000 0000

0.0: 0 00000000 00000000000000000000000

x = (1)1 (1 + 012) 2(129 127)

Representation of Floating Point

Normalized Mantissa (implicit 53rd bit)

Exponent Mantissa Object Represented

Floating Point Arithmetic

ULPUnit in the Last Place is the smallest possible increment

Double: approx 252

Floating point addition

3. Normalize the sum, either shifting right and

4. Round the significand to the appropriate

Shift left or right

1. Align decimal points

3. Normalize result & check for over/underflow

4. Round and renormalize if necessary

1. Align binary points

3. Normalize result & check for over/underflow

4. Round and renormalize if necessary

FP adder usually takes several cycles

Floating Point Multiplication Algorithm

FP arithmetic hardware usually does

Operations usually takes several cycles

FP instructions operate only on FP registers

FP load and store instructions

Single- and double-precision comparison

Branch on FP condition code true or false

Compiled MIPS code:

An example with guard and round digits

An example with guard and round digits

When rounding a binary fraction, the least significant digit of rounded

Data is rarely converted from integers to float within the same

1980s solution place FP processing unit in a separate chip

Todays solution imbed FP processing unit in processor chip

Contains 32 single precision floating point registers: $f0, $f1, $f31

These registers can also act as 16 double precision registers:

Move instructions (between processors)

Move floating point register rd to CPU register rt

Move CPU register rt to floating point register rd

mfc1.d rdest, frsrc1

add.s $f0, $f1, $f2

sub.d $f0, $f2, $f4

You might also like