ELEC2041 Microprocessors and Interfacing Lectures 21: Floating Point Number Representation - III

This lecture covers floating point number representation, including special numbers such as NaN and denorms, IEEE rounding modes, floating point operations in ARM, and a matrix multiplication example using floating point numbers. It also discusses pitfalls of floating point numbers, such as the lack of associativity in addition and subtraction, and rounding errors when converting between integer and floating point representations.

Uploaded by

Raj Kumar


ELEC2041 Microprocessors and Interfacing Lectures 21: Floating Point Number Representation III http://webct.edtec.unsw.edu.au/

April 2006 Saeid Nooshabadi [email protected]

ELEC2041 lec21-fp-III.1 Saeid Nooshabadi

Overview
Special Floating Point Numbers: NaN, Denorms
IEEE Rounding modes
Floating Point fallacies, hacks
Using floating point in C and ARM
Multi Dimensional Array layouts

Review: ARM Fl. Pt. Architecture

Floating Point Data: approximate representation of very large or very small numbers in 32 bits or 64 bits
IEEE 754 Floating Point Standard is the most widely accepted attempt to standardize interpretation of such numbers
New ARM registers (s0-s31), instructions:
  Single Precision (32 bits, $2 \times 10^{-38}$ to $2 \times 10^{38}$): fadds, fsubs, fmuls, fdivs
  Double Precision (64 bits, $2 \times 10^{-308}$ to $2 \times 10^{308}$): faddd, fsubd, fmuld, fdivd
  Comparison: fcmps, fcmpd

Big Idea: Instructions determine meaning of data; nothing inherent inside the data

Review: Floating Point Representation
Single Precision and Double Precision

Single Precision (one 32-bit word):
  Bit 31       S (sign)       1 bit
  Bits 30-23   Exponent       8 bits
  Bits 22-0    Significand    23 bits

Double Precision (two 32-bit words):
  Bit 31       S (sign)       1 bit
  Bits 30-20   Exponent       11 bits
  Bits 19-0    Significand    20 bits
  Second word  Significand (cont'd)    32 bits

$(-1)^S \times (1 + \text{Significand}) \times 2^{(\text{Exponent} - \text{Bias})}$

New ARM arithmetic instructions

Example           Meaning                  Comments
fadds s0,s1,s2    s0 = s1 + s2             Fl. Pt. Add (single)
faddd d0,d1,d2    d0 = d1 + d2             Fl. Pt. Add (double)
fsubs s0,s1,s2    s0 = s1 - s2             Fl. Pt. Sub (single)
fsubd d0,d1,d2    d0 = d1 - d2             Fl. Pt. Sub (double)
fmuls s0,s1,s2    s0 = s1 * s2             Fl. Pt. Mul (single)
fmuld d0,d1,d2    d0 = d1 * d2             Fl. Pt. Mul (double)
fdivs s0,s1,s2    s0 = s1 / s2             Fl. Pt. Div (single)
fdivd d0,d1,d2    d0 = d1 / d2             Fl. Pt. Div (double)
fcmps s0,s1       FPSCR flags = s0 - s1    Fl. Pt. Compare (single)
fcmpd d0,d1       FPSCR flags = d0 - d1    Fl. Pt. Compare (double)

Flags after a compare:
  Z = 1 if s0 = s1 (d0 = d1)
  N = 1 if s0 < s1 (d0 < d1)
  C = 1 if s0 = s1 (d0 = d1), s0 > s1 (d0 > d1), or unordered
  V = 1 if unordered
Unordered? Next slide

Special Numbers
What have we defined so far? (Single Precision)

Exponent   Significand   Object
0          0             0
0          nonzero       ???
1-254      anything      +/- fl. pt. #
255        0             +/- infinity
255        nonzero       ???

Professor Kahan had clever ideas; "Waste not, want not"

Representation for Not a Number
What do I get if I calculate sqrt(-4.0) or 0/0?
  If infinity is not an error, these shouldn't be either.
  Called Not a Number (NaN)
  Exponent = 255, Significand nonzero
Why is this useful?
  Hope NaNs help with debugging?
  They contaminate: op(NaN, X) = NaN
  OK if you calculate a NaN but don't use it
  fcmps s1, s2 produces an unordered result if either operand is a NaN

Special Numbers (cont'd)
What have we defined so far? (Single Precision)

Exponent   Significand   Object
0          0             0
0          nonzero       ???
1-254      anything      +/- fl. pt. #
255        0             +/- infinity
255        nonzero       NaN

Representation for Denorms (#1/2)
Problem: There's a gap among representable FP numbers around 0
  Significand = 0, Exp = 0 (which would otherwise mean $2^{-127}$) is reserved for 0
  Smallest representable positive num:
    - a = $1.0\ldots0_2 \times 2^{-126} = 2^{-126}$
  Second smallest representable positive num:
    - b = $1.0\ldots01_2 \times 2^{-126} = 2^{-126} + 2^{-149}$
  a - 0 = $2^{-126}$
  b - a = $2^{-149}$
[Figure: number line from 0 towards +infinity, showing the gap between 0 and a]

Representation for Denorms (#2/2)
Solution:
  We still haven't used Exponent = 0, Significand nonzero
  Denormalized number: no leading 1
  Smallest representable pos num:
    - a = $2^{-149}$
  Second smallest representable pos num:
    - b = $2^{-148}$
  Meaning: $(-1)^S \times (0 + \text{Significand}) \times 2^{-126}$
  Range: $2^{-149} \le X \le 2^{-126} - 2^{-149}$
[Figure: number line from 0 towards +infinity, with the gap now filled by denorms]

Special Numbers
What have we defined so far? (Single Precision)

Exponent   Significand   Object
0          0             0
0          nonzero       Denorm
1-254      anything      +/- fl. pt. #
255        0             +/- infinity
255        nonzero       NaN

Professor Kahan had clever ideas; "Waste not, want not"

Clever Idea and Hardware Implementation
Exponent = 0, Significand nonzero: Denorm

Very clever idea by Prof. Kahan
BUT such corner cases make the hardware design very complex
Good idea, but hard in practice! Even software emulation is not easy.
In the reference Fl. Pt. Emulator (a student mini project at http://dsl.ee.unsw.edu.au/unsw/projects/armvfp/README.html), 25% - 30% of the code is there to get the operations on denorms right
In most hardware implementations denorms are flushed to zero, or implemented in software via exceptions

Rounding
When we perform math on real numbers, we have to worry about rounding
The actual hardware for Floating Point Representation carries two extra bits of precision, and then rounds to get the proper value
Rounding also occurs when converting a double to a single precision value, or converting a floating point number to an integer

IEEE Rounding Modes
Round towards +infinity
  ALWAYS round up: 2.2001 -> 2.3, -2.3001 -> -2.3
Round towards -infinity
  ALWAYS round down: 1.9999 -> 1.9, -1.9999 -> -2.0
Truncate
  Just drop the last digits (round towards 0): 1.9999 -> 1.9, -1.9999 -> -1.9
Round to (nearest) even
  Normal rounding, almost

Round to Even
Round like you learned in high school
Except if the value is right on the borderline, in which case we round to the nearest EVEN number
  2.55 -> 2.6
  3.45 -> 3.4
Ensures fairness in calculation
  This way, half the time we round up on a tie, the other half of the time we round down
  Ask a statistics Prof.
This is the default rounding mode

Casting floats to ints and vice versa in C

(int) exp
  In C, a float to int type cast converts the value to an integer by truncation (rounds towards 0); the cast itself is not affected by the rounding mode
  i = (int) (3.14159 * f);
  ftosis (floating -> int): in ARM, rounds using the selected mode (default nearest)
  ftosizs (floating -> int): in ARM, rounds towards zero

(float) exp
  converts integer to the nearest floating point number
  f = f + (float) i;
  fsitos (int -> floating): in ARM

int -> float -> int
if (i == (int)((float) i)) { printf("true"); }
Will not always work
Large values of integers don't have exact floating point representations
Similarly, we may round to the wrong value

float -> int -> float
if (f == (float)((int) f)) { printf("true"); }
Will not always work
Small floating point values don't have good integer representations
Also rounding errors

Ints, Fractions and rounding in C
What do you get?

{ int x = 3/2; int y = 2/3; printf("x: %d, y: %d", x, y); }

How about?

int cela = (fahr - 32) * 5 / 9;
int celb = (5 / 9) * (fahr - 32);
float celc = (5.0 / 9.0) * (fahr - 32);

fahr = 60 => cela: 15, celb: 0, celc: 15.55556

Floating Point Fallacy
FP Add, subtract associative: FALSE!

$x + (y + z) = (x + y) + z$ ?

$x = -1.5 \times 10^{38}$, $y = 1.5 \times 10^{38}$, and $z = 1.0$
$x + (y + z) = -1.5 \times 10^{38} + (1.5 \times 10^{38} + 1.0) = -1.5 \times 10^{38} + (1.5 \times 10^{38}) = 0.0$
$(x + y) + z = (-1.5 \times 10^{38} + 1.5 \times 10^{38}) + 1.0 = (0.0) + 1.0 = 1.0$

Therefore, Floating Point add, subtract are not associative!

Why? FP result approximates real result!
In this example: $1.5 \times 10^{38}$ is so much larger than 1.0 that $1.5 \times 10^{38} + 1.0$ in floating point representation is still $1.5 \times 10^{38}$

Floating Point In the News!

July 1994: Intel discovers bug in Pentium
  Occasionally affects bits 12-52 of D.P. divide
  The bug was introduced when they optimised the divide unit to run much faster. They ignored some rare corner cases
Sept: Math Prof. discovers, puts on WWW
Nov: Front page trade paper, then NY Times
  Intel: "several dozen people that this would affect. So far, we've only heard from one."
  Intel claims customers see 1 error/27000 years for a random set of Fl. Pt. inputs. Does not explain why anybody would want to use Fl. Pt. numbers at random
  IBM claims 1 error/month, stops shipping
Dec: Intel apologizes, replaces chips: $300M

IEEE 754 Floating Point Issues
It is complex, involves lots of details
  We just scratched the surface
Checking for gradual underflow and treating denorms makes it much harder
Beyond Prof. Kahan very few really understand it!
It was finally approved as IEEE 754 in 1985, after 10 years of controversy
  Denorm was the most controversial aspect
  Visitors to the US were advised of the 3 most interesting places to visit: Las Vegas, the Grand Canyon and the IEEE committee rooms!

Reading Material
ARM Architecture Reference Manual, 2nd Ed, Addison-Wesley, 2001, ISBN: 0-201-73719-1, Part C, Vector Floating Point Architecture, chapters C1 - C5
Fl. Pt. Emulator: A student mini project at http://dsl.ee.unsw.edu.au/unsw/projects/armvfp/README.html
Steve Furber: ARM System On-Chip, 2nd Ed, Addison-Wesley, 2000, ISBN: 0-201-67519-6, chapter 6 (NOT up to date)

Example: Matrix with Fl Pt, Multiply, Add?
[Figure: matrices of 32 rows by 32 columns, indexed by row i and column j]

Example: Matrix with Fl Pt, Multiply, Add in C
void mm(double x[][32], double y[][32], double z[][32]) {
    int i, j, k;
    for (i = 0; i < 32; i = i + 1)
        for (j = 0; j < 32; j = j + 1)
            for (k = 0; k < 32; k = k + 1)
                x[i][j] = x[i][j] + y[i][k] * z[k][j];
}
Why pass in # of cols?
Starting addresses are parameters in a1, a2, and a3.
Integer variables are in v2, v3, v4.
Arrays are 32 x 32
Use fldd/fstd (load/store 64 bits)

Multidimensional Array Addressing
C stores multidimensional arrays in row-major order
  elements of a row are consecutive in memory (next element in row)
FORTRAN uses column-major order (next element in col)
What is the address of A[x][y]? (x = row # & y = col #)
Why pass in # of cols?

float A[3][4]

         col 0   col 1   col 2   col 3
row 0    A0,0    A0,1    A0,2    A0,3
row 1    A1,0    A1,1    A1,2    A1,3
row 2    A2,0    A2,1    A2,2    A2,3

Memory order, starting at the base address (offset 0):
A0,0 A0,1 A0,2 A0,3 A1,0 A1,1 A1,2 A1,3 A2,0 A2,1 A2,2 A2,3

Byte offset of A2,1 = (2 x 4 + 1) x 4 = 36

ARM code for first piece: initialize, x[][]
Initialize Loop Variables
mm:  ...
     stmfd sp!, {v1-v4}
     mov   v1, #32             ; v1 = 32
     mov   v2, #0              ; i = 0; 1st loop
L1:  mov   v3, #0              ; j = 0; reset 2nd
L2:  mov   v4, #0              ; k = 0; reset 3rd

To fetch x[i][j], skip i rows (i*32), add j
     add   a4, v3, v2, lsl #5  ; a4 = i*32 + j

Get byte address (8 bytes), load x[i][j]
     add   a4, a1, a4, lsl #3  ; a4 = a1 + a4*8 (i,j byte addr.)
     fldd  d0, [a4]            ; d0 = x[i][j]

ARM code for second piece: z[][], y[][]
Like before, but load y[i][k] into d1
L3:  add   ip, v4, v2, lsl #5  ; ip = i*32 + k
     add   ip, a2, ip, lsl #3  ; ip = a2 + ip*8 (i,k byte addr.)
     fldd  d1, [ip]            ; d1 = y[i][k]

Like before, but load z[k][j] into d2
     add   ip, v3, v4, lsl #5  ; ip = k*32 + j
     add   ip, a3, ip, lsl #3  ; ip = a3 + ip*8 (k,j byte addr.)
     fldd  d2, [ip]            ; d2 = z[k][j]

Summary: d0: x[i][j], d1: y[i][k], d2: z[k][j]

ARM code for last piece: add/mul, loops
Add y*z to x
     fmacd d0, d1, d2          ; x[][] = x + y*z

Increment k; if end of inner loop, store x
     add   v4, v4, #1          ; k = k + 1
     cmp   v4, v1              ; if (k<32) goto L3
     blt   L3
     fstd  d0, [a4]            ; x[i][j] = d0

Increment j; middle loop if not end of j
     add   v3, v3, #1          ; j = j + 1
     cmp   v3, v1              ; if (j<32) goto L2
     blt   L2

Increment i; if end of outer loop, return
     add   v2, v2, #1          ; i = i + 1
     cmp   v2, v1              ; if (i<32) goto L1
     blt   L1

ARM code for Return
Return
     ldmfd sp!, {v1-v4}
     mov   pc, lr

And in Conclusion...
Finite precision means we have to cope with round-off error (arithmetic with inexact values) and truncation error (large values overwhelming small ones).
In the NaN representation of Fl. Pt. numbers, Exponent = 255 and Significand is nonzero
In the Denorm representation of Fl. Pt. numbers, Exponent = 0 and Significand is nonzero
In the Denorm representation of Fl. Pt. numbers there is no hidden 1.
