0% found this document useful (0 votes)

88 views45 pages

Computer Science Data Types Guide

The document discusses different data types including primitive, structured, and user-defined types. It covers numeric, boolean, character, string, enumerated, array, and record data types. For each type, it describes the type's definition, allowed values, common operations, and hardware implementations. It provides examples of how different programming languages define and work with each data type.

Uploaded by

Dadang Kadarusman

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

88 views45 pages

Computer Science Data Types Guide

Uploaded by

Dadang Kadarusman

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 45

Data types

Outline
• Primitive data types
• Structured data types
• Strings
• Enumerated types
• Arrays
• Records
• Pointers

CSI 3125, Data Types, page 1

Primitive data types

Points

• Numeric types

• Booleans

• Characters

CSI 3125, Data Types, page 2

Data types: introduction

A data type is not just a set of objects.

We must consider all operations on
these objects. A complete definition of
a type must include a list of operations,
and their definitions.

Primitive data objects are close to

hardware, and are represented directly
(or almost directly) at the machine
level—usually word, byte, bit.

CSI 3125, Data Types, page 3

Integer types

An integer type is a finite approximation

of the infinite set of integer numbers:
{0, 1, -1, 2, -2, ...}.

Various kinds of integers:

signed—unsigned, long—short—small.

Hardware implementations of integers:

one's complement, two's complement, ...

CSI 3125, Data Types, page 4

Floating-point types

All "real" numbers in computers are finite

approximations of the non-denumerable
set of real numbers.
Precision and range of values are defined
by the language or by the programmer.
Hardware implementations (used by
floating-point processors): exponent and
mantissa.

CSI 3125, Data Types, page 5

Boolean type

This type is not supported by all languages

(for example, it is not available in PL/, C,
Perl).

The values are true and false. Operations

are as in classical two-valued propositional
logic.

Hardware implementation: a single bit or a

byte (this allows more efficient operations).

CSI 3125, Data Types, page 6

Character types

This is usually ASCII, but extended character

(Unicode, ISO) sets are often used.
Accented characters é à ü etc. should fit within
ASCII, though there is no single standard.
Chinese or Japanese are examples of writing
systems that require character sets of many
more than 256 elements.
Hardware implementation: a byte (ASCII,
EBCDIC), two bytes (Unicode) or several bytes.
CSI 3125, Data Types, page 7
Other primitive types

• word (for example, in Modula-2)

• byte, bit (for example, in PL/I)

• pointer (for example, in C)

CSI 3125, Data Types, page 8

Structured data types

Points
• Strings
• Enumerated types
• Arrays
• Records

CSI 3125, Data Types, page 9

Strings

A string is a sequence of characters. It may be:

• a special data type (its objects can be
decomposed into characters)—Fortran, Basic;
• an array of characters—Pascal, Ada;
• a list of characters—Prolog;
• consecutively stored characters—C.
The syntax is the same: characters in quotes.
Pascal has one kind of quotes, Ada has two:
'A' is a character, "A" is a string.

CSI 3125, Data Types, page 10

String operations

Typical operations on strings

string  string  string
concatenation
string  int  int  string
substring
string  characters
decompose into an array or list
characters  string
convert an array or list into a string

CSI 3125, Data Types, page 11

String operations (2)

string  integer
length

string  boolean
is it empty?

string  string  boolean

equality, ordering

CSI 3125, Data Types, page 12

More string operations…

Specialized string manipulation languages

(Snobol, Icon) include built-in pattern
matching, sometimes very complicated,
with extremely elaborate backtracking.

Another language that works very well

with strings is, of course, Perl.

CSI 3125, Data Types, page 13

Fixed- and variable-length strings
The allowed length of strings is a design
issue:
fixed-length strings—Pascal, Ada, Fortran;
variable-length strings—C, Java, Perl.
A character may be treated as a string of
length 1, or as a separate data structure.
Many languages (Pascal, Ada, C, Prolog)
treat strings as special cases of arrays or
lists.
Operations on strings are the same as on arrays or
lists of other types. For example, every character in
a character array is available at once, whereas a list
of characters must be searched linearly.

CSI 3125, Data Types, page 14

Enumerated types

Also called: user-defined ordinal types

[read Section 6.4]
We can declare a list of symbolic constants that are
to be treated literally, just like in Prolog or Scheme.
We also specify the implicit ordering of those newly
introduced symbolic constants. In Pascal:
type day =
(mo, tu, we, th, fr, sa, su);
Here, we have mo < tu < we < th < fr < sa < su.

CSI 3125, Data Types, page 15

Operators for enumerated types

Pascal supplies the programmer with three generic

operations for every new enumerated type T:
succ: successor, for example, succ(tu) = we
pred: predecessor, for example, pred(su) = sa
(each is undefined at one end)
ord: (ordinal) position in the type, starting at 0,so for
example ord(th) = 3
For characters, Pascal also has chr, producing the
character at a given position, so for example
chr(65) returns ‘A’.

CSI 3125, Data Types, page 16

Enumerated types in Ada

Ada makes these generic operations complete:

succ: successor,
pred: predecessor,
pos: position,
val: constant at position.

Ada also supplies type attributes, among them

FIRST and LAST:

day'FIRST = mo, day'LAST = su

CSI 3125, Data Types, page 17

Reuse of symbolic constants
A design issue: is the symbolic constant allowed in
more than one type? In Pascal, no. In Ada, yes:
type stoplight is
(red, amber, green);
type rainbow is
(violet, indigo, blue, green,
yellow, orange, red);
Qualified descriptions—similar to type casts—prevent
any confusion: we can write stoplight'(red) or
rainbow'(red).
CSI 3125, Data Types, page 18
Implementation of enumerated types

Map the constants c1, ..., ck into

small integers 0, ..., k-1.

Enumerated types help increase clarity

and readability of programs by separating
concepts from their numeric codes.

CSI 3125, Data Types, page 19

Arrays
An array represents a mapping:
index_type  component_type

The index type must be a discrete type (integer,

character, enumeration etc). In some languages this
type is specified implicitly:
an array of size N is indexed 0…N-1 in C++ / Java /
Perl, but in Fortran it is 1…N. In Algol, Pascal, Ada
the lower and upper bound must be both given.

There are normally few restrictions on the component

type (in some languages we can even have arrays of
procedures or files).

CSI 3125, Data Types, page 20

Multidimensional arrays

Multidimensional arrays can be defined in two

ways (for simplicity, we show only dimension 2):
index_type1  index_type2  component_type
This corresponds to references such as A[I,J].
Algol, Pascal, Ada work like this.
index_type1 (index_type2  component_type)
This corresponds to references such as A[I][J].
Java works like this.
Perl sticks to one dimension

CSI 3125, Data Types, page 21

Operations on arrays (1)

select an element (get or change its value):

A[J]

select a slice of an array:

(read the textbook, Section 6.5.7)

assign a complete array to a complete array:

A := B;

There is an implicit loop here.

CSI 3125, Data Types, page 22

Operations on arrays (2)

Compute an expression with complete arrays

(this is possible in extendible or specialized
languages, for example in Ada):
V := W + U;

If V, W, U are arrays, this may denote array

addition. All three arrays must be compatible
(the same index and component type), and
addition is probably carried out element by
element.

CSI 3125, Data Types, page 23

Subscript binding

static: fixed size, static allocation

this is done in older Fortran.
semistatic: fixed size, dynamic allocation
Pascal.
semidynamic: size determined at run time,
dynamic allocation
Ada
dynamic: size fluctuates during execution,
flexible allocation required
Algol 68, APL—both little used...

CSI 3125, Data Types, page 24

Array-type constants and initialization
Many languages allow initialization of arrays to be
specified together with declarations:
C int vector [] = {10,20,30};
Ada vector: array(0..2)
of integer := (10,20,30);
Array constants in Ada
temp is array(mo..su)of -40..40;
T: temp;
T := (15,12,18,22,22,30,22);
T := (mo=>15, we=>18, tu=>12,
sa=>30, others=>22);
T := (15,12,18, sa=>30, others=>22);

CSI 3125, Data Types, page 25

Implementing arrays (1)

The only issue is how to store arrays and access their

elements—operations on the component type decide
how the elements are manipulated.

An array is represented during execution by an array

descriptor. It tells us about:
the index type,
the component type,
the address of the array, that is, the data.

CSI 3125, Data Types, page 26

Implementing arrays (2)

Specifically, we need:
the lower and upper bound (for subscript checking),
the base address of the array,
the size of an element.

We also need the subscript—it gives us the offset (from

the base) in the memory area allocated to the array.

A multi-dimensional array will be represented by a

descriptor with more lower-upper bound pairs.

CSI 3125, Data Types, page 27

Implementing multidimensional arrays

11 12 13 14 15
21 22 23 24 25
31 32 33 34 35

Row major order (second subscript increases faster)

11 12 13 14 15 21 22 23 24 25 31 32 33 34 35

Column major order (first subscript increases faster)

11 21 31 12 22 32 13 23 33 14 24 34 15 25 35

CSI 3125, Data Types, page 28

Implementing multidimensional arrays (2)

Suppose that we have this array:

A: array [LOW1..HIGH1,
LOW2..HIGH2] of ELT;
where the size of each entity of type ELT is
SIZE.
This calculation is done for row-major
(calculations for column-major are quite
similar). We need the base—for example,
the address LOC of A[LOW1, LOW2].

CSI 3125, Data Types, page 29

Implementing multidimensional arrays (3)

We can calculate the address of A[I,J] in the

row-major order, given the base.
Let the length of each row in the array be:
ROWLENGTH = HIGH2 - LOW2 + 1
The address of A[I,J] is:
(I - LOW1) * ROWLENGTH * SIZE +
(J - LOW2) * SIZE + LOC

CSI 3125, Data Types, page 30

Implementing multidimensional arrays (4)
Here is an example.
VEC: array [1..10, 5..24] of integer;
The length of each row in the array is:
ROWLENGTH = 24 - 5 + 1 = 20
Let the base address be 1000, and let the size of
an integer be 4.
The address of VEC[i,j] is:
(i - 1) * 20 * 4 + (j - 5) * 4 + 1000
For example, VEC[7,16] is located in 4 bytes at
1524 = (7 - 1) * 20 * 4 +
(16 - 5) * 4 + 1000

CSI 3125, Data Types, page 31

Languages without arrays

A final word on arrays: they are not supported

by standard Prolog and pure Scheme. An array
can be simulated by a list, which is the basic
data structure in Scheme and a very important
data structure in Prolog.
Assume that the index type is always 1..N.
Treat a list of N elements:
[x1, x2, ..., xN] (Prolog)
(x1 x2 ... xN) (Scheme)
as the (structured) value of an array

CSI 3125, Data Types, page 32

Records

A record is a heterogeneous collection of fields

(components)—this differs from homogenous
arrays.

Records are supported by a majority of

important languages, beginning with Cobol,
through Pascal, PL/I, Ada, C (where they are
called structures), Prolog (!) to C++.

There are no records in Java, but classes

replace them. There are no records in Perl.

CSI 3125, Data Types, page 33

Ada records—syntax
type date is record
day: 1..31;
month: 1..12;
year: 1000..9999;
end record;

type person is record

name: record
fname: string(1..20);
lname: string(1..20);
end record;
born: date;
gender: (F, M);
end record;

CSI 3125, Data Types, page 34

Fields

A field is distinguished by a name rather than an

index. Iteration on elements of an array is natural
and very useful, but iteration on fields of a record is
not possible (why?).

A field is indicated by a qualified name. In Ada:

X, Y: person;
X.born.day := 15;
X.born.month := 11;
X.born.year := 1964;
Y.born := (23, 9, 1949);
Y.name.fname(1..8) := "Smithson";

CSI 3125, Data Types, page 35

Operations on records (1)
Selection of a component is done by field name.
Construction of a record from components—either from
separate fields, or as a complete record in a structured
constant.
D := (month => 10, day => 15, year => 1994);
D := (day => 15, month => 10, year => 1994);
D := (15, 10, 1994);
D := (15, 10, year => 1994);
Note that an array can also be assigned such a constant.
Interpretation depends on context.
A: array(1..3)of integer;
A := (15, 10, 1994);

CSI 3125, Data Types, page 36

Operations on records (2)

Assignment of complete records is allowed in

Ada, and is done field by field.

Records can be compared for equality or

inequality, regardless of their structure or type
of components. No generic standard ordering
of records exists, but specific ordering can be
defined by the programmer.

CSI 3125, Data Types, page 37

More on records
Ada allows default values for fields:
type date is record
day: 1..31; month: 1..12;
year: 1000..9999 := 2002;
end record;
D: date; -- D.year is now 2002
There are almost no restrictions on field types.
Any combination of records and arrays (any
depth) is usually possible. A field could also be
a file or even a procedure!
CSI 3125, Data Types, page 38
The Prolog equivalent of records
Records—or rather terms—in Prolog can carry their type and
their components around:
date(day(15), month(10), year(1994))
person(
name(fname("Jim"), lname("Berry")),
born(date(day(15), month(10), year(1994))),
gender(male)
)
If we can assure correct use, this can be simplified by dropping
one-argument "type" functors:
date(15, 10, 1994)
person(name("Jim", "Berry"),
born(date(15, 10, 1994)),
male)
CSI 3125, Data Types, page 39
Back to pointers
[Note: We’re skipping 6.9.9]

A pointer variable has addresses as values (and a

special address nil or null for "no value"). They are used
primarily to build structures with unpredictable shapes
and sizes—lists, trees, graphs—from small fragments
allocated dynamically at run time.

A pointer to a procedure is possible, but normally we

have pointers to data (simple and composite). An
address, a value and usually a type of a data item
together make up a variable. We call it an anonymous
variable: no name is bound to it. Its value is accessed
by dereferencing the pointer.

CSI 3125, Data Types, page 40

Back to pointers (2)

Pointers in Pascal are quite well designed.

p  value(p) = 

 17 value(p^) = 17

Note that, as with normal named variables, in this:

p^ := 23;
we mean the address of p^ (the value of p).
In this:
m := p^;
we mean the value of p^.

CSI 3125, Data Types, page 41

Pointer variable creation

A pointer variable is declared explicitly and has the

scope and lifetime as usual.
An anonymous variable has no scope (because it has
no name) and its lifetime is determined by the
programmer. It is created (in a special memory area
called heap) by the programmer, for example:
new(p); in Pascal
p = malloc(4); in C
and destroyed by the programmer:
dispose(p); in Pascal
free(p); in C

CSI 3125, Data Types, page 42

Pointer variable creation (2)

If an anonymous variable exists outside

the scope of the explicit pointer variable,
we have "garbage" (a lost object). If an
anonymous variable has been destroyed
inside the scope of the explicit pointer
variable, we have a dangling reference.
new(p);
p^ := 23;
dispose(p);
......
if p^ > 0 {???}

CSI 3125, Data Types, page 43

Pointer variable creation (2)

Producing garbage, an example in Pascal:

new(p); p^ := 23; new(p);
{the anonymous variable with the value 23
becomes inaccessible}
Garbage collection is the process of
reclaiming inaccessible storage. It is usually
complex and costly. It is essential in
languages whose implementation relies on
pointers: Lisp, Prolog.

CSI 3125, Data Types, page 44

Pointers: types and operators

Pointers in PL/I are typeless. In Pascal, Ada,

C they are declared as pointers to types, so
that a dereferenced pointer (p^, *p) has a
fixed type.
Operations on pointers in C are quite rich:
char b, c;
c = '\007';
b = *((&c - 1) + 1);
putchar(b);

CSI 3125, Data Types, page 45

Unit III
No ratings yet
Unit III
65 pages
ch6 Short
No ratings yet
ch6 Short
59 pages
Software II: Principles of Programming Languages: Some Basic Definitions
No ratings yet
Software II: Principles of Programming Languages: Some Basic Definitions
55 pages
Chapter 4 - Data Types
No ratings yet
Chapter 4 - Data Types
30 pages
Programming Data Types Guide
No ratings yet
Programming Data Types Guide
46 pages
PL 10 CH 6
No ratings yet
PL 10 CH 6
59 pages
Lecture 5
No ratings yet
Lecture 5
68 pages
Chapter-6 Data Types
No ratings yet
Chapter-6 Data Types
37 pages
Programming Languages Overview
No ratings yet
Programming Languages Overview
80 pages
UNIT-II - Structuring The Data, Computations and Program
No ratings yet
UNIT-II - Structuring The Data, Computations and Program
105 pages
Design Issues in Character Strings
No ratings yet
Design Issues in Character Strings
60 pages
Programming Language Data Types
100% (1)
Programming Language Data Types
154 pages
Unit-2 PPL Datatypes
67% (3)
Unit-2 PPL Datatypes
89 pages
Programming Data Types Overview
No ratings yet
Programming Data Types Overview
36 pages
Module 2 (Data Types)
No ratings yet
Module 2 (Data Types)
97 pages
PL 10 CH 6
No ratings yet
PL 10 CH 6
92 pages
Data Types Explained for Programmers
No ratings yet
Data Types Explained for Programmers
3 pages
PPL Second Chapter Notes
No ratings yet
PPL Second Chapter Notes
12 pages
Chapter 06 Data Types
No ratings yet
Chapter 06 Data Types
32 pages
Lecture 03
No ratings yet
Lecture 03
44 pages
Datatypes
No ratings yet
Datatypes
63 pages
Data Types and Type Casting Explained
No ratings yet
Data Types and Type Casting Explained
31 pages
Chapter 2 CS331
No ratings yet
Chapter 2 CS331
12 pages
DataTypes
No ratings yet
DataTypes
27 pages
Overview of Programming Data Types
No ratings yet
Overview of Programming Data Types
29 pages
PPL-Unit 2
No ratings yet
PPL-Unit 2
45 pages
Data Types
No ratings yet
Data Types
12 pages
Comprehensive Guide to Data Types
No ratings yet
Comprehensive Guide to Data Types
81 pages
Data Types: ISBN 0-321-49362-1
No ratings yet
Data Types: ISBN 0-321-49362-1
84 pages
Computer Science - Engineering - Principles of Programming Languages - Specification Implementation - Notes
No ratings yet
Computer Science - Engineering - Principles of Programming Languages - Specification Implementation - Notes
64 pages
Chapter 5 - Data Types
No ratings yet
Chapter 5 - Data Types
82 pages
Unit 2 - Principles of Programming Languages - WWW - Rgpvnotes.in
No ratings yet
Unit 2 - Principles of Programming Languages - WWW - Rgpvnotes.in
14 pages
Data Types Handout1
100% (1)
Data Types Handout1
75 pages
Data Types
No ratings yet
Data Types
103 pages
Unit 2 PPL
No ratings yet
Unit 2 PPL
56 pages
Design Issues For All Data Types
No ratings yet
Design Issues For All Data Types
27 pages
CH 6
No ratings yet
CH 6
88 pages
Data Structure and Algorithm Original Note
No ratings yet
Data Structure and Algorithm Original Note
116 pages
M6 Guide
No ratings yet
M6 Guide
10 pages
Types
No ratings yet
Types
62 pages
Slides 09 Programming Languages - UET CS - Talha Waheed - Data Types
No ratings yet
Slides 09 Programming Languages - UET CS - Talha Waheed - Data Types
50 pages
Lecture 4 - Programming With VB
No ratings yet
Lecture 4 - Programming With VB
8 pages
02 - Unit 2. Data Types
No ratings yet
02 - Unit 2. Data Types
64 pages
ch6 1-Datatypes
No ratings yet
ch6 1-Datatypes
84 pages
Chapter 4 - Data Types
No ratings yet
Chapter 4 - Data Types
24 pages
PPL-Unit 2 Part 3
No ratings yet
PPL-Unit 2 Part 3
12 pages
Mis387 3
No ratings yet
Mis387 3
8 pages
UNIT-2 Short Answers
No ratings yet
UNIT-2 Short Answers
12 pages
Algorithm Data Type and Data Structure Secured
No ratings yet
Algorithm Data Type and Data Structure Secured
17 pages
PPL (Unit2 Data Types)
75% (4)
PPL (Unit2 Data Types)
43 pages
Principles of Programming Languages: UNIT II - Intro To Programming Concepts Lecture 7 - Data Types
No ratings yet
Principles of Programming Languages: UNIT II - Intro To Programming Concepts Lecture 7 - Data Types
92 pages
Chap 2
No ratings yet
Chap 2
48 pages
02 - Unit 2. Data Types
No ratings yet
02 - Unit 2. Data Types
60 pages
Bca Unit 1 Data Structures
No ratings yet
Bca Unit 1 Data Structures
35 pages
Java Programing
No ratings yet
Java Programing
144 pages
Data Type and Data Structure
100% (1)
Data Type and Data Structure
16 pages
Data Structures in Java Programming
100% (1)
Data Structures in Java Programming
27 pages
PPL-Unit 3
No ratings yet
PPL-Unit 3
28 pages
Rconsole1 Quick Guide Final
No ratings yet
Rconsole1 Quick Guide Final
28 pages
Systems Analysis & Design Assignment
No ratings yet
Systems Analysis & Design Assignment
19 pages
Ieee Python Ai ML Projects - 2023-24
No ratings yet
Ieee Python Ai ML Projects - 2023-24
14 pages
Exploring Microsoft Office Powerpoint 2007: Enhancing With Illustrations
No ratings yet
Exploring Microsoft Office Powerpoint 2007: Enhancing With Illustrations
39 pages
A Beginner's Guide To L TEX
No ratings yet
A Beginner's Guide To L TEX
7 pages
24P01273 Shreenidhi C
No ratings yet
24P01273 Shreenidhi C
12 pages
Generative AI Tutorial For Beginners Course Content
No ratings yet
Generative AI Tutorial For Beginners Course Content
2 pages
ATM Skimmer & POS - FULL TUT FOR NEW USERS 2022
0% (1)
ATM Skimmer & POS - FULL TUT FOR NEW USERS 2022
3 pages
Alv Output To Excel
No ratings yet
Alv Output To Excel
12 pages
Tax
No ratings yet
Tax
48 pages
Computer Knowledge - Basic General Computer Awareness
No ratings yet
Computer Knowledge - Basic General Computer Awareness
8 pages
MARS IDE Tutorial for Students
No ratings yet
MARS IDE Tutorial for Students
20 pages
ATRG ClusterXL R6x R7x R8x (09-June-2020)
No ratings yet
ATRG ClusterXL R6x R7x R8x (09-June-2020)
168 pages
Asrock A Series DVR User's Installation and Operation Manual
No ratings yet
Asrock A Series DVR User's Installation and Operation Manual
73 pages
Acer Aspire 3750ZG 1 Pegatron EIH31 - Rev1.3
No ratings yet
Acer Aspire 3750ZG 1 Pegatron EIH31 - Rev1.3
89 pages
AVR Architecture & Programming
No ratings yet
AVR Architecture & Programming
8 pages
Theatro Headset - Communicator 3 User Manual by Theatro Labs
No ratings yet
Theatro Headset - Communicator 3 User Manual by Theatro Labs
25 pages
Cisco Certification Path Poster
No ratings yet
Cisco Certification Path Poster
1 page
LIC AAO Computer Knowledge Question Paper 2016
No ratings yet
LIC AAO Computer Knowledge Question Paper 2016
2 pages
Data Structure Mid Exam, Summer 2020
No ratings yet
Data Structure Mid Exam, Summer 2020
1 page
Network Engineering Resume Example
100% (1)
Network Engineering Resume Example
24 pages
E-Commerce and Cybercrime Study
No ratings yet
E-Commerce and Cybercrime Study
11 pages
Enriching Learning Ict Notes
No ratings yet
Enriching Learning Ict Notes
8 pages
The PR4G VHF ECCM System Extensive Tactical Communications For The Battlefield
0% (2)
The PR4G VHF ECCM System Extensive Tactical Communications For The Battlefield
5 pages
Control-M Installation Guide 6.1.03 PDF
No ratings yet
Control-M Installation Guide 6.1.03 PDF
418 pages
CMSC 412 Midterm #1 (Fall 2011) - Solutions
No ratings yet
CMSC 412 Midterm #1 (Fall 2011) - Solutions
4 pages
Windows 10 System Report
No ratings yet
Windows 10 System Report
36 pages
Mock Calls or Chat Scenarios For CET and PST
No ratings yet
Mock Calls or Chat Scenarios For CET and PST
75 pages
Buckrose J E - The House With The Golden Windows
No ratings yet
Buckrose J E - The House With The Golden Windows
297 pages
Sample SoW
No ratings yet
Sample SoW
6 pages