0% found this document useful (0 votes)

26 views41 pages

Indexing

The document discusses various indexing techniques used in advanced database management systems to enhance record retrieval efficiency. It covers single-level ordered indexes, primary, clustering, and secondary indexes, as well as multilevel indexes utilizing B-trees and B+-trees for dynamic indexing. Additionally, it addresses issues related to storage, key constraints, and alternative storage methods like column-based storage.

Uploaded by

chamikalak2001

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

26 views41 pages

Indexing

Uploaded by

chamikalak2001

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

02.

INDEXING
ADVANCED DATABASE MANAGEMENT SYSTEMS
ICT 331-2
INTRODUCTION

 Indexes used to speed up record retrieval in response to

certain search conditions.
 Index structures provide secondary access paths.
 Any field can be used to create an index.
 Multiple indexes can be constructed
 Most indexes based on ordered files
 Tree data structures organize the index
TYPES OF SINGLE-LEVEL ORDERED INDEXES
 Ordered index similar to index in a textbook.
 Indexing field (attribute)
 Index stores each value of the index field with list of
pointers to all disk blocks that contain records with that
field value
 Values in index are ordered
 Primary index
 Specified on the ordering key field of ordered file of
records
TYPES OF SINGLE-LEVEL ORDERED INDEXES (CONT’D.)

 Clustering index
 Used if numerous records can have the same value for
the ordering field
 Secondary index
 Can be specified on any nonordering field
 Data file can have several secondary indexes
PRIMARY INDEXES
 Ordered file with two fields
 Primary key, K(i)
 Pointer to a disk block, P(i)
 One index entry in the index file for each block in the data
file
 Indexes may be dense or sparse
 Dense index has an index entry for every search key
value in the data file
 Sparse index has entries for only some search values
PRIMARY INDEXES (CONT’D.)

Primary index on the ordering key field of the file

PRIMARY INDEXES (CONT’D.)

 Major problem: insertion and deletion of records

 Move records around and change index values
 Solutions
 Use unordered overflow file
 Use linked list of overflow records
SOLUTIONS: USE UNORDERED OVERFLOW FILE

 Create an overflow file to store new records that cannot fit

into the main ordered file without disrupting its order.
 When an insertion occurs, the new record is placed in the
overflow file instead of adjusting the main file.
SOLUTION: LINKED LIST OF OVERFLOW

 Use a linked list structure to link overflow records to their

original block in the main file.
 Each block in the main file has a pointer to its
corresponding overflow records.
CLUSTERING INDEXES

 Clustering field
 File records are physically ordered on a nonkey field
without a distinct value for each record
 Structure of the Ordered File
 Same type as clustering field
 Disk block pointer
CLUSTERING INDEXES

 The clustering index has an entry for each distinct value of

the clustering field.
 There can be only one clustered index per table.
 Blocks of fixed size are reserved for each value of the
clustering field to avoid physical reordering during
insertion and deletion.
A clustering index on the
Dept_number ordering
nonkey field of an
EMPLOYEE file

To locate a record:
• Search for the clustering field
value (K(i)) in the Index File.
• Use the Block Pointer (P(i)) to
access the block in the Data
File.
• Search for the record within
the block.
SECONDARY INDEXES
 provides a secondary means of accessing a file for which
some primary access already exists.
 Ordered file with two fields
 Indexing field, K(i)
 Block pointer or record pointer, P(i)
 Usually need more storage space and longer search time
than primary index
 Improved search time for arbitrary record
Dense secondary index
(with block pointers) on a
SECONDARY INDEXES (CONT’D.)
nonordering key field of a
file.

To retrieve a record:
• Find the Index Field Value in
the Index File.
• Use the Block Pointer to
access the corresponding
block in the Data File.
• Search for the desired record
within the block using the
secondary key field.
TYPES OF SINGLE-LEVEL ORDERED INDEXES (CONT’D.)

Table 1 Types of indexes based on the properties of the indexing field

Table 2 Properties of index types

MULTILEVEL INDEXES

 Designed to greatly reduce remaining search space as

search is conducted
 Reduces the search space by the blocking factor (𝑏𝑓𝑟),
also called the fan-out ( ).
 represents the number of entries in a single block and is
larger than 2.
 Searching a multilevel index requires approximately block
accesses.
 Faster than binary search when 𝑓𝑜>2.
MULTILEVEL INDEXES

 Because a single-level index is an ordered file, we can

create a primary index to the index itself ; in this case, the
original index file is called the first-level index and the
index to the index is called the second-level index.
 We can repeat the process, creating a third, fourth, ..., top
level until all entries of the top level fit in one disk block
 A multi-level index can be created for any type of first-
level index (primary, secondary, clustering) as long as the
first-level index consists of more than one disk block
MULTILEVEL INDEXES

 Index file
 Considered first (or base level) of a multilevel index
 Second level
 Primary index to the first level
 Third level
 Primary index to the second level
A two-level primary index
resembling ISAM (indexed
sequential access method)
organization

[Link] the two-level index

structure in the diagram,
how would you locate the
record with a primary key
of 46?
If a new record with a primary key of 90 needs to be inserted, how would the two-level index structure be updated?

A two-level primary index

resembling ISAM (indexed
sequential access method)
organization

[Link] the two-level index

structure in the diagram,
how would you locate the
record with a primary key
of 46?
DYNAMIC MULTILEVEL INDEXES USING B-TREES AND B+ -
TREES
 Tree data structure terminology
 Tree is formed of nodes
 Each node (except root) has one parent and zero or
more child nodes
 Leaf node has no child nodes
 Unbalanced if leaf nodes occur at different levels
 Nonleaf node called internal node
 Subtree of node consists of node and all descendant
nodes
DYNAMIC MULTILEVEL INDEXES USING B-TREES AND B+ -
TREES
 Because of the insertion and deletion problem, most multi-
level indexes use B-tree or B+-tree data structures, which
leave space in each tree node (disk block) to allow for new
index entries
 These data structures are variations of search trees that
allow efficient insertion and deletion of new search values.
 In B-Tree and B+-Tree data structures, each node
corresponds to a disk block
 Each node is kept between half-full and completely full
TREE DATA STRUCTURE

A tree data structure that shows an unbalanced tree

SEARCH TREES AND B-TREES

 Search tree used to

guide search for a
record
 Given value of one
of record’s fields

A node in a search tree with pointers to subtrees below it

SEARCH TREES AND B-TREES (CONT’D.)

 Algorithms
necessary for
inserting and
deleting search
values into and
from the tree
A search tree of order p = 3
B-TREES

 Provide multi-level access structure

 Tree is always balanced
 Space wasted by deletion never becomes excessive
 Each node is at least half-full
 Each node in a B-tree of order p can have at most p-1
search values
B-TREE
B+ -TREES
 Data pointers stored only at the leaf nodes
 Leaf nodes have an entry for every value of the search
field, and a data pointer to the record if search field is a
key field
 For a nonkey search field, the pointer points to a block
containing pointers to the data file records
 Internal nodes
 Some search field values from the leaf nodes repeated
to guide search
B+ -TREES (CONT’D.)

The nodes of a B+-tree (a) Internal node of a B+-tree with q−1 search values (b)
Leaf node of a B+-tree with q−1 search values and q−1 data pointers
SEARCHING FOR A RECORD WITH SEARCH KEY FIELD VALUE
K, USING A B+ -TREE

Algorithm : Searching for a

record with search key field
value K, using a B+ -Tree
INDEXES ON MULTIPLE KEYS

 Multiple attributes involved in many retrieval and update

requests
 Composite keys
 Access structure using key value that combines
attributes
 Partitioned hashing
 Suitable for equality comparisons
INDEXES ON MULTIPLE KEYS (CONT’D.)

 Grid files
 Array with one
dimension for
each search
attribute

Example of a grid array on Dno and Age attributes

OTHER TYPES OF INDEXES
 Hash indexes
 Secondary structure for file access
 Uses hashing on a search key other than the one used
for the primary data file organization
 Index entries of form (K, Pr) or (K, P)
 Pr: pointer to the record containing the key
 P: pointer to the block containing the record for that
key
BITMAP INDEXES

 Used with a large number of rows

 Creates an index for one or more columns
 Each value or value range in the column is indexed
 Built on one particular value of a particular field
 Array of bits
 Existence bitmap
 Bitmaps for B+ -tree leaf nodes
FUNCTION-BASED INDEXING
 Value resulting from applying some function on a field (or
fields) becomes the index key
 Introduced in Oracle relational DBMS
 Example
 Function UPPER(Lname) returns uppercase
representation

 Query
SOME GENERAL ISSUES CONCERNING INDEXING

 Physical index
 Pointer specifies physical record address
 Disadvantage: pointer must be changed if record is moved
 Logical index
 Used when physical record addresses expected to change
frequently
 Entries of the form (K, Kp)
ADDITIONAL ISSUES RELATED TO STORAGE OF RELATIONS
AND INDEXES
 Enforcing a key constraint on an attribute
 Reject insertion if new record has same key attribute as
existing record
 Duplicates occur if index is created on a nonkey field
 Fully inverted file
 Has secondary index on every field
 Indexing hints in queries
 Suggestions used to expedite query execution
ADDITIONAL ISSUES RELATED TO STORAGE OF RELATIONS
AND INDEXES (CONT’D.)

 Column-based storage of relations

 Alternative to traditional way of storing relations by row
 Offers advantages for read-only queries
 Offers additional freedom in index creation
THANK YOU!

CH 17 Sum
No ratings yet
CH 17 Sum
9 pages
CH 3 Index
No ratings yet
CH 3 Index
40 pages
Final Updates - Lec 2
No ratings yet
Final Updates - Lec 2
40 pages
Co3 Session 21
No ratings yet
Co3 Session 21
53 pages
Indexing - II
No ratings yet
Indexing - II
57 pages
Indexing in Database
No ratings yet
Indexing in Database
33 pages
CO3-Session-09 & 10
No ratings yet
CO3-Session-09 & 10
41 pages
Indexing
No ratings yet
Indexing
53 pages
File Organization and Indexing
No ratings yet
File Organization and Indexing
38 pages
Indexing Structures For Files
No ratings yet
Indexing Structures For Files
25 pages
Chapter - 2 - Revision
No ratings yet
Chapter - 2 - Revision
26 pages
FALLSEM2024-25 BCSE302L TH VL2024250101553 2024-09-02 Reference-Material-I
No ratings yet
FALLSEM2024-25 BCSE302L TH VL2024250101553 2024-09-02 Reference-Material-I
48 pages
Memoryhierarchy Indexing
No ratings yet
Memoryhierarchy Indexing
9 pages
Chapter 3
No ratings yet
Chapter 3
50 pages
4 Chapter17 Index
No ratings yet
4 Chapter17 Index
41 pages
DBMS Unit5
No ratings yet
DBMS Unit5
40 pages
Comparing Indexing and Hashing in DBMS
No ratings yet
Comparing Indexing and Hashing in DBMS
47 pages
Types of Indexing Methods Explained
No ratings yet
Types of Indexing Methods Explained
60 pages
Indexing
No ratings yet
Indexing
62 pages
Indexing Structures For Files
No ratings yet
Indexing Structures For Files
30 pages
Indexing Lecture Nov 2023 Detailed
No ratings yet
Indexing Lecture Nov 2023 Detailed
37 pages
Index 1
No ratings yet
Index 1
25 pages
Module Iippt
No ratings yet
Module Iippt
27 pages
FALLSEM2019-20 ITE1003 ETH VL2019201002592 Reference Material I 06-Nov-2019 Indexing
No ratings yet
FALLSEM2019-20 ITE1003 ETH VL2019201002592 Reference Material I 06-Nov-2019 Indexing
32 pages
Unit Iv Indexing and Hashing: Basic Concepts
No ratings yet
Unit Iv Indexing and Hashing: Basic Concepts
35 pages
UNIT-5: Indexing and Hashing
No ratings yet
UNIT-5: Indexing and Hashing
78 pages
DBMS Indexing 5
No ratings yet
DBMS Indexing 5
63 pages
Database Indexing Essentials
No ratings yet
Database Indexing Essentials
110 pages
Database Management System-203105251: Assistant Professor Computer Science & Engineering
No ratings yet
Database Management System-203105251: Assistant Professor Computer Science & Engineering
35 pages
DBMS Indexing Methods
No ratings yet
DBMS Indexing Methods
33 pages
Understanding Index Algorithms
No ratings yet
Understanding Index Algorithms
27 pages
DBMS Storage and Indexing
No ratings yet
DBMS Storage and Indexing
80 pages
Indexing Structures For Files: Database Design Database Design
No ratings yet
Indexing Structures For Files: Database Design Database Design
9 pages
Indexing Hashing Files
No ratings yet
Indexing Hashing Files
68 pages
Unit Iv
No ratings yet
Unit Iv
29 pages
7-Indexing and Block
No ratings yet
7-Indexing and Block
20 pages
Chapter 3 File Organization Indexed Methods
No ratings yet
Chapter 3 File Organization Indexed Methods
31 pages
CH 14
No ratings yet
CH 14
6 pages
Database Indexing Techniques
No ratings yet
Database Indexing Techniques
50 pages
Lec 09
No ratings yet
Lec 09
52 pages
Indexing in DBMS
No ratings yet
Indexing in DBMS
12 pages
26 - Databse Indexes
No ratings yet
26 - Databse Indexes
48 pages
03 UW Indexing
No ratings yet
03 UW Indexing
97 pages
DINLect 1
No ratings yet
DINLect 1
69 pages
Lecture 5 Trees
No ratings yet
Lecture 5 Trees
47 pages
Week 15 Physical Database Design Index - CH 17 Updated
No ratings yet
Week 15 Physical Database Design Index - CH 17 Updated
35 pages
L4 Indexing
No ratings yet
L4 Indexing
56 pages
CH 12 Updated
No ratings yet
CH 12 Updated
55 pages
B-Tree Indexing for Data Retrieval
No ratings yet
B-Tree Indexing for Data Retrieval
208 pages
CS2202 IndexingHashing
No ratings yet
CS2202 IndexingHashing
83 pages
Indexing Structures For Files
No ratings yet
Indexing Structures For Files
23 pages
Index and Hashing
No ratings yet
Index and Hashing
82 pages
Unit-6 Storage Strategies
No ratings yet
Unit-6 Storage Strategies
43 pages
Indexing - DBMS
No ratings yet
Indexing - DBMS
20 pages
File Organizations and Indexes
No ratings yet
File Organizations and Indexes
51 pages
Chapter 5. Record Storage and Primary File Organization
No ratings yet
Chapter 5. Record Storage and Primary File Organization
18 pages
Indexing Structures & Database Design
No ratings yet
Indexing Structures & Database Design
39 pages
IN3020/4020 - Database Systems Spring 2020, Week 3.1 Indexing
No ratings yet
IN3020/4020 - Database Systems Spring 2020, Week 3.1 Indexing
44 pages
Thompson Industrial Communications Fifth Edition
100% (1)
Thompson Industrial Communications Fifth Edition
46 pages
Amazing Structure Originated From Simple Design: About ZWCAD Architecture
No ratings yet
Amazing Structure Originated From Simple Design: About ZWCAD Architecture
2 pages
Comprehensive List of TLDs
No ratings yet
Comprehensive List of TLDs
19 pages
Allplan 2015 ArchitectureTutl
No ratings yet
Allplan 2015 ArchitectureTutl
499 pages
Research On Effects of Online Games On Students Academic Performance 2021
100% (1)
Research On Effects of Online Games On Students Academic Performance 2021
11 pages
05 - SPPA T2000 As 620, SIM and I-O Function Blocks
No ratings yet
05 - SPPA T2000 As 620, SIM and I-O Function Blocks
719 pages
Oracle Unified Method
No ratings yet
Oracle Unified Method
5 pages
US Army Corps of Engineers - Guidance For Evaluating Performance Based Chemical Data PDF
No ratings yet
US Army Corps of Engineers - Guidance For Evaluating Performance Based Chemical Data PDF
129 pages
Compiler Phases Explained
No ratings yet
Compiler Phases Explained
24 pages
74 Email Can Student-1-9 Student
No ratings yet
74 Email Can Student-1-9 Student
9 pages
Substation Bus Schemes Explained
No ratings yet
Substation Bus Schemes Explained
6 pages
IoT-Based Air Pollution Monitoring Report
No ratings yet
IoT-Based Air Pollution Monitoring Report
76 pages
Fittings ASME BPE PDF
No ratings yet
Fittings ASME BPE PDF
1 page
Booklet - Present Perfect - MAGDA
No ratings yet
Booklet - Present Perfect - MAGDA
11 pages
Banking Law and Management Exam Guide
No ratings yet
Banking Law and Management Exam Guide
22 pages
CCS - 336 - Cloud Services Management
No ratings yet
CCS - 336 - Cloud Services Management
118 pages
Naive Bayes for High School Specialization
No ratings yet
Naive Bayes for High School Specialization
8 pages
AMT49406 Datasheet
No ratings yet
AMT49406 Datasheet
19 pages
Bridge-PG v3.4
No ratings yet
Bridge-PG v3.4
59 pages
UNIT 5 - Data Science - III BSC CS
No ratings yet
UNIT 5 - Data Science - III BSC CS
16 pages
Multimedia Networks for Students
No ratings yet
Multimedia Networks for Students
66 pages
MAP08
No ratings yet
MAP08
26 pages
Manifest NonUFSFiles Win64
No ratings yet
Manifest NonUFSFiles Win64
116 pages
Privileged Account Management Guide
No ratings yet
Privileged Account Management Guide
9 pages
Microsoft Word Document Tasks Guide
No ratings yet
Microsoft Word Document Tasks Guide
3 pages
DS4 Elevator Manual
No ratings yet
DS4 Elevator Manual
49 pages
How To Write A Spreadsheet For Calculating Age
No ratings yet
How To Write A Spreadsheet For Calculating Age
2 pages
Evad 008
No ratings yet
Evad 008
20 pages
Proposal - Philsunrise Maritime Inc
No ratings yet
Proposal - Philsunrise Maritime Inc
2 pages
Fortiweb v7.4.2 Release Notes
No ratings yet
Fortiweb v7.4.2 Release Notes
22 pages

Indexing

Uploaded by

Indexing

Uploaded by

02.

 Indexes used to speed up record retrieval in response to

Primary index on the ordering key field of the file

 Major problem: insertion and deletion of records

 Create an overflow file to store new records that cannot fit

 Use a linked list structure to link overflow records to their

 The clustering index has an entry for each distinct value of

Table 1 Types of indexes based on the properties of the indexing field

Table 2 Properties of index types

 Designed to greatly reduce remaining search space as

 Because a single-level index is an ordered file, we can

[Link] the two-level index

A two-level primary index

[Link] the two-level index

A tree data structure that shows an unbalanced tree

 Search tree used to

A node in a search tree with pointers to subtrees below it

 Provide multi-level access structure

Algorithm : Searching for a

 Multiple attributes involved in many retrieval and update

Example of a grid array on Dno and Age attributes

 Used with a large number of rows

 Column-based storage of relations

You might also like