0% found this document useful (0 votes)

23 views9 pages

Database Modeling - Notes-V

Uploaded by

ishtiaq.hussain

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

23 views9 pages

Database Modeling - Notes-V

Uploaded by

ishtiaq.hussain

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 9

Batch processing of k sequentially stored records

read the transaction file:

lra = k where k = number of transaction records
sba = ceil(k/tfbf) where tfbf is the transaction file blocking factor
read the master file:
lra = n
sba = ceil(n/bf) where bf is the master file blocking factor
write a new master file:
lra = n + adds - deletes
sba = ceil((n+adds-deletes)/bf)
where adds is the number of records added or inserted,
and deletes is the number of records deleted.

1
Random Access Methods
Hashing
Basic mechanism – transformation of a primary key directly to a physical address,
called a bucket (or indirectly via a logical address)

Collisions – handled by variations of chained overflow techniques

random access to a hashed file

lra = 1 + overflow(avg)
rba = 1 + overflow(avg)
insertion into a hashed file
lra = 1 + overflow(avg) + rewrite
rba = 1 + overflow(avg)
rba=1 for the rewrite

2
Extendible Hashing
* number of buckets grow or contracts
* bucket splits when it becomes full
* collisions are resolved immediately, no long overflow chains
* primary key transformed to an entry in the Bucket Address Table
(BAT), typically in RAM
* BAT has pointers to disk buckets that hold the actual data
* Retrieve a single record = 1 rba (access the bucket in one step)
* Cost (service time) of I/O for updates, inserts, and deletes is the same as for B+-trees

3
B-trees and B+-trees
B-tree index basic characteristics
* each node contains p pointers and p-1 records
* each pointer at level i is for a data and pointer block at level i+1
* i=1 denotes the root level (single node or block)
* can be inefficient for searching because of the overhead in each search level

4
B+-tree index basic characteristics
* eliminates data pointers from all nodes except the leaf nodes
* each non-leaf index node has p pointers and p-1 key values
* each pointer at level i is for an index block (of key/pointer pairs) at level i+1
* each leaf index has a key value/pointer pair to point to the actual data
block (and record) containing that primary key value
* leaf index nodes can be logically connected via pointers for ordered sequence search
* hybrid method for efficient random access and sequential search

Example: B + -tree
To determine the order of a B+-tree, let us assume that the database has 500,000
records of 200 bytes each, the search key is 15 bytes, the tree and data pointers are
5 bytes, and the index node (and data block size) is 1024 bytes. For this
configuration we have non-leaf index node size = 1024 bytes = p*5 + (p-1)*15
bytes
p = floor((1024+15)/20) = floor(51.95) = 51
number of search key values in the leaf nodes = floor ((1024-5)/(15+5))=50
h = height of the B+-tree (number of index levels, including the leaf index nodes
n = number of records in the database (or file); all must be pointed at from the next to last level, h-
1

ph-1(p-1) > n
(h-1)log p + log(p-1) > log n
(h-1)log p > log n-log(p-1)
h > 1 + (log n-log(p-1)) / log p
h > 1 + (log 500,000-log 50)/log 51 = 3.34, h=4 (nearest higher integer)
A good approximation can be made by assuming that the leaf index nodes are
implemented with p pointers and p key values:
ph > n
h log p > log n
h > log n/log p
In this case, the result above becomes h > 3.35 or h = 4.

5
B+-tree performance
read a single record (B+-tree) = h+1 rba

update a single record (B+-tree) = search cost + rewrite data block

= (h+1) rba + 1 rba

general update cost for insertion (B+-tree)

=search cost (i.e., h+1 reads)
+simple rewrite of data block and leaf index node pointing to the
data block (i.e., 2 rewrites)
+nos*(write of new split index node
+ rewrite of the index node pointer to the new index node)
+ nosb*(write of new split data block)
= (h+1) rba + 2 rba + nos*(2 rba) + nosb*(1 rba)
where nos is the number of index split node operations required and nosb is the
number of data split block operations required

general update cost for deletion (B+-tree)

= search cost (i.e., h+1 reads)
+ simple rewrite of data block and leaf index node pointing to the
data block (i.e., 2 rewrites)
+ noc*(rewrite of the node pointer to the remaining node)
= (h+1) rba + 2 rba + noc*(1 rba)

where noc is the number of consolidations of index nodes required.

As an example, consider the insertion of a node (with key value 77) to the B+-
tree shown in Fig. 6.6. This insertion requires a search (query) phase and an
insertion phase with one split node. The total insertion cost for height 3 is
insertion cost = (3 + 1) rba search cost + (2 rba) rewrite cost
+ 1 split *(2 rba rewrite cost)
= 8 rba

6
7
Secondary Indexes
Basic characteristics of secondary indexes
* based on Boolean search criteria (AND, OR, NOT) of attributes that are
not the primary key

* attribute type index is level 1 (usually in RAM)

* attribute value index is level 2 (usually in RAM)

* accession list is level 3 (ordered list of pointers to blocks containing
records with the given attribute value)

* one accession list per attribute value; pointers have block address and
record offset typically

* accession lists can be merged to satisfy the intersection (AND) of

records that satisfy more than one condition

Boolean query cost (secondary index)

= search attribute type index + search attribute value index
+ search and merge m accession lists + access t target records

= (0 + 0 + sum of m accession list accesses) rba + t rba

= (sum of m accession list cost) rba + t rba
where m is the number of accession lists to be merged and t is the number
of target records to be accessed after the merge operation.
accession list cost (for accession list j) = ceil(pj/bfac) rba
where pj is the number of pointer entries in the jth accession list and bfac is
the blocking factor for all accession lists

bfac = block_size/pointer_size
* assume all accesses to the accession list are random due to dynamic re-allocation
of disk blocks

 use the 1% rule

(any variable affecting the result by less than 1% is ignored)

8
9

Ch14, Veiws, Normalization - Summary
No ratings yet
Ch14, Veiws, Normalization - Summary
68 pages
Unit Iv Indexing and Hashing: Basic Concepts
No ratings yet
Unit Iv Indexing and Hashing: Basic Concepts
35 pages
CSE 544: Lecture 11 Storing Data, Indexes: Monday, 5/1/2006
No ratings yet
CSE 544: Lecture 11 Storing Data, Indexes: Monday, 5/1/2006
52 pages
CH 12 Updated
No ratings yet
CH 12 Updated
55 pages
Indexing
No ratings yet
Indexing
77 pages
CS2202 IndexingHashing
No ratings yet
CS2202 IndexingHashing
83 pages
Index and Hashing
No ratings yet
Index and Hashing
82 pages
Chapter 7 Indexing Part1
No ratings yet
Chapter 7 Indexing Part1
58 pages
DBMS Indexing 5
No ratings yet
DBMS Indexing 5
63 pages
Indexing
No ratings yet
Indexing
141 pages
1972 Bayer Mccreight
No ratings yet
1972 Bayer Mccreight
17 pages
Database Indexing Essentials
No ratings yet
Database Indexing Essentials
110 pages
UNIT-5: Indexing and Hashing
No ratings yet
UNIT-5: Indexing and Hashing
78 pages
Chapter 7 - Indexing
No ratings yet
Chapter 7 - Indexing
94 pages
03 UW Indexing
No ratings yet
03 UW Indexing
97 pages
Indexing Hashing Files
No ratings yet
Indexing Hashing Files
68 pages
Storage Final
No ratings yet
Storage Final
77 pages
Indexing - II
No ratings yet
Indexing - II
57 pages
DM Module-3
No ratings yet
DM Module-3
60 pages
Database Indexing Techniques
No ratings yet
Database Indexing Techniques
50 pages
02 Blocking - Addional
No ratings yet
02 Blocking - Addional
74 pages
Database Indexing Basics
No ratings yet
Database Indexing Basics
31 pages
CSE 301 Lecture-8-Indexing WT
No ratings yet
CSE 301 Lecture-8-Indexing WT
31 pages
IT3020 L06 Indexing
No ratings yet
IT3020 L06 Indexing
41 pages
Database Management Systems November 6, 2008: Dynamic Indexes: Sections 14.3
No ratings yet
Database Management Systems November 6, 2008: Dynamic Indexes: Sections 14.3
38 pages
Indexing
No ratings yet
Indexing
56 pages
Storage and Indexing Methods
No ratings yet
Storage and Indexing Methods
43 pages
IN3020/4020 - Database Systems Spring 2020, Week 3.1 Indexing
No ratings yet
IN3020/4020 - Database Systems Spring 2020, Week 3.1 Indexing
44 pages
Unit 5 Indexing 2024
No ratings yet
Unit 5 Indexing 2024
50 pages
Organization and Maintenance of Large Ordered Indices
No ratings yet
Organization and Maintenance of Large Ordered Indices
35 pages
Unit-5 B+Trees & Hashing
No ratings yet
Unit-5 B+Trees & Hashing
37 pages
Unit Iv
No ratings yet
Unit Iv
29 pages
DBMS Unit5
No ratings yet
DBMS Unit5
40 pages
DBMS Indexing Methods
No ratings yet
DBMS Indexing Methods
33 pages
DBMS Unit-4
No ratings yet
DBMS Unit-4
9 pages
Indexing and Hashing
No ratings yet
Indexing and Hashing
20 pages
Indexing: Contents
No ratings yet
Indexing: Contents
13 pages
B+ Tree Indexing Explained
No ratings yet
B+ Tree Indexing Explained
46 pages
The Things They Carry
No ratings yet
The Things They Carry
9 pages
Chapter 11: Indexing and Hashing
No ratings yet
Chapter 11: Indexing and Hashing
47 pages
CO3-Session-09 & 10
No ratings yet
CO3-Session-09 & 10
41 pages
7 Indexing
No ratings yet
7 Indexing
13 pages
Memoryhierarchy Indexing
No ratings yet
Memoryhierarchy Indexing
9 pages
INDEXING
No ratings yet
INDEXING
10 pages
DBMS Unit-Iv
No ratings yet
DBMS Unit-Iv
9 pages
Indexing
No ratings yet
Indexing
41 pages
Database Indexing & Hashing Basics
No ratings yet
Database Indexing & Hashing Basics
7 pages
Crash Barrier BBS & QTY
100% (10)
Crash Barrier BBS & QTY
4 pages
Unit 3 Storage Strategies Indices B-Trees Hashing
No ratings yet
Unit 3 Storage Strategies Indices B-Trees Hashing
12 pages
B+-Trees: Efficient Indexing Guide
No ratings yet
B+-Trees: Efficient Indexing Guide
35 pages
Dbms Indexing
No ratings yet
Dbms Indexing
3 pages
CH 13
No ratings yet
CH 13
34 pages
CH 14
No ratings yet
CH 14
6 pages
SKF3013 - Manual Amali PDF
No ratings yet
SKF3013 - Manual Amali PDF
26 pages
08 Indexes1
No ratings yet
08 Indexes1
7 pages
Storage and Indexing
No ratings yet
Storage and Indexing
41 pages
Hype Cycle For Human Capital 2022
No ratings yet
Hype Cycle For Human Capital 2022
99 pages
Sociology of Families Change Continuity and Diversity 1st Edition Ciabattari Test Bankinstant Download
100% (10)
Sociology of Families Change Continuity and Diversity 1st Edition Ciabattari Test Bankinstant Download
49 pages
Philmetals 2014 - Rev - Reduced PDF
No ratings yet
Philmetals 2014 - Rev - Reduced PDF
82 pages
Indexing
No ratings yet
Indexing
6 pages
Time Series Analysis and Forecasting of Gold Price Using ARIMA and LSTM Model
No ratings yet
Time Series Analysis and Forecasting of Gold Price Using ARIMA and LSTM Model
8 pages
SBM Assessment Tool For Online Validation With Essential MOVs
No ratings yet
SBM Assessment Tool For Online Validation With Essential MOVs
10 pages
The Wessex Head Injury Matrix
No ratings yet
The Wessex Head Injury Matrix
7 pages
Solution 3
No ratings yet
Solution 3
7 pages
A+ Guide To Managing and Maintaining Your PC, 6e: Motherboards
100% (1)
A+ Guide To Managing and Maintaining Your PC, 6e: Motherboards
36 pages
شرح مخطط backup
100% (1)
شرح مخطط backup
31 pages
Indexing Files: Last Time
No ratings yet
Indexing Files: Last Time
5 pages
HFSS-High Frequency Structure Simulator
No ratings yet
HFSS-High Frequency Structure Simulator
38 pages
Subject G11-Goodyear Tvl-Ia Eclassrecord 1stsem 2018-19
No ratings yet
Subject G11-Goodyear Tvl-Ia Eclassrecord 1stsem 2018-19
29 pages
AVL Tree Operations
No ratings yet
AVL Tree Operations
23 pages
Research On The Business Model of Pinduoduo Based
No ratings yet
Research On The Business Model of Pinduoduo Based
6 pages
Grade 11 Matrices
No ratings yet
Grade 11 Matrices
3 pages
5GRAIL WCRR Presentation
No ratings yet
5GRAIL WCRR Presentation
6 pages
KNNL - Malaprabha - Final Feasibility Report
No ratings yet
KNNL - Malaprabha - Final Feasibility Report
53 pages
Classical ALV Reporting - Overview of ALV
No ratings yet
Classical ALV Reporting - Overview of ALV
54 pages
CHE 1000-E LEARNING - BALANCING REDOX REACTIONS
No ratings yet
CHE 1000-E LEARNING - BALANCING REDOX REACTIONS
17 pages
Physical DBs B+ Tree
No ratings yet
Physical DBs B+ Tree
35 pages
Preparation of Fermented Blue Crab With Rice and It'S Market Ability
No ratings yet
Preparation of Fermented Blue Crab With Rice and It'S Market Ability
6 pages
Detyre Kursi Rrjeta Telematike
No ratings yet
Detyre Kursi Rrjeta Telematike
19 pages
Ariston Trainman63X
No ratings yet
Ariston Trainman63X
19 pages
A Rose For Emily Is A Story Told by William Faulkner. The Setting of The Story Occurred
No ratings yet
A Rose For Emily Is A Story Told by William Faulkner. The Setting of The Story Occurred
4 pages
Zishan Z3 User Manual
No ratings yet
Zishan Z3 User Manual
3 pages
Apr04 Seismic Forward Modeling
100% (1)
Apr04 Seismic Forward Modeling
12 pages
Rinkasan Materi Vane Shear Test
No ratings yet
Rinkasan Materi Vane Shear Test
7 pages
Higher Education Strategy 2011-2016
No ratings yet
Higher Education Strategy 2011-2016
4 pages
Associations Between Social Responsibility Disclosure and Characteristics of Companies
No ratings yet
Associations Between Social Responsibility Disclosure and Characteristics of Companies
8 pages
Digital Innovations Exam UiTM
No ratings yet
Digital Innovations Exam UiTM
6 pages

Database Modeling - Notes-V

Uploaded by

Database Modeling - Notes-V

Uploaded by

Batch processing of k sequentially stored records

read the transaction file:

Collisions – handled by variations of chained overflow techniques

random access to a hashed file

update a single record (B+-tree) = search cost + rewrite data block

general update cost for insertion (B+-tree)

general update cost for deletion (B+-tree)

where noc is the number of consolidations of index nodes required.

* attribute type index is level 1 (usually in RAM)

* attribute value index is level 2 (usually in RAM)

* accession lists can be merged to satisfy the intersection (AND) of

Boolean query cost (secondary index)

= (0 + 0 + sum of m accession list accesses) rba + t rba

 use the 1% rule

You might also like