6 Hash-Based Indexing

Hash-based indexing uses hashing functions to map values to bucket numbers to locate data entries. There are three main hashing techniques: static hashing uses overflow pages but can have long chains; extendible hashing uses a directory to double the number of buckets when one overflows; and linear hashing utilizes increasing hash functions during splitting rounds to redistribute entries without overflow pages or a directory.

Uploaded by

Michael ggwp

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

53 views

6 Hash-Based Indexing

Uploaded by

Michael ggwp

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 26

HASH-BASED INDEXING

Hashing function
 Excellent for equality selection
 The basic idea is to use a hashing function, which maps
values in a search field into a range of bucket numbers to
find the page on which a desired data entry belongs.
 Hashing function:
 Static Hashing : suffers from the problem of long overflow
chains, which can affect performance.
 Extendible Hashing : uses a directory to support inserts and
deletes efficiently without any overflow pages.
 Linear Hashing : uses a clever policy for creating new
buckets and supports inserts and deletes efficiently
without the use of a directory.
STATIC HASHING
 The pages containing the data can be viewed as a
collection of buckets, with one primary page and
possibly additional overflow pages per bucket.
 A file consists of buckets 0 through N − 1, with one
primary page per bucket initially.
 Buckets contain data entries.
Static Hashing
Operation on Static Hashing
 Search : apply a hash function h to identify the bucket to
which it belongs and then search this bucket.
 Insert :
 use the hash function to identify the correct bucket
 put the data entry there.
 If there is no space for this data entry, allocate a new overflow page, put
the data entry on this page, and add the page to the overflow chain of
the bucket.
 Delete :
 use the hashing function to identify the correct bucket,
 locate the data entry by searching the bucket, and then remove it.
 If this data entry is the last in an overflow page, the overflow page is
removed from the overflow chain of the bucket and added to a list of
free pages.
Hash Function
 hash function must distribute values in the domain of the
search field uniformly over the collection of buckets.
 N buckets, numbered 0 through N − 1, a hash function h :
h(value) = (a*value + b)
 The bucket identified : h(value) mod N.
 The constants a and b can be chosen to `tune' the hash
function.
EXTENDIBLE HASHING *
 Doubling the number of buckets and redistributing
the entries across the new set of buckets  high
cost.
 The Extendible Hashing scheme uses a directory to
support inserts and deletes efficiently without any
overflow pages.
 Use a directory of pointers to buckets, and double
the size of the number of buckets by doubling just the
directory and splitting only the bucket that
overflowed.
 To locate a data entry with hash value 5 (5*)?? 13*??
Example of an Extendible Hashed File:
After Inserting Entry r with h(r)=13
While Inserting Entry r with h(r)=20
After Inserting Entry r with h(r)=20
After Inserting Entry r with h(r)=9
Hash Function
 The basic technique used in Extendible Hashing : to
treat the result of applying a hash function h as a
binary number and to interpret the last d bits, where
d depends on the size of the directory, as an offset
into the directory.
 In example d = 2 (because have four buckets)
 After the split, d = 3 (because have eight buckets).
LINEAR HASHING
 Linear Hashing is a dynamic hashing technique, does not
require a directory.
 The scheme utilizes a family of hash functions h0, h1, h2, : : :,
with the property that each function's range is twice that
of its predecessor.
 If hi maps a data entry into one of M buckets, hi+1 maps a
data entry into one of 2M buckets.
 The idea is the best understood in in terms of round of
splitting.
 During round number level, only hash function hlevel and
hlevel+1 are in use.
 The buckets in the file at the beginning of the round are
split, one by one from the first to the last bucket.
 At any given point within the round, there are:
 Bucket that have been split
 Bucket that are yet to be split
 Bucket created by split in this round.
Buckets during a Round in Linear Hashing
Search for a data entry with a given search key value:
 Apply hash function hLevel, :
 If this leads to one of the unsplit buckets  look there.
 If it leads to one of the split buckets, the entry may be there
or it may have been moved to the new bucket created
earlier in this round by splitting this bucket;
 To determine which of these two buckets contains the entry,
apply hLevel+1.
 An overflow page is added to store the newly
inserted data entry (which triggered the split).
 A counter Level : indicate the current round number
and is initialized to 0.
 The bucket to split is denoted by Next and is initially
bucket 0 (the first bucket).
 Denote the number of buckets in the file at the
beginning of round Level by NLevel  NLevel = N * 2Level.
 For example:
 Let the number of buckets at the beginning of round 0,
denoted by N0, be N.
 Each bucket can hold four data entries, and the file initially
contains four buckets, as shown in the figure.
Example of a Linear Hashed File
 The bucket can be split whenever a new overflow page is
added.
 A split is “triggered” when inserting a new data entry
causes the creation of an overflow page.
 Whenever a split is triggered, the Next bucket is split
and hash function hlevel+1 redistributes entries between
these buckets and its split image.
 After splitting a bucket, the value of Next increment by 1.
 For example: insert data entry 43*  triggers a split
After Inserting Record r with h(r)=43
Not all insertions trigger a split.
For example: insert 37*.

After Inserting Record r with h(r)=37

 Sometimes the bucket pointed to by Next is full, and a new data entry
should be inserted in this bucket  inserting 29*

After Inserting Record r with h(r)=29

After Inserting Records with h(r)=22, 66, and 34
After Inserting Record r with h(r)=50

SPLK-1003 V12.95
No ratings yet
SPLK-1003 V12.95
23 pages
Group Assignment - On - Hashing in DBMS
No ratings yet
Group Assignment - On - Hashing in DBMS
4 pages
Chapter 1 (Thesis)
No ratings yet
Chapter 1 (Thesis)
23 pages
The Umbrella Academy Episode Script Transcript Season 1 02 Run Boy Run
No ratings yet
The Umbrella Academy Episode Script Transcript Season 1 02 Run Boy Run
77 pages
Linear Hashing
No ratings yet
Linear Hashing
21 pages
Ch11 Hash Indexes 1perpage Annotated
No ratings yet
Ch11 Hash Indexes 1perpage Annotated
28 pages
Database Systems (資料庫系統) : November 26/28, 2007 Lecture #9
No ratings yet
Database Systems (資料庫系統) : November 26/28, 2007 Lecture #9
43 pages
Chap. 6 Hash-Based Indexing: Abel J.P. Gomes
No ratings yet
Chap. 6 Hash-Based Indexing: Abel J.P. Gomes
15 pages
Hash-Based Indexes: Introduction To Database, Fall 2004/melikyan 1
No ratings yet
Hash-Based Indexes: Introduction To Database, Fall 2004/melikyan 1
19 pages
Linear Hashing: Historical Background
No ratings yet
Linear Hashing: Historical Background
4 pages
Dynamic Hashing
No ratings yet
Dynamic Hashing
35 pages
Unit 4-Hashing
No ratings yet
Unit 4-Hashing
24 pages
Static and Dynamic Hashing
No ratings yet
Static and Dynamic Hashing
12 pages
Hashing
No ratings yet
Hashing
15 pages
Chapter 11
No ratings yet
Chapter 11
22 pages
CO3 Notes Hashing
No ratings yet
CO3 Notes Hashing
10 pages
Unit 1 Lecture 10
No ratings yet
Unit 1 Lecture 10
11 pages
CO3 Session 6
No ratings yet
CO3 Session 6
29 pages
Adbs 5
No ratings yet
Adbs 5
37 pages
DBMS Hashing
No ratings yet
DBMS Hashing
3 pages
9-Hashing Schemes
No ratings yet
9-Hashing Schemes
23 pages
11 What Is Hashing in DBMS
No ratings yet
11 What Is Hashing in DBMS
20 pages
Lec04 Hashing CH 11 P2
No ratings yet
Lec04 Hashing CH 11 P2
44 pages
Hash-Based Indexes: As For Any Index, 3 Alternatives For Data Entries K
No ratings yet
Hash-Based Indexes: As For Any Index, 3 Alternatives For Data Entries K
7 pages
Chapter 11
No ratings yet
Chapter 11
22 pages
hash_dbms
No ratings yet
hash_dbms
5 pages
Lecture14 Hash Based Indexing and Sorting MHH 18oct 2016
No ratings yet
Lecture14 Hash Based Indexing and Sorting MHH 18oct 2016
71 pages
Hashing in DBMS
No ratings yet
Hashing in DBMS
11 pages
Hashing
No ratings yet
Hashing
33 pages
Hashing
No ratings yet
Hashing
8 pages
Hashing
No ratings yet
Hashing
4 pages
Linear Hash
No ratings yet
Linear Hash
15 pages
DSimp2
No ratings yet
DSimp2
21 pages
Hashing in DBMS
No ratings yet
Hashing in DBMS
6 pages
There Are Two Types of Hashing
No ratings yet
There Are Two Types of Hashing
2 pages
Unit-3 Hashing Storage Btree
No ratings yet
Unit-3 Hashing Storage Btree
26 pages
It Is A Very Efficient Method To Search The Exact Data Items Based On Hash Table
No ratings yet
It Is A Very Efficient Method To Search The Exact Data Items Based On Hash Table
49 pages
Hashing Function
No ratings yet
Hashing Function
14 pages
mod 5
No ratings yet
mod 5
13 pages
DSAD Dynamic Hashing
No ratings yet
DSAD Dynamic Hashing
79 pages
Chapter 11 Hashing
No ratings yet
Chapter 11 Hashing
42 pages
MODULE 5_BCS304_HASHING_Leftisht trees_OBST_Notes
No ratings yet
MODULE 5_BCS304_HASHING_Leftisht trees_OBST_Notes
32 pages
u12
No ratings yet
u12
8 pages
L07
No ratings yet
L07
24 pages
22-M4-File Organization - Single Level Indexing-09!09!2024
No ratings yet
22-M4-File Organization - Single Level Indexing-09!09!2024
12 pages
Unit-3 Part 2 Indexing and Hashing
No ratings yet
Unit-3 Part 2 Indexing and Hashing
36 pages
hashing
No ratings yet
hashing
11 pages
Performance Comparison of Extendible Hashing and Linear Hashing Techniques
No ratings yet
Performance Comparison of Extendible Hashing and Linear Hashing Techniques
8 pages
Hashing
No ratings yet
Hashing
8 pages
2.8. ADS_collision Resolution-Extendible Hashing-1
No ratings yet
2.8. ADS_collision Resolution-Extendible Hashing-1
47 pages
E Ds Extendiblehashing
No ratings yet
E Ds Extendiblehashing
3 pages
Unit 6.2 Indexing and Hashing
No ratings yet
Unit 6.2 Indexing and Hashing
37 pages
DSA_240404_220052 (1)
No ratings yet
DSA_240404_220052 (1)
9 pages
DS 8
No ratings yet
DS 8
30 pages
Hashing in DBMS
No ratings yet
Hashing in DBMS
4 pages
Hashing
No ratings yet
Hashing
47 pages
Hashing in Data Structure
No ratings yet
Hashing in Data Structure
25 pages
Hash Function
No ratings yet
Hash Function
9 pages
CHAPTER 8 Hashing: Instructors: C. Y. Tang and J. S. Roger Jang
No ratings yet
CHAPTER 8 Hashing: Instructors: C. Y. Tang and J. S. Roger Jang
78 pages
Chapter 8 - Hashing
No ratings yet
Chapter 8 - Hashing
78 pages
Database Indexing and Hashing
No ratings yet
Database Indexing and Hashing
7 pages
ADVANCED DATA STRUCTURES FOR ALGORITHMS: Mastering Complex Data Structures for Algorithmic Problem-Solving (2024)
From Everand
ADVANCED DATA STRUCTURES FOR ALGORITHMS: Mastering Complex Data Structures for Algorithmic Problem-Solving (2024)
VIOLET CASTRO
No ratings yet
Hashing
From Everand
Hashing
Prakash Hegade
No ratings yet
EPLAN 5.50 Podręcznik Użytkownika cz.1
No ratings yet
EPLAN 5.50 Podręcznik Użytkownika cz.1
604 pages
History - Wikipedia
No ratings yet
History - Wikipedia
37 pages
VITA-Veneering en 10.2019
No ratings yet
VITA-Veneering en 10.2019
135 pages
NDFC Deployment For Connectrix MDS Switches Part 2
No ratings yet
NDFC Deployment For Connectrix MDS Switches Part 2
38 pages
Guided Writing (Informal Letter)
100% (1)
Guided Writing (Informal Letter)
11 pages
List of Vocabulary C2
No ratings yet
List of Vocabulary C2
43 pages
Martial Arts Training
No ratings yet
Martial Arts Training
27 pages
Luigi the Goldfish Fish Us Airali Design
No ratings yet
Luigi the Goldfish Fish Us Airali Design
7 pages
Jacuzzi Serene 3
No ratings yet
Jacuzzi Serene 3
2 pages
The Wasps' Nest Poetry Commentary
No ratings yet
The Wasps' Nest Poetry Commentary
3 pages
CMCA LEC 12-Intrapartum
No ratings yet
CMCA LEC 12-Intrapartum
189 pages
Chapter#1
No ratings yet
Chapter#1
67 pages
A.742 Buddha Construction Private Limited - June - 2024 - FINAL
No ratings yet
A.742 Buddha Construction Private Limited - June - 2024 - FINAL
4 pages
Ospe Hpe, Ihc
No ratings yet
Ospe Hpe, Ihc
15 pages
BU3 - 1.0a - Introduction and Definition of Architectural Acoustics
No ratings yet
BU3 - 1.0a - Introduction and Definition of Architectural Acoustics
29 pages
Technical Data Heavy Lift Trucks 28-50: DCF300-12 - Cummins QSB6,7 - Duplex, Clear View, Standard - LH 4000
No ratings yet
Technical Data Heavy Lift Trucks 28-50: DCF300-12 - Cummins QSB6,7 - Duplex, Clear View, Standard - LH 4000
2 pages
invoice_JLEU78977-1
No ratings yet
invoice_JLEU78977-1
2 pages
Upper Intermediate 3 Progress Check 4
No ratings yet
Upper Intermediate 3 Progress Check 4
29 pages
06-Section 6A Hole Cleaning
No ratings yet
06-Section 6A Hole Cleaning
18 pages
Formato Escala ASIA
No ratings yet
Formato Escala ASIA
2 pages
Scales For Improvisation: Jamey Aebersold Scale Syllabus
No ratings yet
Scales For Improvisation: Jamey Aebersold Scale Syllabus
7 pages
Eng 3081
No ratings yet
Eng 3081
78 pages
Sourav Final
No ratings yet
Sourav Final
49 pages
Crew Cash Handling Policy and Procedures
No ratings yet
Crew Cash Handling Policy and Procedures
3 pages
O - Need For Speed - Top Five Oracle Performance Tuning Tips - NYOUG
No ratings yet
O - Need For Speed - Top Five Oracle Performance Tuning Tips - NYOUG
67 pages
Automated Bottle Filling System in Industries Using PLC: Present by Ei Thandar Kyaw Roll No-Vi-Ep-14
No ratings yet
Automated Bottle Filling System in Industries Using PLC: Present by Ei Thandar Kyaw Roll No-Vi-Ep-14
15 pages
Addiction Severity Index Lite - CF: Thomas Mclellan, Ph.D. John Cacciola, Ph.D. Deni Carise, Ph.D. Thomas H. Coyne, MSW
No ratings yet
Addiction Severity Index Lite - CF: Thomas Mclellan, Ph.D. John Cacciola, Ph.D. Deni Carise, Ph.D. Thomas H. Coyne, MSW
12 pages

6 Hash-Based Indexing

Uploaded by

6 Hash-Based Indexing

Uploaded by

HASH-BASED INDEXING

After Inserting Record r with h(r)=37

After Inserting Record r with h(r)=29

You might also like