0% found this document useful (0 votes)

0 views

13.hashing

The document reviews various searching techniques, highlighting the efficiency of sequential and binary searches, as well as binary search trees (BST) and the need for height balancing to maintain O(log n) search time. It introduces hashing as a method to achieve O(1) search time using hash functions, while also discussing collision resolution techniques such as linear probing and chaining. Additionally, it outlines types of hash tables, hashing functions, and applications of hash tables in areas like database systems and network processing.

Uploaded by

Anas Aqeel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

0 views

13.hashing

Uploaded by

Anas Aqeel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 26

HASHING

REVIEW OF SEARCHING TECHNIQUES

Recall the efficiency of searching techniques covered

earlier.

 The sequential search algorithm takes time

proportional to the data size, i.e, O(n).

 Binary search improves on liner search reducing

the search time to O(log n).

 With a BST, an O(log n) search efficiency can be

obtained; but the worst-case complexity is O(n).

 To guarantee the O(log n) search time, BST height

balancing is required.
REVIEW OF SEARCHING TECHNIQUES

 Can we do better than that? Is it

possible to design a search of
O(1) – that is, one that has a
constant search time, no matter
where the element is in the list
HASHING
 is used to order and access
elements in a list quickly -- the
goal is O(1) time -- by using a
function of the key value to
identify its location in the list.

 The function of the key value is

called a hash function.

4
USING A HASH FUNCTION

values
[0] 0000
HandyParts company
makes no more than 100
[1] 0001 different parts. But the
[2] parts all have four digit
0002
numbers which ranges
[3] 0
0003 from 0000 to 0100.
[4]
0004
8
We can directly access any
.
.
part record through the
. 10 array index.
.
.
. i.e. there is one-to-one
correspondence between
[ 97] 0097
Part number & index
[ 98] 0098

[ 99] 0099
5
USING A HASH FUNCTION

values
[0] Empty Now if another company makes no
[1] 4501
more than 100 different parts. But
the parts all have four digit
[2] Empty numbers with no
[3] restriction on range .
8903
7803
What to do?
[4]
Empty
.
.8
This hash function can be used to
. store and retrieve parts in an array.
.
. 10
.
Hash(key) = partNum % 100
[ 97] Empty

[ 98] 2298

[ 99] 3699
6
PLACING ELEMENTS IN THE ARRAY

values
[0] Empty
5500 Use the hash function

[1] 4501 Hash(key) = partNum % 100

[2] Empty
to place the element with
[3] 8903
7803
[4]
part number 5500 in the
Empty
.
.8
array.
.
.
.
.10
Hash(key) = 5500 % 100 =0
[ 97] Empty

[ 98] 2298

[ 99] 3699
7
PLACING ELEMENTS IN THE ARRAY

values
[0] 5500
Use the hash function

[1] 4501 Hash(key) = partNum % 100

[2] Empty
to place the element with
[3] 8903
7803
[4]
part number 5502 in the
Empty
.
.8
array.
.
.
.
.10

[ 97] Empty

[ 98] 2298

[ 99] 3699
8
PLACING ELEMENTS IN THE ARRAY

values
[0] 5500
Use the hash function

[1] 4501 Hash(key) = partNum % 100

[2] Empty
to place the element with
[3] 8903
7803
[4]
part number 5502 in the
Empty
.
.8
array.
.
.
.
.10

[ 97] Empty

[ 98] 2298

[ 99] 3699
9
PLACING ELEMENTS IN THE ARRAY

values
[0] 5500
Next place part number
6702 in the array.
[1] 4501

[2] Hash(key) = partNum % 100

5502

[3]
7803 6702 % 100 = 2
[4]
Empty But values[2] is already
.
.
occupied.
.
.
.
. COLLISION OCCURS
[ 97] Empty
The condition resulting when
[ 98] 2298
two or more keys produce
[ 99] 3699 the same hash location
10
HOW TO RESOLVE THE COLLISION?

values
[0] 5500
One way is by linear probing.
This uses the rehash function
[1] 4501

[2] (HashValue + 1) % 100

5502

[3]
7803 repeatedly until an empty location
[4]
is found for part number 6702.
Empty
.
.
.
.
.
. Linear Probing: Resolving a hash
collision by sequentially searching a
[ 97] Empty
hash table beginning at the location
[ 98] 2298 returned by the has function.

[ 99] 3699
11
RESOLVING THE COLLISION

values
[0] 5500
Still looking for a place for 6702
using the function
[1] 4501

[2] (HashValue + 1) % 100

5502

[3]
7803
[4]
(6702 + 1) % 100 = 3
Empty
.
.
.
.
.
.

[ 97] Empty

[ 98] 2298

[ 99] 3699
12
COLLISION RESOLVED

values
[0] 5500
Part 6702 can be placed at
the location with index 4.
[1] 4501

[2] (6702 + 2) % 100 = 4

5502

[3]
7803
[4]
Empty
.
.
.
.
.
.

[ 97] Empty

[ 98] 2298

[ 99] 3699
13
COLLISION RESOLVED

values
[0] 5500
Part 6702 is placed at
the location with index 4.
[1] 4501

[2] 5502
Where would the part with
[3] 7803 number 4598 be placed using
[4] 6702
linear probing?

Empty
[5]
. .
. . 4598 will be stored at index 5
. . /*treating list as circular*/
[ 97] Empty

[ 98] 2298

[ 99] 3699
14
BUCKETS & CHAINING
 Another alternative for handling
collisions is to allow multiple element
keys to hash to the same location.

 Bucket
 A collection of elements associated with a
particular hash location
BUCKETS & CHAINING
 Suppose we have a bucket of size 3. so 3
elements can share the location.
[00 Empty Empty Insert 5462
Empty
5460
] 5462%100 = 2
[01 14001 72101 Empty
Insert 5460
]
5460%100 = 0
[02 9872 5462
Empty Empty
9462
] Insert 9462
. . . . 9462%100 = 2
. . . . Insert 71462
. . . . 71462%100 = 2

[99 19899 2399 199

] Where to insert?
CHAINING
 A linked list of elements that share the same hash
location

0 ...

1
...
2

D-1 ...
HASH TABLES
 There are two types of Hash Tables: Open-addressed Hash Tables and Separate-
Chained Hash Tables.

 An Open-addressed Hash Table is a one-dimensional array indexed by

integer values that are computed by an index function called a hash function.

 A Separate-Chained Hash Table is a one-dimensional array of linked lists indexed

by integer values that are computed by an index function called a hash function.

 Hash tables are sometimes referred to as scatter tables..\

 Typical hash table operations are:

· Insertion.
· Searching
· Deletion.
TYPES OF HASHING
 There are two types of hashing :
1. Static hashing: In static hashing, the hash function maps
search-key values to a fixed set of locations.

2. Dynamic hashing: In dynamic hashing a hash table can grow to

handle more items. The associated hash function must
change as the table grows.

 The load factor of a hash table is the ratio of the number of keys in the table
to the size of the hash table.

 Note: The higher the load factor, the slower the retrieval.

 With open addressing, the load factor cannot exceed 1. With

chaining, the load factor often exceeds 1.
HASH FUNCTIONS (CONT’D)
 A good hash function should:

· Minimize collisions.

· Be easy and quick to compute.

· Distribute key values evenly in the hash

table.

· Use all the information provided in the key.

COMMON HASHING FUNCTIONS
1. Division Remainder (using the table size as the
divisor)

 Computes hash value from key using the %

operator.

 Prime numbers are better table size values.

COMMON HASHING FUNCTIONS (CONT’D)
2. Truncation or Digit/Character Extraction

 Works based on the distribution of digits or characters in the key.

 More evenly distributed digit positions are extracted and used for
hashing purposes.

 For instance, students IDs or ISBN codes may contain common

subsequences which may increase the likelihood of collision.

 Very fast but digits/characters distribution in keys may not be very

even.
COMMON HASHING FUNCTIONS (CONT’D)
3. Folding

 It involves splitting keys into two or more parts and then combining the
parts to form the hash addresses.

 To map the key 25936715 to a range between 0 and 9999, we can:

· split the number into two as 2593 and 6715 and
· add these two to obtain 9308 as the hash value.

 Very useful if we have keys that are very large.

 Fast and simple especially with bit patterns.

 A great advantage is ability to transform non-integer keys into integer

COMMON HASHING FUNCTIONS (CONT’D)
4. Radix Conversion

 Transforms a key into another number base to obtain the hash value.

 Typically use number base other than base 10 and base 2 to calculate
the hash addresses.

 To map the key 55354 in the range 0 to 9999 using base 11 we have:

5535410 = 3865211

 We may truncate the high-order 3 to yield 8652 as our hash address

within 0 to 9999.
COMMON HASHING FUNCTIONS (CONT’D)
5. Mid-Square

 The key is squared and the middle part of the result taken as the
hash value.

 To map the key 3121 into a hash table of size 1000, we square it
31212 = 9740641 and extract 406 as the hash value.

 Works well if the keys do not contain a lot of leading or trailing

zeros.

 Non-integer keys have to be preprocessed to obtain corresponding

integer values.
SOME APPLICATIONS OF HASH TABLES
 Database systems: Specifically, those that require efficient random access. Generally,
database systems try to optimize between two types of access methods: sequential and
random. Hash tables are an important part of efficient random access because they
provide a way to locate data in a constant amount of time.

 Symbol tables: The tables used by compilers to maintain information about symbols
from a program. Compilers access information about symbols frequently. Therefore, it
is important that symbol tables be implemented very efficiently.

 Data dictionaries: Data structures that support adding, deleting, and searching for
data. Although the operations of a hash table and a data dictionary are similar, other
data structures may be used to implement data dictionaries. Using a hash table is
particularly efficient.

 Network processing algorithms: Hash tables are fundamental components of several

network processing algorithms and applications, including route lookup, packet
classification, and network monitoring.

 Browser Cashes: Hash tables are used to implement browser cashes.

Hashing PPT For Student
No ratings yet
Hashing PPT For Student
53 pages
Hashing
No ratings yet
Hashing
37 pages
Hashing
No ratings yet
Hashing
23 pages
Hashing PDF
No ratings yet
Hashing PDF
56 pages
Hashing
No ratings yet
Hashing
56 pages
Hashing Unit 1
No ratings yet
Hashing Unit 1
91 pages
DSA Lab 11 Hashing
No ratings yet
DSA Lab 11 Hashing
9 pages
05 Hashing
No ratings yet
05 Hashing
47 pages
Hash Table Data Structure
No ratings yet
Hash Table Data Structure
34 pages
Lect Hashing
No ratings yet
Lect Hashing
36 pages
Unit-5
No ratings yet
Unit-5
50 pages
Hashing in Data Structure
No ratings yet
Hashing in Data Structure
43 pages
Hashing
No ratings yet
Hashing
66 pages
MODULE-5
No ratings yet
MODULE-5
33 pages
Hashing
No ratings yet
Hashing
20 pages
Algo Cha 8
No ratings yet
Algo Cha 8
20 pages
Hashing RPK
No ratings yet
Hashing RPK
61 pages
3 Hashing
No ratings yet
3 Hashing
20 pages
CSC 302 - Hashing Techniques
No ratings yet
CSC 302 - Hashing Techniques
19 pages
Unit28 Hashing1
No ratings yet
Unit28 Hashing1
19 pages
Introduction To Hashing & Hashing Techniques: Review of Searching Techniques
No ratings yet
Introduction To Hashing & Hashing Techniques: Review of Searching Techniques
19 pages
Done DS GTU Study Material Presentations Unit-4 13032021035653AM
No ratings yet
Done DS GTU Study Material Presentations Unit-4 13032021035653AM
24 pages
Chapter One - Hashing PDF
No ratings yet
Chapter One - Hashing PDF
30 pages
Dsa Hashing (21CS32)
No ratings yet
Dsa Hashing (21CS32)
16 pages
Exp 5 - Dsa Lab File
No ratings yet
Exp 5 - Dsa Lab File
10 pages
Hashing ClassNotes
No ratings yet
Hashing ClassNotes
8 pages
Hashing and Graphs
No ratings yet
Hashing and Graphs
28 pages
HAshing (ISE department)
No ratings yet
HAshing (ISE department)
31 pages
Dsa 4
No ratings yet
Dsa 4
55 pages
DS Lecture - 6 (Hashing)
No ratings yet
DS Lecture - 6 (Hashing)
32 pages
Hashing
No ratings yet
Hashing
16 pages
09 Hashtable
No ratings yet
09 Hashtable
53 pages
HASHING
No ratings yet
HASHING
63 pages
Hashing
No ratings yet
Hashing
44 pages
Dsa Module 6 Ktuassist
No ratings yet
Dsa Module 6 Ktuassist
9 pages
HASHING
No ratings yet
HASHING
8 pages
Hashing PDF
No ratings yet
Hashing PDF
65 pages
unit 1 Hashing
No ratings yet
unit 1 Hashing
61 pages
ADS M TECH MID 2
No ratings yet
ADS M TECH MID 2
26 pages
Lec 11 Hashing and Collision
No ratings yet
Lec 11 Hashing and Collision
16 pages
Module 5: HASHING: Functions. The Values Are Then Stored in A Data Structure Called Hash Table
No ratings yet
Module 5: HASHING: Functions. The Values Are Then Stored in A Data Structure Called Hash Table
39 pages
DS Lecture - 6 (Hashing)
No ratings yet
DS Lecture - 6 (Hashing)
27 pages
Hashing
No ratings yet
Hashing
23 pages
DSA MK Lect2 PDF
No ratings yet
DSA MK Lect2 PDF
92 pages
Hash Tables: Unit - III - Chapter 5 of Data Structures and Algorithm Analysis in C++ - Mark Allen Weiss
No ratings yet
Hash Tables: Unit - III - Chapter 5 of Data Structures and Algorithm Analysis in C++ - Mark Allen Weiss
60 pages
Hashing: Amar Jukuntla
No ratings yet
Hashing: Amar Jukuntla
22 pages
Hash Table: Didih Rizki Chandranegara
No ratings yet
Hash Table: Didih Rizki Chandranegara
33 pages
Lab 2
No ratings yet
Lab 2
10 pages
Lab5 Hashing Algos
No ratings yet
Lab5 Hashing Algos
10 pages
Hashing Cropped (1)
No ratings yet
Hashing Cropped (1)
12 pages
Hashing
No ratings yet
Hashing
11 pages
TCP2101 Algorithm Design & Analysis: - Hash Tables
No ratings yet
TCP2101 Algorithm Design & Analysis: - Hash Tables
58 pages
Analysis of Algorithms CS 477/677: Hashing Instructor: George Bebis
No ratings yet
Analysis of Algorithms CS 477/677: Hashing Instructor: George Bebis
53 pages
UNIT 1- Hashing
No ratings yet
UNIT 1- Hashing
118 pages
Hashing
No ratings yet
Hashing
42 pages
Cse373 10 Hashing
No ratings yet
Cse373 10 Hashing
36 pages
Dsa 5
No ratings yet
Dsa 5
22 pages
Hash Tables in DS
No ratings yet
Hash Tables in DS
14 pages
vision_cs_2023_algorithm_chapter_2_hashing_85
No ratings yet
vision_cs_2023_algorithm_chapter_2_hashing_85
12 pages
Fast mental calculation tricks
From Everand
Fast mental calculation tricks
EasyMath
No ratings yet
MCA 2nd yr Practical list S-2020
No ratings yet
MCA 2nd yr Practical list S-2020
8 pages
Lectures On Probability Theory And Mathematical Statistics 2nd Edition Marco Taboga download
100% (1)
Lectures On Probability Theory And Mathematical Statistics 2nd Edition Marco Taboga download
76 pages
Daihatsu Centurion Del Atlantico Operation Manual Section 13
No ratings yet
Daihatsu Centurion Del Atlantico Operation Manual Section 13
75 pages
Bolt, Stud, Sealing, Gasketing and Nut Sizes For Piping: Class 150 Steel and 125 Cast Iron
No ratings yet
Bolt, Stud, Sealing, Gasketing and Nut Sizes For Piping: Class 150 Steel and 125 Cast Iron
1 page
CSP0000160-01 Lubrication
No ratings yet
CSP0000160-01 Lubrication
13 pages
JP Polyplast Swift SFMS-6523
No ratings yet
JP Polyplast Swift SFMS-6523
5 pages
Material Inspection Check List
No ratings yet
Material Inspection Check List
2 pages
UX Design with Figma User Centered Interface Design and Prototyping with Figma Design Thinking 1st Edition Tom Green pdf download
100% (1)
UX Design with Figma User Centered Interface Design and Prototyping with Figma Design Thinking 1st Edition Tom Green pdf download
58 pages
Lab Report
No ratings yet
Lab Report
12 pages
Ocn68472-V101167 TMW Fa22 1ST Replen Nyo 05 02 22
No ratings yet
Ocn68472-V101167 TMW Fa22 1ST Replen Nyo 05 02 22
10 pages
Promotion Basics in Metadata Pipeline Cheat Sheet
No ratings yet
Promotion Basics in Metadata Pipeline Cheat Sheet
3 pages
HW5 Chp4 Ans
No ratings yet
HW5 Chp4 Ans
4 pages
076MSCSK002 Bibek Adhikari
No ratings yet
076MSCSK002 Bibek Adhikari
51 pages
BSBPMG522 Assessor Marking Guide
No ratings yet
BSBPMG522 Assessor Marking Guide
44 pages
Introduction To Human Resource Management: Orientation & Training
No ratings yet
Introduction To Human Resource Management: Orientation & Training
26 pages
Online Collaboration
No ratings yet
Online Collaboration
15 pages
MDX P300 Manual - EN FR ES JP
No ratings yet
MDX P300 Manual - EN FR ES JP
88 pages
Lenskart Report
No ratings yet
Lenskart Report
47 pages
Embracing Continuous Delivery With Azure Pipelines
No ratings yet
Embracing Continuous Delivery With Azure Pipelines
31 pages
Operating Manual CTC
No ratings yet
Operating Manual CTC
68 pages
Shibi
No ratings yet
Shibi
144 pages
Black Friday - 2020
No ratings yet
Black Friday - 2020
346 pages
JLN-740 - 741 (E) 7ZPNA3408 (3版) SERVICE MANUAL 191009-161-220
No ratings yet
JLN-740 - 741 (E) 7ZPNA3408 (3版) SERVICE MANUAL 191009-161-220
60 pages
Carrier Sense Multiple Access With Collision Detection1
No ratings yet
Carrier Sense Multiple Access With Collision Detection1
5 pages
CS-Tool - SPD (Spreadtrum) Phones Success Reports - GSM-Forum
0% (1)
CS-Tool - SPD (Spreadtrum) Phones Success Reports - GSM-Forum
15 pages
JCDP 23 991 PDF
No ratings yet
JCDP 23 991 PDF
7 pages
Pre Deployment
No ratings yet
Pre Deployment
14 pages
12.ode On Solitude Mcqs
No ratings yet
12.ode On Solitude Mcqs
27 pages
Data Sheet: HEF4071B Gates
No ratings yet
Data Sheet: HEF4071B Gates
3 pages
PROFIdrive Statemachine DOC V10 en
No ratings yet
PROFIdrive Statemachine DOC V10 en
13 pages

13.hashing

Uploaded by

13.hashing

Uploaded by

HASHING

REVIEW OF SEARCHING TECHNIQUES

Recall the efficiency of searching techniques covered

 The sequential search algorithm takes time

 Binary search improves on liner search reducing

 With a BST, an O(log n) search efficiency can be

 To guarantee the O(log n) search time, BST height

 Can we do better than that? Is it

 The function of the key value is

[1] 4501 Hash(key) = partNum % 100

[1] 4501 Hash(key) = partNum % 100

[1] 4501 Hash(key) = partNum % 100

[2] Hash(key) = partNum % 100

[2] (HashValue + 1) % 100

[2] (HashValue + 1) % 100

[2] (6702 + 2) % 100 = 4

[99 19899 2399 199

 An Open-addressed Hash Table is a one-dimensional array indexed by

 A Separate-Chained Hash Table is a one-dimensional array of linked lists indexed

 Hash tables are sometimes referred to as scatter tables..\

 Typical hash table operations are:

2. Dynamic hashing: In dynamic hashing a hash table can grow to

 With open addressing, the load factor cannot exceed 1. With

· Be easy and quick to compute.

· Distribute key values evenly in the hash

· Use all the information provided in the key.

 Computes hash value from key using the %

 Prime numbers are better table size values.

 Works based on the distribution of digits or characters in the key.

 For instance, students IDs or ISBN codes may contain common

 Very fast but digits/characters distribution in keys may not be very

 To map the key 25936715 to a range between 0 and 9999, we can:

 Very useful if we have keys that are very large.

 Fast and simple especially with bit patterns.

 A great advantage is ability to transform non-integer keys into integer

 We may truncate the high-order 3 to yield 8652 as our hash address

 Works well if the keys do not contain a lot of leading or trailing

 Non-integer keys have to be preprocessed to obtain corresponding

 Network processing algorithms: Hash tables are fundamental components of several

 Browser Cashes: Hash tables are used to implement browser cashes.

You might also like