22CS302 LM21

The document provides an overview of hash tables, explaining how hashing is used to uniquely identify objects and facilitate quick data retrieval. It discusses the importance of a good hash function, which should be easy to compute, provide uniform distribution, and minimize collisions. Additionally, it illustrates the application of hashing in counting character frequencies in a string, demonstrating improved efficiency compared to traditional methods.

Uploaded by

poojask1636

22XX302 DATA STRUCTURES I

UNIT IV

&
HASH TABLE

1. Basics of Hash Tables

Hashing is a technique used to uniquely identify a specific object from a group of similar
objects. Some everyday examples of hashing:
In universities, each student is assigned a unique roll number that can be used to retrieve
information about them.
In libraries, each book is assigned a unique number that can be used to look up information
about the book, such as its exact position in the library or the users it has been issued to.
In both examples, the students and books are hashed to a unique number.
Assume that you have an object and you want to assign a key to it to make searching easy. To
store the key/value pair, you can use a simple array-like data structure in which the keys
(integers) are used directly as indices to store the values. However, when the keys are large
and cannot be used directly as indices, you should use hashing.
In hashing, large keys are converted into small keys by using hash functions. The values are
then stored in a data structure called a hash table. The idea of hashing is to distribute entries
(key/value pairs) uniformly across an array. Each element is assigned a key (the converted key).
By using that key you can access the element in O(1) time on average. Using the key, the
algorithm (hash function) computes an index that suggests where an entry can be found or inserted.
Hashing is implemented in two steps:
First, the element is converted into an integer by using a hash function. This integer can be
used as an index at which to store the original element in the hash table.
Second, the element is stored in the hash table, where it can be quickly retrieved using the
hashed key.
    hash = hashfunc(key)
    index = hash % array_size
In this method, the hash is independent of the array size, and it is then reduced to an index (a
number between 0 and array_size − 1) by using the modulo operator (%).
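
The two steps above can be sketched in C++ as follows. This is a minimal illustration, not the notes' own implementation: std::hash is used as a stand-in for the generic hashfunc(key), and the name indexFor is an assumption introduced for this example.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <string>

// Step 1: turn the key into a (possibly very large) integer.
// std::hash is an assumed stand-in for hashfunc(key) from the text.
std::size_t hashfunc(const std::string& key) {
    return std::hash<std::string>{}(key);
}

// Step 2: reduce the hash to a valid index in [0, array_size - 1].
// indexFor is a hypothetical helper name used only in this sketch.
std::size_t indexFor(const std::string& key, std::size_t array_size) {
    assert(array_size > 0);  // modulo by zero would be undefined
    return hashfunc(key) % array_size;
}
```

The same key always yields the same index for a given array size, which is what makes later retrieval possible.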

2. Hash function

A hash function is any function that can be used to map data of an arbitrary size to values of
a fixed size that fall within the range of the hash table. The values returned by a hash function
are called hash values, hash codes, hash sums, or simply hashes.
To achieve a good hashing mechanism, it is important to have a good hash function that meets
the following basic requirements:
Easy to compute: It should be easy to compute and must not become an algorithm in itself.
Uniform distribution: It should provide a uniform distribution across the hash table and
should not result in clustering.
Fewer collisions: Collisions occur when two or more elements are mapped to the same hash
value. These should be minimized.
Note: Irrespective of how good a hash function is, collisions are bound to occur. Therefore, to
maintain the performance of a hash table, it is important to manage collisions through various
collision resolution techniques.

3. Need for a good hash function

Let us understand the need for a good hash function. Assume that you have to store the strings
{"abcdef", "bcdefa", "cdefab", "defabc"} in a hash table by using the hashing technique.
To compute the index for storing the strings, use a hash function that states the following:
The index for a specific string will be equal to the sum of the ASCII values of its characters
modulo 599.
As 599 is a prime number, it reduces the possibility of different strings being mapped to the
same index (collisions). It is recommended that you use prime numbers with modulo. The ASCII
values of a, b, c, d, e, and f are 97, 98, 99, 100, 101, and 102 respectively. Since all the strings
contain the same characters in different permutations, the sum will always be 597, and
597 modulo 599 is 597.
The hash function will therefore compute the same index for all the strings. As the index of
all the strings is the same, you can create a list at that index and insert all the strings in
that list.
Here, it will take O(n) time (where n is the number of strings) to access a specific string. This
shows that the hash function is not a good hash function.
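
The collision described above can be checked with a short C++ sketch. The function name sumHash and the default table size parameter are assumptions made for this illustration. The characters a to f sum to 97 + 98 + 99 + 100 + 101 + 102 = 597, so every permutation of them lands on index 597.

```cpp
#include <cassert>
#include <string>

// Naive hash from the text: sum of the character codes modulo the table size.
// sumHash is a hypothetical name used only in this sketch.
int sumHash(const std::string& s, int table_size = 599) {
    assert(table_size > 0);
    int sum = 0;
    for (unsigned char c : s) {
        sum += c;  // the order of the characters does not affect the sum
    }
    return sum % table_size;
}
```

Because addition ignores order, sumHash("abcdef"), sumHash("bcdefa"), sumHash("cdefab"), and sumHash("defabc") all return the same index.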
Let's try a different hash function. The index for a specific string will be equal to the sum of
the ASCII values of its characters, each multiplied by its position in the string (starting
from 1), taken modulo 2069 (a prime number).
String    Hash function                                            Index
abcdef    (97*1 + 98*2 + 99*3 + 100*4 + 101*5 + 102*6) % 2069      38
bcdefa    (98*1 + 99*2 + 100*3 + 101*4 + 102*5 + 97*6) % 2069      23
cdefab    (99*1 + 100*2 + 101*3 + 102*4 + 97*5 + 98*6) % 2069      14
defabc    (100*1 + 101*2 + 102*3 + 97*4 + 98*5 + 99*6) % 2069      11
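
The position-weighted hash can be sketched in C++ as below. The function name positionalHash is an assumption introduced for this example. Positions are counted from 1, matching the table.

```cpp
#include <cassert>
#include <cstddef>
#include <string>

// Position-weighted hash: sum of (character code * 1-based position),
// reduced modulo the table size. positionalHash is a hypothetical name.
int positionalHash(const std::string& s, int table_size = 2069) {
    assert(table_size > 0);
    long long sum = 0;
    for (std::size_t i = 0; i < s.size(); ++i) {
        const long long code = static_cast<unsigned char>(s[i]);
        sum += code * static_cast<long long>(i + 1);
    }
    return static_cast<int>(sum % table_size);
}
```

For "abcdef" the weighted sum is 2107, and 2107 % 2069 = 38. Since the weight depends on position, the four permutations now map to four different indices.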

4. Hash table

A hash table is a data structure that is used to store key/value pairs. It uses a hash function
to compute an index into an array at which an element will be inserted or searched for. With a
good hash function, hashing works well: under reasonable assumptions, the average time
required to search for an element in a hash table is O(1).
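
As a rough illustration of such a structure, the sketch below stores string keys with integer values and handles collisions by keeping a list in each array slot (separate chaining, one possible collision resolution technique). The class name ChainedHashTable, its methods, the default of 16 buckets, and the use of std::hash are all assumptions made for this example, not something these notes prescribe.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <list>
#include <string>
#include <utility>
#include <vector>

// Minimal string -> int hash table using separate chaining (illustrative only).
class ChainedHashTable {
public:
    explicit ChainedHashTable(std::size_t bucket_count = 16)
        : buckets_(bucket_count) {
        assert(bucket_count > 0);
    }

    // Insert a new pair, or overwrite the value if the key already exists.
    void insert(const std::string& key, int value) {
        auto& bucket = buckets_[indexFor(key)];
        for (auto& entry : bucket) {
            if (entry.first == key) {
                entry.second = value;
                return;
            }
        }
        bucket.emplace_back(key, value);
    }

    // Return a pointer to the stored value, or nullptr if the key is absent.
    const int* find(const std::string& key) const {
        const auto& bucket = buckets_[indexFor(key)];
        for (const auto& entry : bucket) {
            if (entry.first == key) {
                return &entry.second;
            }
        }
        return nullptr;
    }

private:
    // hash % array_size, as in the two-step scheme of section 1.
    std::size_t indexFor(const std::string& key) const {
        return std::hash<std::string>{}(key) % buckets_.size();
    }

    std::vector<std::list<std::pair<std::string, int>>> buckets_;
};
```

With a well-spread hash, each list stays short, so insert and find touch only a few entries on average. If every key hashed to the same slot, both operations would degrade to a linear scan of one list.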
Let us consider a string S. You are required to count the frequency of all the characters in
this string.
string S = "ababcd"
The simplest way to do this is to iterate over all the possible characters and count their
frequency one by one. The time complexity of this approach is O(26*N), where N is the size
of the string and there are 26 possible characters.
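#include <iostream>
#include <string>
using namespace std;

// Brute force: for each letter, scan the whole string. O(26*N).
void countFre(string S)
{
    for(char c = 'a'; c <= 'z'; ++c)
    {
        int frequency = 0;
        for(size_t i = 0; i < S.length(); ++i)
            if(S[i] == c)
                frequency++;
        cout << c << ' ' << frequency << endl;
    }
}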
Output
a 2
b 2
c 1
d 1
e 0
f 0
…
z 0
Let us apply hashing to this problem. Take an array Frequency of size 26 and map each of the
26 characters to an index of the array by using the hash function. Then, iterate over the string
and increase the value in Frequency at the corresponding index for each character. The
complexity of this approach is O(N), where N is the size of the string.
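#include <iostream>
#include <string>
using namespace std;

int Frequency[26];   // global array, so it starts zero-initialized

// Maps 'a'..'z' to 0..25; assumes the input contains only lowercase letters.
int hashFunc(char c)
{
    return (c - 'a');
}

void countFre(string S)
{
    // One pass over the string: O(N).
    for(size_t i = 0; i < S.length(); ++i)
    {
        int index = hashFunc(S[i]);
        Frequency[index]++;
    }
    for(int i = 0; i < 26; ++i)
        cout << (char)(i + 'a') << ' ' << Frequency[i] << endl;
}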
Output
a 2
b 2
c 1
d 1
e 0
f 0
…
z 0
