0% found this document useful (0 votes)

70 views39 pages

210 Maps PDF

The document discusses maps and hash tables. It defines a map as an abstract data structure that stores key-value pairs without duplicate keys. Hash tables are discussed as an implementation of maps that can provide O(1) time complexity for operations by using a hash function to map keys to bucket locations in an array. Collisions can occur if multiple keys hash to the same location, but a good hash function aims to uniformly distribute keys across buckets to minimize collisions.

Uploaded by

HaroonRashid

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

70 views39 pages

210 Maps PDF

Uploaded by

HaroonRashid

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 39

csci 210: Data Structures

Maps and Hash Tables

Summary
• Topics
• the Map ADT
• Map vs Dictionary
• implementation of Map: hash tables

• READING:
• GT textbook chapter 9.1 and 9.2
Map ADT
• A Map is an abstract data structure similar to a Dictionary
• it stores key-value (k,v) pairs
• there cannot be duplicate keys

• Maps are useful in situations where a key can be viewed as a unique identifier for the object
• the key is used to decide where to store the object in the structure
• in other words, the key associated with an object can be viewed as the address for the
object
• maps are sometimes called associative arrays

Map ADT

• size()
• isEmpty()
• get(k):
• if M contains an entry with key k, return it; else return null
• put(k,v):
• if M does not have an entry with key k, add entry (k,v) and return null
• else replace existing value of entry with v and return the old value
• remove(k):
• remove entry (k,*) from M
Map example
(k,v) key=integer, value=letter

M={}

• put(5,A) M={(5,A)}

• put(7,B) M={(5,A), (7,B)}

• put(2,C) M={(5,A), (7,B), (2,C)}

• put(8,D) M={(5,A), (7,B), (2,C), (8,D)}

• put(2,E) M={(5,A), (7,B), (2,E), (8,D)}

• get(7) return B

• get(4) return null

• get(2) return E

• remove(5) M={(7,B), (2,E), (8,D)}

• remove(2) M={(7,B), (8,D)}

• get(2) return null

Example
• Implement a language dictionary with a map
• key = word
• value = definition of word
• get(word)
• returns the definition if the word is in dictionary
• returns null if the word is not in dictionary

• Note: Maps provide an alternative approach to searching

Maps vs Trees
• How are Maps different than Search Trees? BST:
data = <key, ...>
• Binary search trees also associate keys with values for any node u: BST property
• In the data of each BST node there exists a field designated as the key
u
• the BST is ordered by this key
• e.g: a BST of student records
• data = student record
<key, ...>
• key = student ID
• search/insert/delete by student ID are efficient

• Binary trees also support Insert, Delete, Search

• and others
• O(n) worst-case time
• O(lg n) if the tree is balanced

all keys are all keys are

<= u.getKey() <= u.getKey()
Binary Search Tree

• Note: Want to search/insert/delete efficiently by name ? student record

• need to build a BST with key=name <key=ID, ...>

• Want to search/insert/delete efficiently by age?

• need to build a BST with key=age

• Want to search/insert/delete efficiently by SSN? student record

<key=name, ...>
• need to build a BST with key=SSN
Dictionary ADT
• A generic data structure that supports {INSERT, DELETE, SEARCH} is called a DICTIONARY

• A Dictionary stores (k,v) key-value pairs called entries

• k is the key
• v is the value

• A Dictionary can have elements with same key

• Note: how does a BST with equal elements look like?

• A DICTIONARY usually keeps track of the order of the elements

• supports other operations like predecessor, successor, traverse--in-order
Java.util.Map
• check out the interface

• additional handy methods

• putAll
• entrySet
• containsValue
• containsKey

• Implementation?
Class-work
• Write a program that reads from the user the name of a text file, counts the word frequencies
of all words in the file, and outputs a list of words and their frequency.
• e.g. text file: article, poem, science, etc

• Questions:
• Think in terms of a Map data structure that associates keys to values.
• What will be your <key-value> pairs?

• Sketch the main loop of your program.

Map Implementations
• Linked-list

• Binary search trees

• Hash tables
A LinkedList implementation of Maps
• store the (k,v) pairs in a doubly linked list

• get(k)
• hop through the list until find the element with key k

• put(k,v)
• Node x = get(k)
• if (x != null)
• replace the value in x with v
• else create a new node(k,v) and add it at the front

• remove(k)
• Node x = get(k)
• if (x == null) return null
• else remove node x from the list
• Note: why doubly-linked? need to delete at an arbitrary position

• Analysis: O(n) on a map with n elements

Map Implementations
• Linked-list:
• get/search, put/insert, remove/delete: O(n)

• Binary search trees

• search, insert, delete: O(n) if not balanced
• O(lg n) if balanced BST

• A new approach
• Hash tables:
• we’ll see that (under some assumptions) search, insert, delete: O(1)
Hashing
• A completely different approach to searching from the comparison-based methods (binary
search, binary search trees)

• rather than navigating through a dictionary data structure comparing the search key with
the elements, hashing tries to reference an element in a table directly based on its key

• hashing transforms a key into a table address

Hashing
• If the keys were integers in the range 0 to 99
• The simplest idea:
• store keys in an array H[0..99]
• H initially empty

direct addressing:
x x x x x ...
store key k at index k
(0,v) x x (3,v) (4,v) ...

• put(k, value)
• store <k, value> in H[k] issues:
- keys need to be integers in a small range
• get(k)
- space may be wasted is H not full
• check if H[K] is empty
Hashing
• Hashing has 2 components
• the hash table: an array A of size N
• each entry is thought of a bucket: a bucket array
• a hash function: maps each key to a bucket
• h is a function : {all possible keys} ----> {0, 1, 2, ..., N-1}
• key k is stored in bucket h(k)

0 1 2 3 4 5 6 8

A ...

bucket i stores all keys with h(k) =i

• The size of the table N and the hash function are decided by the user
• Goal: chose a hash function that distributes keys uniformly throughout the table
Example
• keys: integers
• chose N = 10
• chose h(k) = k % 10
• [ k % 10 is the remainder of k/10 ]

0 1 2 3 4 5 6 7 8 9

• add (2,), (13,), (15,), (88,), (2345,), (100,)

• Collision: two keys that hash to the same value

• e.g. 15, 2345 hash to slot 5

• Note: if we were using direct addressing: N = 2^32. Unfeasible.

Hashing
• h : {universe of all possible keys} ----> {0,1,2,...,N-1}
• The keys need not be integers
• e.g. strings
• define a hash function that maps strings to integers
• The universe of all possible keys need not be small
• e.g. strings

• Hashing is an example of space-time trade-off:

• if there were no memory(space) limitation, simply store a huge table
• O(1) search/insert/delete
• if there were no time limitation, use a linked list and search sequentially

• Hashing: use a reasonable amount of memory and strike a balance space-time

• adjust hash table size

• Under some assumptions, hashing supports insert, delete and search in in O(1) time
Hashing
• Notation:
• U = universe of keys
• N = hash table size
• n = number of entries
• note: n may be unknown beforehand

• Goal of a hash function: called “universal hashing”

• the probability of any two keys hashing to the same slot is 1/N

• Essentially this means that the hash function throws the keys uniformly at random into the
table

• If a hash function satisfies the universal hashing property, then the expected number of
elements that hash to the same entry is n/N

• if n < N : O(1) elements per entry

• if n >= N: O(n/N) elements per entry
Hashing
• Chosing h and N

• Goal: distribute the keys

• n is usually unknown

• If n > N, then the best one can hope for is that each bucket has O(n/N) elements
• need a good hash function
• search, insert, delete in O(n/N) time

• If n <= N, then the best one can hope for is that each bucket has O(1) elements
• need a good hash function
• search, insert, delete in O(1) time

• If N is large==> less collisions and easier for the hash function to perform well
• Best: if you can guess n beforehand, chose N order of n
• no space waste
Hash functions
• How to define a good hash function?

• An ideal has function approximates a random function: for each input element, every output
should be in some sense equally likely

• In general impossible to guarantee

• Every hash function has a worst-case scenario where all elements map to the same entry

• Hashing = transforming a key to an integer

• There exists a set of good heuristics
Hashing strategies
• Casting to an integer
• if keys are short/int/char:
• h(k) = (int) k;
• if keys are float
• convert the binary representation of k to an integer
• in Java: h(k) = Float.floatToIntBits(k)
• if keys are long long
• h(k) = (int) k
• lose half of the bits

• Rule of thumb: want to use all bits of k when deciding the hash code of k
• better chances of hash spreading the keys
Hashing strategies
• Summing components
• let the binary representation of key k = <x0,x1,x2,...,xk-1>

• use all bits of k when computing the hash code of k

• sum the high-order bits with the low-order bits

• (int) <x0,x1,x2,.x31> + (int)<x32,.,xk-1>

• e.g. String s;
• sum the integer representation of each character
• (int)s[0] + (int)s[1] + (int) s[2] + ...
Hashing strategies
• summation is not a good choice for strings/character arrays
• e.g. s1 = “temp10” and s2 = “temp01” collide
• e.g. “stop”, “tops”, “pots”, “spot” collide

• Polynomial hash codes

• k = <x0,x1,x2,...,xk-1>
• take into consideration the position of x[i]

• chose a number a >0 (a !=1)

• h(k) = x0ak-1 + x1ak-2 + ...+xk-2a + xk-1

• experimentally, a = 33, 37, 39, 41 are good choices when working with English words
• produce less than 7 collision for 50,000 words!!!
• Java hashCode for Strings uses one of these constants
Hashing strategies
• Need to take into account the size of the table
• Modular hashing
• h(k) = i mod N

• If take N to be a prime number, this helps the spread out the hashed values

• If N is not prime, there is a higher likelihood that patterns in the distribution of the input
keys will be repeated in the distribution of the hash values

• e.g. keys = {200, 205, 210, 215, 220, ... 600}

• N = 100
• each hash code will collide with 3 others
• N = 101
• no collisions
Hashing strategies
• Combine modular and multiplicative:
• h(k) = a k % N
• chose a = random value in [0,1]
• advantage: the value of N is not critical and need not be prime

• empirically:
• a popular choice is a = 0.618033 (the golden ratio)
• chose N = power of 2
Hashing strategies
• If keys are not integers
• transform the key piece by piece into an integer
• need to deal with large values

• e.g. key = string

• h(k) = (s0ak-1 + s1ak-2 + ...+sk-2a + sk-1) %N
• for e.g. a = 33
• Horner’s method: h(k) = ((((s0a + s1)* a + s2)*a + s3)*a + ....)*a + sk-1

int hash (char[] v, int N) {

int h = 0, a = 33;
for (int i=0; i< v.length; i++)
h = (a *h + v[i])
return h % N;
} int hash (char[] v, int N) {
int h = 0, a = 33;
for (int i=0; i< v.length; i++)
• the sum may produce a number than we can
h = (a *h + v[i]) %N
represent as an integer
• take %N after every multiplication return h;
}
Hashing strategies
• Universal hashing

• chose N prime
• chose p a prime number larger than N
• chose a, b at random from {0,1,...p-1}

• h(k) = ((a k + b) mod p) mod N

• gets very close to having two keys collide with probability 1/N
• i.e. to throwing the keys into the hash table randomly

• Many other variations of these have been studied, particularly has functions that can be
implemented with efficient machine instructions such as shifting
Hashing
• Hashing
1. hash function
convert keys into table addresses

2. collision handling
• Collision: two keys that hash to the same value
Decide how to handle when two kets hash to the same address

• Note: if n > N there must be collisions

• Collision with chaining

• bucket arrays

• Collision with probing

• linear probing
• quadratic probing
• double hashing
Collisions with chaining
0 1 2 3 4 5 6 8

A ...

bucket i stores all keys with h(k) =i

• Store all elements that hash to the same entry in a linked list (array/vector)
• Can chose to store the lists in sorted order or not

• Insert(k)
• insert k in the linked list of h(k) under universal hashing:
• Search(k) each list has size O(n/N) with high probability
• search in the linked list of h(k) insert, delete, search: O(n/N)
• Delete(k)
• find and delete k from the linked list of h(k)
Collisions with chaining
0 1 2 3 4 5 6 8

A ...

bucket i stores all keys with h(k) =i

• Pros:
• can handle arbitrary number of collisions as there is no cap on the list size
• don’t need to guess n ahead: if N is smaller than n, the elements will be chained

• Cons: space waste

• use additional space in addition to the hash table
• if N is too large compared to n, part of the hash table may be empty

• Choosing N: space-time tradeoff

• Rule of thumb:
• chose N 1/5 to 1/10 of the number of keys that we expect in the table, so that keys are
expected to have about 10 elements each. Keep lists unsorted.
Collisions with probing
• Idea: do not use extra space, use only the hash table

• Idea: when inserting key k, if slot h(k) is full, then try some other slots in the table until
finding one that is empty
• the set of slots tried for key k is called the probing sequence of k

• Linear probing:
• if slot h(k) is full, try next, try next, ...
• probing sequence: h(k), h(k) + 1, h(k) + 2, ...

• insert(k)
• search(k)
• delete(k)

• Example: N = 10, h(k) = k % 10, collisions with linear probing

• insert 1, 7, 4, 13, 23, 25, 25
Linear probing
• Notation: alpha = n/N (load factor of the hash table)

• In general performance of probing degrades inversely proportional with the load of the hash
• for a sparse table (small alpha) we expect most searches to find an empty position within a few
probes
• for a nearly full table (alpha close to 1) a search could require a large number of probess

• Proposition:
• Under certain randomness assumption it can be shown that the average number of probes
examined when searching for key k in a hash table with linear probing is 1/2 (1 + 1/(1 - alpha))
• [No proof]
• alpha = 0: 1 probe
• alpha = 1/2: 1.5 probes (half-full)
• alpha= 2/3: 2 probes (2/3 full)
• alpha = 9/10: 5.5 probes

• Collisions with probing: cannot insert more than N items in the table
• need to guess n ahead
• if at any point n is > N, need to re-allocate a new hash table, and re-hash everything. Expensive!
Linear probing
• Pros:
• space efficiency

• Con:
• need to guess n correctly and set N > n
• if alpha gets large ==> high penalty
• the table is resized and and all objects re-inserted into the new table
• Rule of thumb: good performance with probing if alpha stays less than 2/3.
Double hashing
• Empirically linear hashing introduces a phenomenon called clustering:
• insertion of one key can increase the time for other keys with other hash values
• groups of keys clustered together in the table

• Double hashing:
• instead of examining every successive position, use a second hash function to get a fixed
increment
• probing sequence: h1(k), h1(k) + h2(k), h1(k) + 2h2(k), h1(k) + 3h2(k),...

• chose h2 so that it never evaluates to 0 for any key

• would give an infinite loop on first collision
• Rule of thumb:
• chose h2(k) relatively prime to N

• Performance:
• double hashing and linear hashing have the same performance for sparse tables
• empirically double hashing eliminates clustering
• we can allow the table to become more full with double hashing than with linear hashing
before performance degrades
Java.util.Hashtable
• This class implements a hash table, which maps keys to values. Any non-null object can be
used as a key or as a value.

• java.lang.Object
• java.util.Dictionary
• java.util.Hashtable

• implements Map

• [check out Java docs]

• implements a Map with linear probing; uses .75 as maximal load factor, and rehashes every
time the table gets fuller

• Example

//create a hashtable of <key=string, value=number> pairs

Hashtable numbers = new Hashtable();
numbers.put("one", new Integer(1));
numbers.put("two", new Integer(2));
numbers.put("three", new Integer(3));

//retrieve a string
Integer n = (Integer)numbers.get("two");
if (n != null) {
System.out.println("two = " + n);
}
Hash functions in Java

• The generic Object class comes with a default hashCode() method that maps an Object to an
integer
• int hashCode()
• Inherited by every Object
• The default hashCode() returns the address of the Object’s location in memory
• too generic
• poor choice for most situations
• Typically you want to override it
• e.g. class String
• overrides Strng.hashCode() with a hash function that works well on Strings
Perspective
• Best hashing method depends on application

• Probing is the method of choice if n can be guessed

• Linear probing is fastest if table is sparse
• Double hashing makes most efficient use of memory as it allows the table to become
more full, but requires extra time to to compute a second hash function
• rule of thumb: load factor < .66
• Chaining is easiest to implement and does not need guessing n
• rule of thumb: load factor < .9 for O(1) performance, but not vital

• Hashing can provide better performance than binary search trees if the keys are sufficiently
random so that a good hash function can be developed
• when hashing works, better use hashing than BST
• However
• Hashing does not guarantee worst-case performance
• Binary search trees support a wider range of operations
Exercises
• What is the worst-case running time for inserting n key-value pairs into an initially empty map
that is implemented with a list?

• Describe how to use a map to implement the basic ops in a dictionary ADT, assuming that the
user does not attempt to insert entries with the same key

• Describe how an ordered list implemented as a doubly linked list could be used to implement
the map ADT.

• Draw the 11-entry hash that results from using the hash function h(i) = (2i+5) mod 11 to hash
keys 12, 44, 13, 88, 23, 94, 11, 39, 20, 16, 5.
• (a) Assume collisions are handled by chaining.
• (b) Assume collisions are handled by linear probing.
• (c) Assume collisions are handled with double hashing, with the secondary hash function
h’(k) = 7 - (k mod 7).
• Show the result of rehashing this table in a table of size 19, using teh new hasah function h(k)
= 2k mod 19.

• Think of a reason that you would not use a hash table to implement a dictionary.

Ai Fundamentals Source Quizzes
100% (1)
Ai Fundamentals Source Quizzes
109 pages
DSA - Unit 1
No ratings yet
DSA - Unit 1
43 pages
11 Hash Tables Slides
No ratings yet
11 Hash Tables Slides
34 pages
Module 5 Graphs
No ratings yet
Module 5 Graphs
35 pages
Hashing Unit 1
No ratings yet
Hashing Unit 1
91 pages
Deep Learning Computer Vision NLP
No ratings yet
Deep Learning Computer Vision NLP
140 pages
Hashing Powerpoint
No ratings yet
Hashing Powerpoint
58 pages
Hashing
No ratings yet
Hashing
96 pages
14 Hashing
No ratings yet
14 Hashing
61 pages
Lec12 Hash Tables 09092024 090609pm
No ratings yet
Lec12 Hash Tables 09092024 090609pm
48 pages
Maps and Hashing - Final
No ratings yet
Maps and Hashing - Final
51 pages
Hash Table
No ratings yet
Hash Table
68 pages
L15 Maps and Hashes
No ratings yet
L15 Maps and Hashes
41 pages
MODULE 5 - BCS304 - HASHING - Leftisht Trees - OBST - Notes
No ratings yet
MODULE 5 - BCS304 - HASHING - Leftisht Trees - OBST - Notes
32 pages
Practice: Word Problems: A Plan For Problem Solving
No ratings yet
Practice: Word Problems: A Plan For Problem Solving
92 pages
Cse373 10 Hashing
No ratings yet
Cse373 10 Hashing
36 pages
Lecture 8 Hashing
No ratings yet
Lecture 8 Hashing
47 pages
DSAL Lab Manual
No ratings yet
DSAL Lab Manual
61 pages
Mic Solved Manual 1
No ratings yet
Mic Solved Manual 1
29 pages
CH8 Hashing
No ratings yet
CH8 Hashing
110 pages
Lecture 13 - Hash Tables
No ratings yet
Lecture 13 - Hash Tables
51 pages
Week 3
No ratings yet
Week 3
29 pages
Lecture 6 - Searching
No ratings yet
Lecture 6 - Searching
41 pages
Module-4 Dictionaries and Hash Tables
No ratings yet
Module-4 Dictionaries and Hash Tables
31 pages
Hashing
No ratings yet
Hashing
22 pages
ADS Unit 3
No ratings yet
ADS Unit 3
14 pages
Basis 3
No ratings yet
Basis 3
122 pages
9.map 1 HashTable
No ratings yet
9.map 1 HashTable
31 pages
Hassing Dsa
No ratings yet
Hassing Dsa
28 pages
CH 4
No ratings yet
CH 4
58 pages
Examples: Finite Automata With Output
No ratings yet
Examples: Finite Automata With Output
6 pages
ANN Theory
No ratings yet
ANN Theory
49 pages
09 Hashtable
No ratings yet
09 Hashtable
53 pages
351 - 27435 - EE419 - 2020 - 1 - 2 - 1 - 14 EE419 Lec14 Jury Stability
No ratings yet
351 - 27435 - EE419 - 2020 - 1 - 2 - 1 - 14 EE419 Lec14 Jury Stability
39 pages
Hashing and Indexing
No ratings yet
Hashing and Indexing
28 pages
02 - Indices
No ratings yet
02 - Indices
208 pages
Hash Tables
No ratings yet
Hash Tables
20 pages
Frequent Pattern Analysis-Arpriori
No ratings yet
Frequent Pattern Analysis-Arpriori
27 pages
DS Unit-Ii
No ratings yet
DS Unit-Ii
12 pages
L08 Hierachical Agglomerative Clustering
No ratings yet
L08 Hierachical Agglomerative Clustering
41 pages
Hashing PPT For Student
No ratings yet
Hashing PPT For Student
53 pages
CS 03
No ratings yet
CS 03
22 pages
Dsa 240404 220052
No ratings yet
Dsa 240404 220052
9 pages
Ads-Unit I
No ratings yet
Ads-Unit I
16 pages
Week 9 - Hash Functions and Collision
No ratings yet
Week 9 - Hash Functions and Collision
73 pages
Week7 PDF
No ratings yet
Week7 PDF
18 pages
CS230: Lecture 9 Deep Reinforcement Learning: Kian Katanforoosh Menti Code: 80 24 08
No ratings yet
CS230: Lecture 9 Deep Reinforcement Learning: Kian Katanforoosh Menti Code: 80 24 08
44 pages
Problem 1. A conducting slab: I ikz−ωt
No ratings yet
Problem 1. A conducting slab: I ikz−ωt
38 pages
Hashing ClassNotes
No ratings yet
Hashing ClassNotes
8 pages
Hashing Reading
No ratings yet
Hashing Reading
10 pages
Unit 6 Vectors
No ratings yet
Unit 6 Vectors
31 pages
03 InformedHeuristicSearch
No ratings yet
03 InformedHeuristicSearch
80 pages
The Divisibility & Modular Arithmetic: Selected Exercises: Goal
No ratings yet
The Divisibility & Modular Arithmetic: Selected Exercises: Goal
14 pages
Partial Differential Equation
0% (1)
Partial Differential Equation
18 pages
Hash
No ratings yet
Hash
5 pages
Hashing
No ratings yet
Hashing
44 pages
Text Classification With Switch Transformer - 1716327819025
No ratings yet
Text Classification With Switch Transformer - 1716327819025
5 pages
Hash Table PDF
No ratings yet
Hash Table PDF
25 pages
EEM 306 Introduction To Communications: Department of Electrical and Electronics Engineering Anadolu University
No ratings yet
EEM 306 Introduction To Communications: Department of Electrical and Electronics Engineering Anadolu University
22 pages
Dsa 4
No ratings yet
Dsa 4
55 pages
Linear Programming: Brewer's Problem Simplex Algorithm Implementation Linear Programming
No ratings yet
Linear Programming: Brewer's Problem Simplex Algorithm Implementation Linear Programming
49 pages
Maps and Dictionary: Data Structures and Algorithms
No ratings yet
Maps and Dictionary: Data Structures and Algorithms
50 pages
EEL4410 Ch7 Online+videos p4 PDF
No ratings yet
EEL4410 Ch7 Online+videos p4 PDF
16 pages
12 Revenue Management PDF
No ratings yet
12 Revenue Management PDF
15 pages
NUMERICALS
No ratings yet
NUMERICALS
6 pages
Hashing in Data Structure
No ratings yet
Hashing in Data Structure
43 pages
20A04502T Digital Signal Processing
No ratings yet
20A04502T Digital Signal Processing
2 pages
Linear-Algebra With Python
100% (2)
Linear-Algebra With Python
26 pages
Unit #5 - Implicit Differentiation, Related Rates
No ratings yet
Unit #5 - Implicit Differentiation, Related Rates
14 pages
Mock End Sem 2024-2025 NMCP
No ratings yet
Mock End Sem 2024-2025 NMCP
2 pages
Implementation and Analysis of Different Digit Recognition Methods On Reduced MNIST Dataset
No ratings yet
Implementation and Analysis of Different Digit Recognition Methods On Reduced MNIST Dataset
10 pages
Eet202/3 Digital Electronics Ii Tutorial 4
No ratings yet
Eet202/3 Digital Electronics Ii Tutorial 4
13 pages
Winner Take All Autoencoders
No ratings yet
Winner Take All Autoencoders
11 pages
Maps
No ratings yet
Maps
36 pages
Rational Zero Theorem
No ratings yet
Rational Zero Theorem
9 pages
Indexing and Hashing: Solutions To Practice Exercises
No ratings yet
Indexing and Hashing: Solutions To Practice Exercises
11 pages
Design Analysis Algorithm Aug Sep2023
No ratings yet
Design Analysis Algorithm Aug Sep2023
2 pages
DSA Chapter 08 (Searching)
No ratings yet
DSA Chapter 08 (Searching)
65 pages
CS301 Lec41
No ratings yet
CS301 Lec41
18 pages
Unit 6: Stability of Linear Control System
No ratings yet
Unit 6: Stability of Linear Control System
14 pages
International Postgraduate Scholarship: Application Form & Guidelines
No ratings yet
International Postgraduate Scholarship: Application Form & Guidelines
8 pages
Chapter 28 Hashing: Hash Table. The Function That Maps A Key To An Index in The Hash Table Is
No ratings yet
Chapter 28 Hashing: Hash Table. The Function That Maps A Key To An Index in The Hash Table Is
4 pages
02 FermetureTransitive-EN
No ratings yet
02 FermetureTransitive-EN
2 pages
M (T) U (T) : Upper Side Band Lower Side Band Upper Side Band Lower Side Band
No ratings yet
M (T) U (T) : Upper Side Band Lower Side Band Upper Side Band Lower Side Band
5 pages
Operations Research: Transportation Problem Find Optimal Solution
No ratings yet
Operations Research: Transportation Problem Find Optimal Solution
13 pages
Computational Lab in Physics: Finding Roots of Nonlinear Functions
No ratings yet
Computational Lab in Physics: Finding Roots of Nonlinear Functions
12 pages
36 BST Remove Hashing
No ratings yet
36 BST Remove Hashing
7 pages
C++ Review (Ch. 1) Algorithm Analysis (Ch. 2) : Sets With Insert/delete/member: Hashing (Ch. 5)
No ratings yet
C++ Review (Ch. 1) Algorithm Analysis (Ch. 2) : Sets With Insert/delete/member: Hashing (Ch. 5)
42 pages
Sets Maps and Hash Tables Review
No ratings yet
Sets Maps and Hash Tables Review
3 pages
5772 Learning Stationary Time Series Using Gaussian Processes With Nonparametric Kernels
No ratings yet
5772 Learning Stationary Time Series Using Gaussian Processes With Nonparametric Kernels
9 pages
An Easy New Approach For Matrix Decomposition: Mohammed Hassan Elzubair
No ratings yet
An Easy New Approach For Matrix Decomposition: Mohammed Hassan Elzubair
3 pages
Data Structure MCQ
No ratings yet
Data Structure MCQ
50 pages
Hash Tables: Map Dictionary Key "Address."
No ratings yet
Hash Tables: Map Dictionary Key "Address."
16 pages
03-09-016 Related Rates
No ratings yet
03-09-016 Related Rates
2 pages
Exam 2-Study Comments
No ratings yet
Exam 2-Study Comments
2 pages
05 Hashing
No ratings yet
05 Hashing
47 pages
POAI MCQs
No ratings yet
POAI MCQs
40 pages
CS2040 Summary
No ratings yet
CS2040 Summary
16 pages
DSA MK Lect2 PDF
No ratings yet
DSA MK Lect2 PDF
92 pages
Chap-1 ADS
No ratings yet
Chap-1 ADS
5 pages
Hashing
No ratings yet
Hashing
7 pages
Hashing PDF
No ratings yet
Hashing PDF
65 pages
Lagrange Interpolation
No ratings yet
Lagrange Interpolation
7 pages
CSE 12 The Map Abstract Data Type
No ratings yet
CSE 12 The Map Abstract Data Type
25 pages
Hashing
From Everand
Hashing
Prakash Hegade
No ratings yet
The Tech Interview Playbook: From DSA to System Design
From Everand
The Tech Interview Playbook: From DSA to System Design
Chinmoy Mukherjee
No ratings yet

210 Maps PDF

Uploaded by

210 Maps PDF

Uploaded by

csci 210: Data Structures

Maps and Hash Tables

• put(7,B) M={(5,A), (7,B)}

• put(2,C) M={(5,A), (7,B), (2,C)}

• put(8,D) M={(5,A), (7,B), (2,C), (8,D)}

• put(2,E) M={(5,A), (7,B), (2,E), (8,D)}

• get(4) return null

• remove(5) M={(7,B), (2,E), (8,D)}

• remove(2) M={(7,B), (8,D)}

• get(2) return null

• Note: Maps provide an alternative approach to searching

• Binary trees also support Insert, Delete, Search

all keys are all keys are

• Note: Want to search/insert/delete efficiently by name ? student record

• Want to search/insert/delete efficiently by age?

• Want to search/insert/delete efficiently by SSN? student record

• A Dictionary stores (k,v) key-value pairs called entries

• A Dictionary can have elements with same key

• A DICTIONARY usually keeps track of the order of the elements

• additional handy methods

• Sketch the main loop of your program.

• Binary search trees

• Analysis: O(n) on a map with n elements

• Binary search trees

• hashing transforms a key into a table address

bucket i stores all keys with h(k) =i

• add (2,*), (13,*), (15,*), (88,*), (2345,*), (100,*)

• Collision: two keys that hash to the same value

• Note: if we were using direct addressing: N = 2^32. Unfeasible.

• Hashing is an example of space-time trade-off:

• Hashing: use a reasonable amount of memory and strike a balance space-time

• Goal of a hash function: called “universal hashing”

• if n < N : O(1) elements per entry

• Goal: distribute the keys

• In general impossible to guarantee

• Hashing = transforming a key to an integer

• use all bits of k when computing the hash code of k

• sum the high-order bits with the low-order bits

• (int) <x0,x1,x2,.x31> + (int)<x32,.,xk-1>

• Polynomial hash codes

• chose a number a >0 (a !=1)

• e.g. keys = {200, 205, 210, 215, 220, ... 600}

• e.g. key = string

int hash (char[] v, int N) {

• h(k) = ((a k + b) mod p) mod N

• Note: if n > N there must be collisions

• Collision with chaining

• Collision with probing

bucket i stores all keys with h(k) =i

bucket i stores all keys with h(k) =i

• Cons: space waste

• Choosing N: space-time tradeoff

• Example: N = 10, h(k) = k % 10, collisions with linear probing

• chose h2 so that it never evaluates to 0 for any key

• [check out Java docs]

//create a hashtable of <key=string, value=number> pairs

• Probing is the method of choice if n can be guessed

You might also like

• add (2,), (13,), (15,), (88,), (2345,), (100,)