Huffman Coding

DSA Huffman

Uploaded by

rudrakashyap.0044


Huffman Coding

Huffman coding is a lossless data compression algorithm. The idea is to assign variable-length
codes to input characters, with the length of each code based on the frequency of the
corresponding character. The most frequent character gets the shortest code and the least
frequent character gets the longest code.

The variable-length codes assigned to input characters are prefix codes: the codes (bit
sequences) are assigned in such a way that the code assigned to one character is never a prefix of
the code assigned to any other character. This is how Huffman coding makes sure that there is no
ambiguity when decoding the generated bitstream.

Let us understand prefix codes with a counterexample. Let there be four characters a, b, c and d,
with corresponding variable-length codes 00, 01, 0 and 1. This coding leads to ambiguity
because the code assigned to c is a prefix of the codes assigned to a and b. If the compressed bit
stream is 0001, the decompressed output may be "cccd", "ccb", "cad", "acd" or "ab".

Huffman coding has two major parts:

1. Build a Huffman Tree from input characters.

2. Traverse the Huffman Tree and assign codes to characters.

Steps to build Huffman Tree

The input is an array of unique characters along with their frequencies of occurrence, and the
output is the Huffman tree.

1. Create a leaf node for each unique character and build a min heap of all leaf nodes. (The min
heap is used as a priority queue. The value of the frequency field is used to compare two
nodes in the min heap. Initially, the least frequent character is at the root.)

2. Extract the two nodes with the minimum frequency from the min heap.

3. Create a new internal node with a frequency equal to the sum of the two nodes'
frequencies. Make the first extracted node its left child and the other extracted node
its right child. Add this node to the min heap.

4. Repeat steps 2 and 3 until the heap contains only one node. The remaining node is the
root node and the tree is complete.

Let us understand the algorithm with an example:

character        Frequency
a                5
b                9
c                12
d                13
e                16
f                45

Step 1: Build a min heap that contains 6 nodes, where each node represents the root of a tree
with a single node.

Step 2: Extract the two minimum-frequency nodes from the min heap. Add a new internal node with
frequency 5 + 9 = 14.

Now the min heap contains 5 nodes, where 4 nodes are roots of trees with a single element each,
and one heap node is the root of a tree with 3 elements.

character        Frequency
c                12
d                13
Internal Node    14
e                16
f                45

Step 3: Extract the two minimum-frequency nodes from the heap. Add a new internal node with
frequency 12 + 13 = 25.

Now the min heap contains 4 nodes, where 2 nodes are roots of trees with a single element each,
and 2 heap nodes are roots of trees with more than one node.

character        Frequency
Internal Node    14
e                16
Internal Node    25
f                45

Step 4: Extract the two minimum-frequency nodes. Add a new internal node with frequency
14 + 16 = 30.

Now the min heap contains 3 nodes.

character        Frequency
Internal Node    25
Internal Node    30
f                45

Step 5: Extract the two minimum-frequency nodes. Add a new internal node with frequency
25 + 30 = 55.

Now the min heap contains 2 nodes.

character        Frequency
f                45
Internal Node    55

Step 6: Extract the two minimum-frequency nodes. Add a new internal node with frequency
45 + 55 = 100.

Now the min heap contains only one node.

character        Frequency
Internal Node    100

Since the heap contains only one node, the algorithm stops here.

Steps to print codes from Huffman Tree:

Traverse the tree formed starting from the root. Maintain an auxiliary array. While moving to the
left child, write 0 to the array. While moving to the right child, write 1 to the array. Print the
array when a leaf node is encountered.

The codes are as follows:

character        code-word
f                0
c                100
d                101
a                1100
b                1101
e                111

Huffman Coding using Priority Queue

Prerequisites: Greedy Algorithms | Set 3 (Huffman Coding), and priority_queue::push() and
priority_queue::pop() in the C++ STL.

Given a char array ch[] and the frequency of each character in freq[], the task
is to find the Huffman codes for every character in ch[] using a priority queue.

Example
Input: ch[] = { 'a', 'b', 'c', 'd', 'e', 'f' }, freq[] = { 5, 9, 12, 13, 16, 45 }
Output:
f 0
c 100
d 101
a 1100
b 1101
e 111

Approach:
1. Push every character in ch[], paired with its frequency from freq[], into the priority queue.
2. To build the Huffman tree, pop the two lowest-frequency nodes from the priority queue.
3. Make the two popped nodes the left and right children of a new node whose frequency is
their sum.
4. Push the new node into the priority queue.
5. Repeat steps 2 to 4 until the size of the priority queue becomes 1.
6. Traverse the Huffman tree (whose root is the only node left in the priority
queue) to store the Huffman codes.
7. Print the stored Huffman code for every character in ch[].

Below is the implementation of the above approach:

// C++ program for Huffman coding using a priority queue
#include <iostream>
#include <queue>
#include <vector>
using namespace std;

// Upper bound on the height of the Huffman tree (the longest possible code)
#define MAX_SIZE 100

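class HuffmanTreeNode {
public:
    char data;               // Character stored in this node ('$' for internal nodes)
    int freq;                // Frequency of the character, or of the whole subtree
    HuffmanTreeNode* left;   // Left child
    HuffmanTreeNode* right;  // Right child

    // Initialize the node with no children
    HuffmanTreeNode(char character, int frequency)
    {
        data = character;
        freq = frequency;
        left = right = nullptr;
    }
};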

// Custom comparator: makes the priority queue behave as a min heap on frequency
class Compare {
public:
    bool operator()(HuffmanTreeNode* a, HuffmanTreeNode* b) const
    {
        // The node with the lower frequency gets the higher priority
        return a->freq > b->freq;
    }
};

// Function to generate the Huffman encoding tree.
// The queue is taken by value, so the caller's queue is left unchanged.
HuffmanTreeNode* generateTree(priority_queue<HuffmanTreeNode*,
                                             vector<HuffmanTreeNode*>, Compare> pq)
{
    // An empty queue has no tree to build
    if (pq.empty())
        return nullptr;

    // Keep merging until only one node remains in the priority queue
    while (pq.size() > 1) {
        // Remove the node with the lowest frequency
        HuffmanTreeNode* left = pq.top();
        pq.pop();

        // Remove the node with the next lowest frequency
        HuffmanTreeNode* right = pq.top();
        pq.pop();

        // Form a new internal node with frequency left->freq + right->freq.
        // Its data is '$' because only the frequency matters for internal nodes.
        HuffmanTreeNode* node = new HuffmanTreeNode('$', left->freq + right->freq);
        node->left = left;
        node->right = right;

        // Push the new node back into the priority queue
        pq.push(node);
    }
    return pq.top();
}

// Function to print the Huffman code for each character.
// arr holds the bits on the path from the root; top is the current depth.
void printCodes(HuffmanTreeNode* root, int arr[], int top)
{
    // Nothing to print for an empty tree
    if (!root)
        return;

    // Assign 0 to the left edge and recur
    if (root->left) {
        arr[top] = 0;
        printCodes(root->left, arr, top + 1);
    }

    // Assign 1 to the right edge and recur
    if (root->right) {
        arr[top] = 1;
        printCodes(root->right, arr, top + 1);
    }

    // If this is a leaf node, print its character followed by its code from arr
    if (!root->left && !root->right) {
        cout << root->data << " ";
        for (int i = 0; i < top; i++) {
            cout << arr[i];
        }
        cout << endl;
    }
}

void HuffmanCodes(char data[], int freq[], int size)
{
    // Declare a priority queue using the custom comparator
    priority_queue<HuffmanTreeNode*, vector<HuffmanTreeNode*>, Compare> pq;

    // Populate the priority queue with one leaf node per character
    for (int i = 0; i < size; i++) {
        HuffmanTreeNode* newNode = new HuffmanTreeNode(data[i], freq[i]);
        pq.push(newNode);
    }

    // Generate the Huffman encoding tree and get the root node
    HuffmanTreeNode* root = generateTree(pq);

    // Print the Huffman codes.
    // Note: the tree nodes are never freed, which is tolerable only in a short demo.
    int arr[MAX_SIZE], top = 0;
    printCodes(root, arr, top);
}

// Driver code
int main()
{
    char data[] = { 'a', 'b', 'c', 'd', 'e', 'f' };
    int freq[] = { 5, 9, 12, 13, 16, 45 };
    int size = sizeof(data) / sizeof(data[0]);

    HuffmanCodes(data, freq, size);

    return 0;
}

Output:
f 0
c 100
d 101
a 1100
b 1101
e 111
Time Complexity: O(n log n), where n is the number of unique characters. Each of the n - 1
merges performs a constant number of O(log n) heap operations.
Auxiliary Space: O(n)
