
Basics of Compression

Dr. Sania Bhatti

Outline
- Need for compression and classification of compression algorithms
- Basic Coding Concepts
  - Fixed-length coding and variable-length coding
  - Compression Ratio
  - Entropy
- RLE Compression (Entropy Coding)
- Huffman Compression (Statistical Entropy Coding)


Need for Compression

- Uncompressed audio
  - 8 KHz, 8 bit: 8 KB per second, 30 MB per hour
  - 44.1 KHz, 16 bit: 88.2 KB per second, 317.5 MB per hour
  - a 100 GB disk holds about 315 hours of CD-quality music
- Uncompressed video
  - 640 x 480 resolution, 8 bit color, 24 fps: 7.37 MB per second, 26.5 GB per hour
  - 640 x 480 resolution, 24 bit (3 bytes) color, 30 fps: 27.6 MB per second, 99.5 GB per hour
  - a 100 GB disk holds about 1 hour of high-quality video
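These rates follow directly from sample rate x sample size (audio) and resolution x bytes per pixel x frame rate (video). A quick back-of-the-envelope check, as a sketch that assumes decimal megabytes and gigabytes, matching the figures above:

```python
# Rough sanity check of the uncompressed data rates quoted above.
MB = 10**6   # decimal megabytes, as used on the slide
GB = 10**9

# Audio: sample rate (Hz) x bytes per sample
print(8_000 * 1 / 1_000, "KB/s")                   # 8 kHz, 8-bit   -> 8 KB/s
print(44_100 * 2 / 1_000, "KB/s")                  # 44.1 kHz, 16-bit -> 88.2 KB/s

# Video: width x height x bytes per pixel x frames per second
print(640 * 480 * 1 * 24 / MB, "MB/s")             # 8-bit color, 24 fps  -> ~7.37 MB/s
print(640 * 480 * 3 * 30 / MB, "MB/s")             # 24-bit color, 30 fps -> ~27.6 MB/s
print(640 * 480 * 3 * 30 * 3600 / GB, "GB/hour")   # -> ~99.5 GB per hour
```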

Broad Classification
- Entropy Coding (statistical)
  - lossless; independent of data characteristics
  - e.g., RLE, Huffman, LZW, arithmetic coding
- Source Coding
  - lossy; may consider semantics of the data
  - depends on characteristics of the data
  - e.g., DCT, DPCM, ADPCM, color model transform
- Hybrid Coding (used by most multimedia systems)
  - combines entropy coding with source coding
  - e.g., JPEG-2000, H.264, MPEG-2, MPEG-4, MPEG-7


Data Compression
- A branch of information theory
  - minimize the amount of information to be transmitted
- Transform a sequence of characters into a new string of bits
  - same information content
  - length as short as possible

Concepts
- Coding (the code) maps source messages from an alphabet (A) into code words (B)
- A source message (symbol) is the basic unit into which a string is partitioned
  - can be a single letter or a string of letters
- EXAMPLE: aa bbb cccc ddddd eeeeee fffffffgggggggg
  - A = {a, b, c, d, e, f, g, space}
  - B = {0, 1}


Taxonomy of Codes
- Block-block
  - source messages and code words of fixed length; e.g., ASCII
- Block-variable
  - source messages fixed, code words variable; e.g., Huffman coding
- Variable-block
  - source messages variable, code words fixed; e.g., RLE
- Variable-variable
  - source messages variable, code words variable; e.g., arithmetic coding

Example of Block-Block
- Coding "aa bbb cccc ddddd eeeeee fffffffgggggggg"
- Requires 120 bits

  Symbol   Code word
  a        000
  b        001
  c        010
  d        011
  e        100
  f        101
  g        110
  space    111

Example of Variable-Variable
- Coding "aa bbb cccc ddddd eeeeee fffffffgggggggg"
- Requires 30 bits
  - don't forget the spaces

  Symbol     Code word
  aa         0
  bbb        1
  cccc       10
  ddddd      11
  eeeeee     100
  fffffff    101
  gggggggg   110
  space      111
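A sketch of the same message under this variable-variable code: each run of letters, and each space, is one source message, giving 12 codewords and 30 bits in total:

```python
# Variable-variable coding: each run of letters, and each space, gets one codeword.
codebook = {'aa': '0', 'bbb': '1', 'cccc': '10', 'ddddd': '11',
            'eeeeee': '100', 'fffffff': '101', 'gggggggg': '110', ' ': '111'}

runs = ['aa', ' ', 'bbb', ' ', 'cccc', ' ', 'ddddd', ' ',
        'eeeeee', ' ', 'fffffff', 'gggggggg']
encoded = ''.join(codebook[r] for r in runs)

print(len(encoded), "bits")   # 30 bits, versus 120 bits for the fixed-length code
```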

Concepts (cont.)
- A code is
  - distinct if each code word can be distinguished from every other (the mapping is one-to-one)
  - uniquely decodable if every code word is identifiable when immersed in a sequence of code words
- e.g., with the previous table, the message 11 could be decoded as either ddddd or bbbbbb, so that code is not uniquely decodable
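A small sketch that makes the ambiguity concrete: a brute-force parser (the `parses` helper is my own, not part of the slides) finds two valid decodings of the bit string 11 under the table above:

```python
# Brute-force search for every way to split a bit string into codewords.
codebook = {'aa': '0', 'bbb': '1', 'cccc': '10', 'ddddd': '11',
            'eeeeee': '100', 'fffffff': '101', 'gggggggg': '110', ' ': '111'}

def parses(bits):
    """Return every decomposition of `bits` into source messages."""
    if not bits:
        return [[]]
    results = []
    for symbol, code in codebook.items():
        if bits.startswith(code):
            results += [[symbol] + rest for rest in parses(bits[len(code):])]
    return results

print(parses('11'))   # [['bbb', 'bbb'], ['ddddd']] -> two readings, not uniquely decodable
```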


Static Codes
- Mapping is fixed before transmission
  - a message is represented by the same codeword every time it appears in the ensemble
  - Huffman coding is an example
- Better for independent sequences
  - probabilities of symbol occurrences must be known in advance

Dynamic Codes
- Mapping changes over time
  - also referred to as adaptive coding
- Attempts to exploit locality of reference
  - periodic, frequent occurrences of messages
  - dynamic Huffman coding is an example
- Hybrids?
  - build a set of codes, then select one based on the input


Traditional Evaluation Criteria
- Algorithm complexity
  - running time
- Amount of compression
  - redundancy
  - compression ratio
- How to measure?

Measure of Information
- Consider symbols si and the probability of occurrence of each symbol, p(si)
- With fixed-length coding, the smallest number of bits per symbol needed is
  - L ≥ log2(N) bits per symbol, where N is the number of distinct symbols
- Example: a message drawn from 5 symbols needs 3 bits per symbol (log2 5 ≈ 2.32, rounded up to 3)
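A quick check of this bound (a sketch): with fixed-length coding, the whole number of bits per symbol is log2(N) rounded up.

```python
import math

# Fixed-length coding: N distinct symbols need ceil(log2(N)) bits per symbol.
for n in (2, 5, 8, 256):
    print(n, "symbols ->", math.ceil(math.log2(n)), "bits per symbol")
# 5 symbols -> 3 bits, as in the example above
```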


Variable-Length Coding - Entropy
- What is the minimum number of bits per symbol?
- Answer: Shannon's result: the theoretical minimum average number of bits per code word is known as the entropy (H)

  H = - Σ p(si) log2 p(si),  summed over i = 1 to n
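A minimal Python sketch of this definition (the function name `entropy` is my own; the symbol probabilities are assumed to be given and to sum to 1):

```python
import math

def entropy(probabilities):
    """Shannon entropy H = -sum(p_i * log2(p_i)), in bits per symbol."""
    return -sum(p * math.log2(p) for p in probabilities if p > 0)
```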

Entropy Example
- Alphabet = {A, B}
- p(A) = 0.4; p(B) = 0.6
- Compute the entropy (H):
  - H = -0.4 * log2(0.4) - 0.6 * log2(0.6) ≈ 0.97 bits
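The same arithmetic as a one-line check:

```python
import math
print(round(-(0.4 * math.log2(0.4) + 0.6 * math.log2(0.6)), 2))   # 0.97 bits per symbol
```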


Entropy Example
- Calculate the entropy for an image with only two levels, 0 and 255, where P(0) = 0.5 and P(255) = 0.5
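For reference, the same formula applied to this case: two equally likely levels carry exactly one bit per pixel.

```python
import math
print(-(0.5 * math.log2(0.5) + 0.5 * math.log2(0.5)))   # 1.0 bit per symbol
```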

Entropy Example
- A gray scale image has 256 levels, A = {0, 1, 2, ..., 255}, with equal probabilities. Calculate the entropy.
- H = -256 * (1/256) * log2(1/256) = 8 bits
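A quick check of this result (a one-line sketch):

```python
import math
print(-256 * (1/256) * math.log2(1/256))   # 8.0 bits per symbol
```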


Entropy Example
- Calculate the entropy of aaabbbbccccdd
  - P(a) = 0.23
  - P(b) = 0.3
  - P(c) = 0.3
  - P(d) = 0.15
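One way to work this exercise (a sketch): count the symbols directly and feed the exact relative frequencies (3/13, 4/13, 4/13, 2/13, which the slide rounds to 0.23, 0.3, 0.3, 0.15) into the entropy formula.

```python
from collections import Counter
import math

message = "aaabbbbccccdd"
counts = Counter(message)                        # a: 3, b: 4, c: 4, d: 2
probs = [n / len(message) for n in counts.values()]

H = -sum(p * math.log2(p) for p in probs)
print(round(H, 2), "bits per symbol")            # ~1.95 bits
```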
