Audio Compression Techniques Overview

Simple audio compression methods include silence compression, ADPCM, LPC, and CELP. MPEG audio goes further by exploiting properties of human hearing such as the threshold of hearing, frequency masking, and temporal masking. It combines a psychoacoustic model with three layers of compression built on sub-band coding (with transform coding added in Layer III) to reach compression factors from 2.7 to 24; at about 6:1, expert listeners could not distinguish coded from original audio. It divides the audio into frequency bands and allocates bits non-uniformly based on masking thresholds.

Uploaded by

WillFonseca

Audio Compression

Multimedia Systems (Module 4 Lesson 4)

Summary:
H Simple Audio Compression:
m Lossy: Prediction based
H Psychoacoustic Model
H MPEG Audio
m Layer I and II
m MP3 (MPEG Layer III)

Sources:
H Dr. Ze-Nian Li's course material at:
http://www.cs.sfu.ca/CourseCentral/365/li/
H MPEG Audio:
http://www.mpeg.org/MPEG/audio.html

Simple Audio Compression Methods

H Silence Compression - detect the "silence", similar to run-
length coding
H Adaptive Differential Pulse Code Modulation (ADPCM), e.g., in
CCITT G.721 at 32 kbits/sec (its successor G.726 also covers 16 kbits/sec).
m Encode the difference between two or more consecutive
samples; the difference is then quantized --> hence the loss
m Adaptive quantization
m It is necessary to predict where the waveform is headed
m Apple has a proprietary scheme called ACE/MACE, a lossy
scheme that tries to predict where the wave will go in the next sample.
It gives about 2:1 compression.
H Linear Predictive Coding (LPC) fits the signal to a speech model
and then transmits the parameters of the model. The result sounds like a
computer talking, at 2.4 kbits/sec.
H Code Excited Linear Predictor (CELP) does LPC, but also
transmits error term --> audio conferencing quality at 4.8
kbits/sec.
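The prediction-plus-adaptive-quantization idea behind ADPCM can be sketched as follows. This is a toy illustration, not the G.721 or ACE/MACE algorithm: the previous-sample predictor, the 4-bit code width, the initial step, and the step-adaptation rule in `_next_step` are all assumptions chosen for readability.

```python
def _next_step(step, code, max_code):
    """Hypothetical adaptation rule: grow the step after large codes,
    shrink it after small ones, and keep it within fixed bounds."""
    if abs(code) >= max_code:
        step *= 2.0
    elif abs(code) <= 1:
        step *= 0.75
    return min(max(step, 1.0), 1024.0)


def adpcm_encode(samples, bits=4, step=4.0):
    """Encode each sample as a quantized difference from the prediction."""
    max_code = 2 ** (bits - 1) - 1
    min_code = -(2 ** (bits - 1))
    predicted = 0.0  # predictor: the previously reconstructed sample
    codes = []
    for x in samples:
        diff = x - predicted
        # Quantizing the difference is where the loss happens.
        code = max(min_code, min(max_code, round(diff / step)))
        codes.append(code)
        predicted += code * step  # track what the decoder will rebuild
        step = _next_step(step, code, max_code)
    return codes


def adpcm_decode(codes, bits=4, step=4.0):
    """Rebuild samples by mirroring the encoder's predictor and step."""
    max_code = 2 ** (bits - 1) - 1
    predicted = 0.0
    out = []
    for code in codes:
        predicted += code * step
        out.append(predicted)
        step = _next_step(step, code, max_code)
    return out


if __name__ == "__main__":
    signal = [0, 10, 25, 45, 60, 62, 55, 40, 20, 5]
    codes = adpcm_encode(signal)
    print(codes)
    print(adpcm_decode(codes))
```

Because the decoder repeats the same step adaptation from the codes alone, no step sizes need to be transmitted; only the small codes are sent.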

Psychoacoustic Model
Human hearing and voice
m Frequency range is about 20 Hz to 20 kHz, most sensitive at 1
to 5 kHz.
m Dynamic range (quietest to loudest) is about 96 dB
m Normal voice range is about 500 Hz to 2 kHz
Low frequencies are vowels and bass
High frequencies are consonants
How sensitive is human hearing?
To answer this question we look at the following concepts:
m Threshold of hearing
Describes the notion of quietness
m Frequency Masking
A component (at a particular frequency) masks components at
neighboring frequencies. Such masking may be partial.
m Temporal Masking
When two tones (samples) are played close together in time, one can
mask the other.
Threshold of hearing
Experiment: Put a person in a quiet room. Raise level of 1 kHz
tone until just barely audible. Vary the frequency and plot:

[Figure: threshold of hearing, level in dB (0 to 30) versus frequency in kHz (2 to 16)]

H The ear is most sensitive to frequencies between 1 and 5
kHz, where we can actually hear signals below 0 dB.
H Two tones of equal power and different frequencies will not
be equally loud.
H Sensitivity decreases at low and high frequencies.
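The shape of the threshold curve can be illustrated with a closed-form approximation. The formula below is a commonly cited approximation of the threshold in quiet (often attributed to Terhardt); it is not part of the slides, and the function name is hypothetical. It is only a rough model of the measured curve.

```python
import math


def threshold_in_quiet_db(freq_hz):
    """Approximate threshold of hearing in dB SPL at freq_hz.

    Assumption: a widely used textbook approximation, not taken
    from the slides.
    """
    f = freq_hz / 1000.0  # the formula works in kHz
    return (3.64 * f ** -0.8
            - 6.5 * math.exp(-0.6 * (f - 3.3) ** 2)
            + 1e-3 * f ** 4)


if __name__ == "__main__":
    for hz in (100, 1000, 3300, 10000, 16000):
        print(f"{hz:>6} Hz: {threshold_in_quiet_db(hz):6.1f} dB")
```

Under this approximation the threshold dips below 0 dB around 3 kHz and rises steeply toward both ends of the audible range, in line with the bullets above.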

Frequency Masking
Experiment: Play 1 kHz tone (masking tone) at fixed level (60
dB). Play test tone at a different frequency (e.g., 1.1 kHz), and
raise its level until it is just distinguishable. Vary the frequency of
the test tone and plot the threshold at which it becomes
audible:

Frequency Masking (Contd.)

H Repeat previous experiment for various frequencies of
masking tones
Temporal Masking
H If we hear a loud sound, and then it stops, it takes a little
while until we can hear a soft tone nearby (in frequency).
H Experiment:
m Play 1 kHz masking tone at 60 dB, plus a test tone at 1.1 kHz at
40 dB. Test tone can't be heard (it's masked).
m Stop masking tone, then stop test tone after a short delay.
m Adjust delay time to the shortest time when test tone can be
heard (e.g., 5 ms).
m Repeat with different levels of the test tone and plot:

Net effect of masking:

MPEG Audio
Facts
H The two most common advanced (beyond simple ADPCM)
techniques for audio coding are:
m Sub-Band Coding (SBC) based
m Adaptive Transform Coding based
H MPEG audio coding is comprised of three independent layers.
Each layer is a self-contained SBC coder with its own time-
frequency mapping, psychoacoustic model, and quantizer.
m Layer I: Uses sub-band coding
m Layer II: Uses sub-band coding (longer frames, more
compression)
m Layer III: Uses both sub-band coding and transform coding.
H MPEG-1 Audio is intended to take a PCM audio signal sampled
at a rate of 32, 44.1 or 48 kHz, and encode it at a bit rate
of 32 to 192 kbps per audio channel (depending on layer).
More Facts
H MPEG-1: Bit rate of 1.5 Mbits/sec for audio and video combined: about
1.2 Mbits/sec for video, 0.3 Mbits/sec for audio
m (Uncompressed CD audio is 44,100 samples/sec * 16 bits/sample
* 2 channels > 1.4 Mbits/sec)
H Compression factor ranging from 2.7 to 24.
H With a compression rate of 6:1 (16-bit stereo sampled at 48
kHz is reduced to 256 kbits/sec):
m Under optimal listening conditions, expert listeners could not
distinguish between coded and original audio clips.
H Supports one or two audio channels in one of the four modes:
1. Monophonic -- single audio channel
2. Dual-monophonic -- two independent channels, e.g., English and
French
3. Stereo -- for stereo channels that share bits, but not using
Joint-stereo coding
4. Joint-stereo -- takes advantage of the correlations between
stereo channels
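The bit-rate figures above follow from simple arithmetic, sketched here. The helper name `pcm_bit_rate` is hypothetical; the last calculation pairs the lowest per-channel rate quoted earlier (32 kbps) with one 16-bit, 48 kHz channel, which is an assumption about how the upper compression factor of 24 arises.

```python
def pcm_bit_rate(sample_rate_hz, bits_per_sample, channels):
    """Bit rate of uncompressed PCM audio in bits per second."""
    return sample_rate_hz * bits_per_sample * channels


# Uncompressed CD audio: 44,100 samples/sec * 16 bits * 2 channels.
cd_rate = pcm_bit_rate(44_100, 16, 2)

# 16-bit stereo at 48 kHz, reduced to 256 kbits/sec.
stereo_48k = pcm_bit_rate(48_000, 16, 2)
ratio_at_256k = stereo_48k / 256_000

# One 16-bit, 48 kHz channel coded at 32 kbps (assumed pairing).
mono_48k = pcm_bit_rate(48_000, 16, 1)
ratio_at_32k = mono_48k / 32_000

if __name__ == "__main__":
    print(cd_rate)        # 1411200 bits/sec, just over 1.4 Mbits/sec
    print(ratio_at_256k)  # 6.0
    print(ratio_at_32k)   # 24.0
```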

MPEG Coding Algorithm

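[Figure: block diagram. Input --> Filter into Critical Bands (Sub-band filtering) --> Allocate bits (Quantization) --> Format BitStream --> Output. In parallel, the input feeds Compute Masking (Psychoacoustic Model), which drives the bit allocation.]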

1. Use convolution filters to divide the audio signal (e.g., 48 kHz
sound) into 32 frequency sub-bands. (sub-band filtering)
2. Determine amount of masking for each band caused by nearby
bands using the psychoacoustic model.
3. If the power in a band is below the masking threshold, don't
encode it.
4. Otherwise, determine number of bits needed to represent the
coefficient such that, the noise introduced by quantization is
below the masking effect (Recall that one fewer bit of
quantization introduces about 6 dB of noise).
5. Format bitstream
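The "about 6 dB per bit" rule recalled in step 4 can be derived for a uniform quantizer. The derivation below is a standard textbook result rather than something stated in the slides; it assumes an N-bit uniform quantizer, a full-scale sinusoid of amplitude A, and quantization error uniformly distributed over one step.

```latex
\[
\Delta = \frac{2A}{2^{N}}, \qquad
\sigma_q^{2} = \frac{\Delta^{2}}{12}
\]
\[
\mathrm{SQNR}
  = 10\log_{10}\frac{A^{2}/2}{\Delta^{2}/12}
  = 10\log_{10}\!\left(\tfrac{3}{2}\,2^{2N}\right)
  \approx 6.02\,N + 1.76\ \text{dB}
\]
```

Each bit removed from N therefore raises the quantization noise by 20 log10(2), roughly 6 dB, relative to the signal.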

Masking and Quantization (Example)

H Say, performing the sub-band filtering step on the input
results in the following values (for demonstration, we are
only looking at the first 16 of the 32 bands):

Band        1  2  3  4  5  6  7  8  9 10 11 12 13 14 15 16
Level (dB)  0  8 12 10  6  2 10 60 35 20 15  2  3  5  3  1

H The 60 dB level of the 8th band gives a masking of 12 dB in
the 7th band and 15 dB in the 9th (according to the
psychoacoustic model).
H The level in the 7th band is 10 dB ( < 12 dB ), so ignore it.
H The level in the 9th band is 35 dB ( > 15 dB ), so send it.
H We only send the amount above the masking level.
H Therefore, instead of using 6 bits to encode it, we can use 4
bits -- a saving of 2 bits (= 12 dB).
H In general, determine the number of bits needed to represent the
coefficient such that the noise introduced by quantization is below the
masking effect [noise introduced = 12 dB; masking = 15 dB].
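The example can be replayed in a few lines. This is a minimal sketch: it uses only the two masking values given above (bands 7 and 9), applies the 6 dB per bit rule of thumb, and the function names are hypothetical.

```python
import math

DB_PER_BIT = 6  # rule of thumb from the text: about 6 dB per bit

# Levels (dB) of the first 16 sub-bands from the example.
levels_db = [0, 8, 12, 10, 6, 2, 10, 60, 35, 20, 15, 2, 3, 5, 3, 1]

# Masking (dB) caused by the 60 dB tone in band 8; the example
# states values only for its two neighbours.
masking_db = {7: 12, 9: 15}


def bits_for(level_db):
    """Bits needed to cover level_db at about 6 dB per bit."""
    return math.ceil(level_db / DB_PER_BIT)


def encode_band(band):
    """Return None if the band is masked, otherwise the pair
    (bits without masking, bits for the part above the mask)."""
    level = levels_db[band - 1]  # bands are numbered from 1
    mask = masking_db[band]
    if level < mask:
        return None  # below the masking threshold: do not encode
    return bits_for(level), bits_for(level - mask)


if __name__ == "__main__":
    print(encode_band(7))  # None: 10 dB is below the 12 dB mask
    print(encode_band(9))  # (6, 4): 35 dB needs 6 bits, 20 dB needs 4
```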
MPEG Coding Specifics

[Figure: the audio samples feed 32 sub-band filters (0 to 31), each producing groups of 12 samples. A Layer I frame holds one group of 12 samples per sub-band; a Layer II or III frame holds three groups of 12 samples per sub-band.]

MPEG Coding Specifics

H MPEG Layer I
m Filter is applied one frame (12 x 32 = 384 samples) at a time. At 48 kHz,
each frame carries 8 ms of sound.
m Uses a 512-point FFT to get detailed spectral information about the
signal for the psychoacoustic model. The sub-band filter uses equal
frequency spread per band.
m Psychoacoustic model only uses frequency masking.
m Typical applications: Digital recording on tapes, hard disks, or
magneto-optical disks, which can tolerate the high bit rate.
m Highest quality is achieved with a bit rate of 384 kbps.
H MPEG Layer II
m Uses three frames in the filter (previous, current, next; a total of 1152
samples). At 48 kHz, each frame carries 24 ms of sound.
m Models a little bit of the temporal masking.
m Uses a 1024-point FFT for greater frequency resolution. Uses equal
frequency spread per band.
m Highest quality is achieved with a bit rate of 256 kbps.
m Typical applications: Audio Broadcasting, Television, Consumer and
Professional Recording, and Multimedia.
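The frame sizes and durations quoted for Layers I and II can be checked with a short calculation. The helper names are hypothetical, and the sub-band width assumes the 32 equal-width bands span 0 Hz up to half the sampling rate.

```python
SUB_BANDS = 32
SAMPLES_PER_GROUP = 12


def frame_samples(groups_per_band):
    """Samples per frame: groups of 12 samples across 32 sub-bands."""
    return groups_per_band * SAMPLES_PER_GROUP * SUB_BANDS


def frame_duration_ms(samples, sample_rate_hz):
    """Duration of one frame in milliseconds."""
    return 1000 * samples / sample_rate_hz


def sub_band_width_hz(sample_rate_hz):
    """Width of each equal-width sub-band (assumed to cover 0..fs/2)."""
    return sample_rate_hz / 2 / SUB_BANDS


layer1 = frame_samples(1)  # 384 samples
layer2 = frame_samples(3)  # 1152 samples

if __name__ == "__main__":
    print(layer1, frame_duration_ms(layer1, 48_000))  # 384 8.0
    print(layer2, frame_duration_ms(layer2, 48_000))  # 1152 24.0
    print(sub_band_width_hz(48_000))                  # 750.0
```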
MPEG Coding Specifics
H MPEG Layer III
m Better critical band filter is used
m Uses non-equal frequency bands
m Psychoacoustic model includes temporal masking effects; the coder also
takes stereo redundancy into account and uses a Huffman coder.
Stereo Redundancy Coding:
m Intensity stereo coding -- at upper-frequency sub-bands,
encode summed signals instead of independent signals from left
and right channels.
m Middle/Side (MS) stereo coding -- encode middle (sum of left
and right) and side (difference of left and right) channels.
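Middle/Side coding can be sketched as a reversible sum/difference transform. The halving used here is an assumption made so that the round trip is exact in this toy example; the scaling in an actual Layer III encoder may differ.

```python
def ms_encode(left, right):
    """Convert left/right samples to middle (sum) and side (difference).

    Assumption: both are halved so that decoding is a plain add/subtract.
    """
    mid = [(l + r) / 2 for l, r in zip(left, right)]
    side = [(l - r) / 2 for l, r in zip(left, right)]
    return mid, side


def ms_decode(mid, side):
    """Recover left/right from middle/side."""
    left = [m + s for m, s in zip(mid, side)]
    right = [m - s for m, s in zip(mid, side)]
    return left, right


if __name__ == "__main__":
    left = [10, 20, -4]
    right = [8, 20, -6]
    mid, side = ms_encode(left, right)
    print(mid, side)             # [9.0, 20.0, -5.0] [1.0, 0.0, 1.0]
    print(ms_decode(mid, side))  # ([10.0, 20.0, -4.0], [8.0, 20.0, -6.0])
```

When the two channels are strongly correlated, the side signal stays close to zero and needs few bits, which is the redundancy this mode takes advantage of.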

Effectiveness of MPEG Audio

Layer      Target bit-rate  Ratio  Quality* at 64 kbps  Quality at 128 kbps
Layer I    192 kbps         4:1    --                   --
Layer II   128 kbps         6:1    2.1 to 2.6           4+
Layer III  64 kbps          12:1   3.6 to 3.8           4+

*Quality factor:
m 5 - perfect
m 4 - just noticeable
m 3 - slightly annoying
m 2 - annoying
m 1 - very annoying
