Unit 5e - Autoencoders

Autoencoders

Chapter 14 from
The Deep Learning book
(Goodfellow et al.)
1
Autoencoders
 Autoencoders are artificial neural networks
capable of learning efficient representations of
the input data, called codings (or latent
representation), without any supervision.
 These codings typically have a much lower
dimensionality than the input data, making
autoencoders useful for dimensionality reduction.
 Some autoencoders are generative models: they
are capable of randomly generating new data
that looks very similar to the training data.
 However, the generated images are usually fuzzy and
not entirely realistic.
2
Autoencoders
 Which of the following number sequences
do you find the easiest to memorize?
 40, 27, 25, 36, 81, 57, 10, 73, 19, 68
 50, 25, 76, 38, 19, 58, 29, 88, 44, 22, 11, 34,
17, 52, 26, 13, 40, 20

3
Autoencoders
 At first glance, it would seem that the first
sequence should be easier, since it is much
shorter.
 However, if you look carefully at the second
sequence, you may notice that it follows two
simple rules:
 Even numbers are followed by their half, and
 odd numbers are followed by their triple plus one.
 This is a famous sequence known as the
hailstone sequence.
4
Autoencoders
 Once you notice this pattern, the second
sequence becomes much easier to memorize
than the first because you only need to memorize
 the first number,
 the length of the sequence, and
 the two rules.

 This leads to an efficient data representation.

 Autoencoders find efficient data representations by recognizing the underlying patterns.

5
Autoencoders

6
Autoencoders formalized
 An autoencoder consists of two parts: an encoder and a decoder.
 The encoder transforms the data into a set of “factors” (the code), i.e. h = f(x).
 The decoder decodes the encoded information and tries to reconstruct the original data, i.e. r = g(h) = g(f(x)).
 The goal of the autoencoder is to minimize the difference between the original data and the reconstructed data:
L(x, g(f(x)))
7
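As an illustration of this formalization, here is a minimal PyTorch sketch of an encoder f, a decoder g, and the reconstruction loss L(x, g(f(x))). The layer sizes and the 784-dimensional input (e.g. flattened 28×28 images) are illustrative assumptions, not part of the slides.

```python
import torch
import torch.nn as nn

# Encoder f: maps the input x to a lower-dimensional code h = f(x).
encoder = nn.Sequential(nn.Linear(784, 128), nn.ReLU(), nn.Linear(128, 32))
# Decoder g: maps the code back to a reconstruction r = g(h).
decoder = nn.Sequential(nn.Linear(32, 128), nn.ReLU(), nn.Linear(128, 784))

x = torch.rand(64, 784)                      # dummy minibatch standing in for real data
h = encoder(x)                               # code / latent representation
r = decoder(h)                               # reconstruction g(f(x))

loss = nn.functional.mse_loss(r, x)          # L(x, g(f(x))): mean squared error
loss.backward()                              # gradients for minibatch SGD training
```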
Autoencoders formalized

[Figure: the input x is mapped by the encoder f to the hidden layer (code), which the decoder g maps to the reconstruction.]

8
Autoencoders
 A question that comes to the mind of every beginner with autoencoders is “isn’t it just copying the data?”
 In practice, there are often constraints on the encoder to make sure that it does NOT lead to a solution that just copies the data.
 For example, we may require that the encoded information is of lower rank, so that the encoder can be viewed as a dimensionality reduction step.
 While an autoencoder consists of two parts, sometimes only the output of the encoder is of interest for downstream analysis.

9
Autoencoders

[Figure: an undercomplete autoencoder (code dimension smaller than the input) vs. an overcomplete autoencoder (code dimension at least as large as the input).]

10
Autoencoders
 Autoencoders may be thought of as being a special
case of feedforward networks and may be trained
with all the same techniques, typically minibatch SGD.
 Unlike general feedforward networks, autoencoders
may also be trained using recirculation (Hinton and
McClelland, 1988), a learning algorithm based on
comparing the activations of the network on the
original input to the activations on the reconstructed
input.
 Recirculation is regarded as more biologically
plausible than back-propagation but is rarely used for
machine learning applications.
11
Undercomplete Autoencoders
 Learning an undercomplete representation forces
the autoencoder to capture the most salient
features of the training data.
 When the decoder is linear and L is the mean
squared error, an undercomplete autoencoder
learns to span the same subspace as PCA.
 Autoencoders with nonlinear encoder function f and
nonlinear decoder function g can thus learn a more
powerful nonlinear generalization of PCA.
 But, …

12
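As a hedged illustration of the PCA connection above: an undercomplete autoencoder with a linear encoder/decoder and squared-error loss learns to span the principal subspace of the (roughly centered) data. A toy sketch, with arbitrary assumed dimensions:

```python
import torch
import torch.nn as nn

# Undercomplete linear autoencoder: 50-D input, 5-D code (assumed sizes).
enc = nn.Linear(50, 5, bias=False)
dec = nn.Linear(5, 50, bias=False)
opt = torch.optim.SGD(list(enc.parameters()) + list(dec.parameters()), lr=0.01)

X = torch.randn(1000, 50)                    # toy, roughly zero-mean data
for _ in range(500):
    opt.zero_grad()
    loss = nn.functional.mse_loss(dec(enc(X)), X)   # squared reconstruction error
    loss.backward()
    opt.step()

# The columns of dec.weight now (approximately) span the same subspace
# as the top 5 principal components of X.
```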
Undercomplete Autoencoders
 Unfortunately, if the encoder and decoder are
allowed too much capacity, the autoencoder can
learn to perform the copying task without
extracting useful information about the
distribution of the data.
 E.g., an autoencoder with a one-dimensional code and a very powerful nonlinear encoder can learn to map each training example x(i) to the code i.
 The decoder can then learn to map these integer indices back to the values of specific training examples.

13
Regularized Autoencoders
 Ideally, choose code size (dimension of h) small
and capacity of encoder f and decoder g based
on complexity of distribution modeled.
 Regularized autoencoders: Rather than limiting
model capacity by keeping encoder/decoder
shallow and code size small, we can use a loss
function that encourages the model to have
properties other than copying its input to its output:
 Sparsity of the representation
 Smoothness of the derivatives
 Robustness to noise and errors in the data
14
Sparse Autoencoders
 A sparse autoencoder uses a training criterion that adds a sparsity penalty Ω(h) to the loss function:
L(x, g(f(x))) + Ω(h)
 An autoencoder that has been regularized to be sparse must respond to unique statistical features of the dataset it has been trained on, rather than simply acting as an identity function.
 In this way, training to perform the copying task with a
sparsity penalty can yield a model that has learned
useful features as a byproduct.

15
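A minimal sketch of one common realization of the sparsity penalty: an L1 term on the code activations h added to the reconstruction loss. The penalty weight and layer sizes are assumptions.

```python
import torch
import torch.nn as nn

encoder = nn.Sequential(nn.Linear(784, 256), nn.ReLU(), nn.Linear(256, 64), nn.ReLU())
decoder = nn.Sequential(nn.Linear(64, 256), nn.ReLU(), nn.Linear(256, 784))
lam = 1e-3                                   # sparsity weight (assumed hyperparameter)

x = torch.rand(64, 784)                      # dummy minibatch
h = encoder(x)                               # code activations
recon = nn.functional.mse_loss(decoder(h), x)
sparsity = h.abs().mean()                    # Omega(h): L1 penalty on the code
loss = recon + lam * sparsity                # L(x, g(f(x))) + Omega(h)
```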
Denoising Autoencoders (DAEs)
 In addition to adding penalty terms, there are other tricks to keep autoencoders from simply copying the data.
 One trick is to add some noise to the input data; x̃ is used to denote a noisy (corrupted) version of the input x.
 The denoising autoencoder (DAE) seeks to minimize L(x, g(f(x̃))).
 Denoising training forces f and g to implicitly learn the structure of p_data(x).
 Another example of how useful properties can emerge as a by-product of minimizing reconstruction error.
16
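A minimal sketch of the denoising objective, assuming additive Gaussian corruption (one common choice; the noise level and layer sizes are assumptions): corrupt the input to x̃ and penalize the reconstruction against the clean x.

```python
import torch
import torch.nn as nn

encoder = nn.Sequential(nn.Linear(784, 128), nn.ReLU(), nn.Linear(128, 32))
decoder = nn.Sequential(nn.Linear(32, 128), nn.ReLU(), nn.Linear(128, 784))

x = torch.rand(64, 784)                      # clean minibatch
x_tilde = x + 0.3 * torch.randn_like(x)      # corrupted copy, a draw from C(x_tilde | x)
recon = decoder(encoder(x_tilde))            # g(f(x_tilde))
loss = nn.functional.mse_loss(recon, x)      # L(x, g(f(x_tilde))): compare to the CLEAN input
```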
Contractive Autoencoder (CAE)
 Another strategy for regularizing an autoencoder is to use a penalty as in sparse autoencoders, L(x, g(f(x))) + Ω(h, x), but with a different form of Ω.
 This forces the model to learn a function that does not change much when x changes slightly.
 An autoencoder regularized in this way is
called a contractive autoencoder, CAE.
17
Representational Power
 Autoencoders are often trained with a single-layer encoder and a single-layer decoder.
 However, using a deep encoder offers many advantages:
 They can approximate any mapping from input to code
arbitrarily well, given enough hidden units.
 They yield much better compression than
corresponding shallow autoencoders.
 Depth can exponentially reduce the computational
cost of representing some functions.
 Depth can also exponentially decrease the amount of
training data needed to learn some functions.

18
Stochastic Encoders and Decoders
 The general strategy for designing the output units and loss function of a feedforward network is to
 define the output distribution p(y|x), and
 minimize the negative log-likelihood –log p(y|x).
 In this case y is a vector of targets such as class labels.
 In an autoencoder, x is the target as well as the input.
 Yet we can apply the same machinery as before.
19
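For instance, with binary-valued inputs one can let the decoder define a Bernoulli distribution p_decoder(x | h) and minimize its negative log-likelihood, which is exactly the binary cross-entropy. A sketch with assumed sizes:

```python
import torch
import torch.nn as nn

encoder = nn.Linear(784, 32)
decoder = nn.Linear(32, 784)                 # outputs Bernoulli logits, one per input unit

x = torch.bernoulli(torch.rand(64, 784))     # binary data: x is both input and target
logits = decoder(encoder(x))
# -log p_decoder(x | h): the Bernoulli negative log-likelihood is binary cross-entropy.
loss = nn.functional.binary_cross_entropy_with_logits(logits, x)
```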
Stochastic Encoders and Decoders

[Figure: the input x is mapped to the hidden layer (code) h by the encoder distribution p_encoder(h | x), and the reconstruction is produced by the decoder distribution p_decoder(x | h).]

20
Denoising Autoencoders (DAEs)
 Defined as an autoencoder that receives a corrupted
data point as input and is trained to predict the original,
uncorrupted data point as its output.
 Traditional autoencoders minimize L(x, g(f(x))).
 A DAE seeks to minimize L(x, g(f(x̃))), where x̃ is a copy of x corrupted by some noise process.
 The autoencoder must undo this corruption rather than simply copying its input.

[Figure: a noisy input is passed through the encoder to a latent-space representation, which the decoder maps back to a denoised output.]
21
Denoising Autoencoders (DAEs)
 A DAE is trained to reconstruct the clean data point x from its corrupted version x̃ by minimizing the loss
L = -log p_decoder(x | h = f(x̃))
 The autoencoder learns a reconstruction distribution p_reconstruct(x | x̃) estimated from training pairs (x, x̃) as follows:
1. Sample a training example x from the training data.
2. Sample a corrupted version x̃ from the corruption process C(x̃ | x).
3. Use (x, x̃) as a training example for estimating the autoencoder reconstruction distribution p_reconstruct(x | x̃) = p_decoder(x | h), with h the output of the encoder f(x̃) and p_decoder typically defined by a decoder g(h).
22
Denoising Autoencoders (DAEs)
 Score matching is often employed to train
DAEs.
 Score Matching encourages the model to
have the same score as the data
distribution at every training point x.
 The score is a particular gradient field: ∇x log p(x).
 The DAE estimates this score as g(f(x)) − x.
 See picture on the next slide.

23
Denoising Autoencoders (DAEs)
 Training examples x are shown as red crosses.
 The gray circle shows equiprobable corruptions of a training example.
 The vector field g(f(x)) − x, indicated by green arrows, estimates the score ∇x log p(x), which is the slope of the data density.

24
Contractive Autoencoder (CAE)
 The contractive autoencoder has an explicit regularizer on h = f(x), encouraging the derivatives of f to be as small as possible:
Ω(h) = λ ‖∂f(x)/∂x‖²_F
 The penalty Ω(h) is the squared Frobenius norm (sum of squared elements) of the Jacobian matrix of partial derivatives associated with the encoder function.

25
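A sketch of how the contractive penalty can be computed with autograd: accumulate ‖∇x h_i‖² over the code units to obtain the squared Frobenius norm of the encoder Jacobian. The small layer sizes and the weight λ are assumptions chosen to keep the example cheap.

```python
import torch
import torch.nn as nn

encoder = nn.Sequential(nn.Linear(20, 5), nn.Sigmoid())   # tiny sizes keep the Jacobian cheap
decoder = nn.Linear(5, 20)
lam = 0.1                                                  # penalty weight (assumed)

x = torch.rand(16, 20, requires_grad=True)
h = encoder(x)
recon = nn.functional.mse_loss(decoder(h), x)

# Squared Frobenius norm of the encoder Jacobian dh/dx, accumulated per code unit.
# Summing h[:, i] over the batch is fine here because examples do not interact.
frob = 0.0
for i in range(h.shape[1]):
    grad_x, = torch.autograd.grad(h[:, i].sum(), x, create_graph=True)
    frob = frob + (grad_x ** 2).sum()

loss = recon + lam * frob / x.shape[0]       # L(x, g(f(x))) + lam * ||df/dx||_F^2 (batch mean)
loss.backward()
```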
DAEs vs. CAEs
 DAEs make the reconstruction function resist small, finite-sized perturbations of the input.
 CAEs make the feature encoding function resist infinitesimal perturbations of the input.
 Both denoising AEs and contractive AEs perform well!
 Both are overcomplete.
26
DAEs vs. CAEs
 Advantage of DAE: simpler to implement
 Requires adding one or two lines of code to
regular AE.
 No need to compute Jacobian of hidden layer.
 Advantage of CAE: gradient is
deterministic.
 Might be more stable than DAE, which uses a
sampled gradient.
 One less hyperparameter to tune (the noise factor).
27
Recurrent Autoencoders
 In a recurrent autoencoder, the encoder is
typically a sequence-to-vector RNN which
compresses the input sequence down to a
single vector.
 The decoder is a vector-to-sequence RNN
that does the reverse.

28
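A hedged sketch of a recurrent autoencoder: a GRU encoder compresses the sequence into its final hidden state (the code), and a GRU decoder, fed that code at every time step, expands it back into a sequence. Sequence length, feature sizes, and the repeat-the-code trick are illustrative assumptions.

```python
import torch
import torch.nn as nn

class RecurrentAutoencoder(nn.Module):
    def __init__(self, n_features=8, code_size=16):
        super().__init__()
        self.encoder = nn.GRU(n_features, code_size, batch_first=True)
        self.decoder = nn.GRU(code_size, code_size, batch_first=True)
        self.out = nn.Linear(code_size, n_features)

    def forward(self, x):                    # x: (batch, seq_len, n_features)
        _, h = self.encoder(x)               # h: (1, batch, code_size) -- the code vector
        seq_len = x.shape[1]
        # Feed the code vector to the decoder at every time step ("repeat vector" trick).
        code = h.transpose(0, 1).repeat(1, seq_len, 1)
        y, _ = self.decoder(code)
        return self.out(y)                   # reconstructed sequence

model = RecurrentAutoencoder()
x = torch.rand(4, 10, 8)                     # dummy batch of sequences
loss = nn.functional.mse_loss(model(x), x)
```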
Convolutional autoencoders
 Convolutional neural networks are far better
suited than dense networks to work with images.
 Convolutional autoencoder: The encoder is a
regular CNN composed of convolutional layers
and pooling layers.
 It typically reduces the spatial dimensionality of the
inputs (i.e., height and width) while increasing the
depth (i.e., the number of feature maps).
 The decoder does the reverse using transpose
convolutional layers.

29
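A sketch of a convolutional autoencoder for 28×28 single-channel images along the lines described above; the exact layer configuration is an assumption.

```python
import torch
import torch.nn as nn

# Encoder: convolutions + pooling shrink height/width while increasing depth.
encoder = nn.Sequential(
    nn.Conv2d(1, 16, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),   # 28x28 -> 14x14
    nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),  # 14x14 -> 7x7
)
# Decoder: transpose convolutions do the reverse.
decoder = nn.Sequential(
    nn.ConvTranspose2d(32, 16, kernel_size=2, stride=2), nn.ReLU(),           # 7x7 -> 14x14
    nn.ConvTranspose2d(16, 1, kernel_size=2, stride=2), nn.Sigmoid(),         # 14x14 -> 28x28
)

x = torch.rand(8, 1, 28, 28)                 # dummy batch of single-channel images
loss = nn.functional.mse_loss(decoder(encoder(x)), x)
```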
Applications of Autoencoders
 Data compression
 Dimensionality reduction
 Information retrieval
 Image denoising
 Feature extraction
 Removing watermarks from images

30
Applications of Autoencoders
 Autoencoders have been successfully applied to
dimensionality reduction and information retrieval
tasks.
 Dimensionality reduction is one of the early
motivations for studying autoencoders.
 Deep autoencoders have yielded less reconstruction error than PCA (Hinton and Salakhutdinov, 2006).
 If we can produce a code that is low-dimensional and binary, then we can store all database entries in a hash table that maps binary code vectors to entries; this is known as semantic hashing.

31
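A toy sketch of the semantic-hashing idea: threshold a low-dimensional code into bits and use the bit pattern as a hash-table key. The encoder, the 0.5 threshold, and the stand-in database are illustrative assumptions.

```python
import torch
import torch.nn as nn
from collections import defaultdict

encoder = nn.Sequential(nn.Linear(784, 16), nn.Sigmoid())   # 16 code units in (0, 1)

def binary_code(x):
    """Threshold the sigmoid code into a tuple of bits, usable as a dict key."""
    with torch.no_grad():
        return tuple((encoder(x) > 0.5).int().tolist())

database = torch.rand(1000, 784)             # stand-in for real database entries
index = defaultdict(list)
for i, entry in enumerate(database):
    index[binary_code(entry)].append(i)      # hash table: binary code -> entry ids

query = torch.rand(784)
candidates = index[binary_code(query)]       # entries whose code matches the query's code
```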
Chapter Summary
 Autoencoders motivated.
 Sparse autoencoders
 Denoising autoencoders
 Contractive autoencoder
 Recurrent/Convolutional autoencoders
 Applications of Autoencoders

32
