AAI Module 3

These are the notes for Module 3 of AAI in a summarized format.

MODULE 3

Variational Autoencoders

CHAPTER 3

University Prescribed Syllabus

Introduction : Basic components of Variational Autoencoders (VAEs), Architecture and Training of VAEs, the loss function, Latent space, Application of VAEs in image generation. Types of Autoencoders : Undercomplete Autoencoders, Sparse Autoencoders, Contractive Autoencoders, Denoising Autoencoders, Variational Autoencoders.

3.1 INTRODUCTION

3.1.1 Basic Components of Variational Autoencoders (VAEs)

GQ. What is the need of an autoencoder? (4 Marks)
GQ. What are the properties of an autoencoder? (4 Marks)
GQ. Describe the architecture of an autoencoder. (4 Marks)

- Variational Autoencoders (VAEs) were introduced in 2013 by Kingma and Welling. In neural network terms, a variational autoencoder consists of an encoder, a decoder, and a loss function.

Fig. 3.1.1 : Encoder-Decoder (Data x → Encoder → z → Decoder → Reconstruction)

- The encoder is a neural network. Its input is a datapoint x, its output is a hidden representation z, and it has weights and biases φ. To be concrete, let's say x is a 28 by 28-pixel photo of a handwritten digit. The encoder 'encodes' the data, which is 784-dimensional, into a latent (hidden) representation space z, which has far fewer than 784 dimensions. This is typically referred to as a 'bottleneck' because the encoder must learn an efficient compression of the data into this lower-dimensional space. Let's denote the encoder q_φ(z|x).
- We note that the lower-dimensional space is stochastic: the encoder outputs parameters to q_φ(z|x), which is a Gaussian probability density function. We can sample from this distribution to get noisy values of the representation z.
- The decoder is another neural network. Its input is the representation z, it outputs the parameters to the probability distribution of the data, and it has weights and biases θ. The decoder is denoted by p_θ(x|z). Continuing the handwritten digit example, let's say the photos are black and white and each pixel is represented as 0 or 1. The probability distribution of a single pixel can then be represented using a Bernoulli distribution.

GQ. Explain the architecture and training process of Variational Autoencoders.

The Architecture of Variational Autoencoder

- The encoder-decoder architecture lies at the heart of Variational Autoencoders (VAEs), distinguishing them from traditional autoencoders.
- The encoder network takes raw input data and transforms it into a probability distribution within the latent space.
- The latent code generated by the encoder is a probabilistic encoding, allowing the VAE to express not just a single point in the latent space but a distribution of potential representations.
- The decoder, in turn, takes a sampled point from the latent distribution and reconstructs it back into data space.
- During training, the model refines both the encoder and decoder parameters to minimize the reconstruction loss, i.e., the disparity between the input data and the decoded output.
- The goal is not just to achieve accurate reconstruction but also to regularize the latent space, ensuring that it conforms to a specified distribution.
- In VAEs, the encoder still maps the input data to a lower-dimensional latent space, but instead of producing a single point in the latent space, the encoder generates a probability distribution over the latent space.
- The decoder then samples from this distribution to generate a new data point. This probabilistic approach to encoding the input allows VAEs to learn a more structured and continuous latent space representation, which is useful for generative modeling and data synthesis.

To go from a traditional autoencoder to a VAE, we need to make two key modifications, as shown in the sketch after this list.

1. First, we need to replace the encoder's output with a probability distribution. Instead of the encoder outputting a point in the latent space, it outputs the parameters of a probability distribution, such as a mean and variance. This distribution is typically a multivariate Gaussian distribution but can be some other distribution as well, e.g. Bernoulli.
2. Second, we introduce a new term in the loss function called the Kullback-Leibler (KL) divergence. This term measures the difference between the learned probability distribution over the latent space and a predefined prior distribution (usually a standard normal distribution).
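The notes stop at the conceptual level here, so the following is a minimal sketch of how these two modifications look in code. PyTorch is an assumption (the notes name no framework), and the layer sizes, the 784-dimensional input and the 2-dimensional latent space are illustrative choices for the handwritten-digit example above. The encoder outputs a mean and log-variance, a latent sample is drawn via the reparameterization trick, and the loss adds the KL divergence to the standard normal prior on top of the Bernoulli reconstruction term.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class VAE(nn.Module):
    """Minimal VAE sketch: 784-dim input, 2 latent dimensions (sizes are illustrative)."""
    def __init__(self, input_dim=784, hidden_dim=256, latent_dim=2):
        super().__init__()
        # Encoder q_phi(z|x): outputs the parameters of a Gaussian over the latent space
        self.enc = nn.Linear(input_dim, hidden_dim)
        self.mu = nn.Linear(hidden_dim, latent_dim)
        self.logvar = nn.Linear(hidden_dim, latent_dim)
        # Decoder p_theta(x|z): outputs Bernoulli parameters for each pixel
        self.dec1 = nn.Linear(latent_dim, hidden_dim)
        self.dec2 = nn.Linear(hidden_dim, input_dim)

    def encode(self, x):
        h = F.relu(self.enc(x))
        return self.mu(h), self.logvar(h)

    def reparameterize(self, mu, logvar):
        # z = mu + sigma * eps, so gradients can flow through mu and logvar
        std = torch.exp(0.5 * logvar)
        eps = torch.randn_like(std)
        return mu + std * eps

    def decode(self, z):
        return torch.sigmoid(self.dec2(F.relu(self.dec1(z))))

    def forward(self, x):
        mu, logvar = self.encode(x)
        z = self.reparameterize(mu, logvar)
        return self.decode(z), mu, logvar

def vae_loss(x, x_recon, mu, logvar):
    # Reconstruction term: Bernoulli log-likelihood for binary pixels
    recon = F.binary_cross_entropy(x_recon, x, reduction='sum')
    # KL divergence between N(mu, sigma^2) and the standard normal prior N(0, I)
    kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
    return recon + kl
```

During training, both networks are updated jointly: sample a minibatch, compute `vae_loss`, backpropagate, and step the optimizer; gradients flow through the sampling step because of the reparameterization.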
3.1.3 Undercomplete Autoencoder

GQ. What is the need of an undercomplete autoencoder? (2 Marks)
GQ. Draw the architecture of an undercomplete autoencoder. (2 Marks)

- An Undercomplete Autoencoder has fewer nodes (dimensions) in the middle compared to the Input and Output layers, which is used to obtain important features from the data. In these setups, we tend to call the middle layer a bottleneck.
- In the case of Undercomplete Autoencoders, we are squeezing the information into fewer dimensions (hence the bottleneck) while trying to ensure that we can still get back to the original values. Therefore, we are creating a custom function that compresses the data, which is a way to reduce the dimensionality and extract meaningful information.
- After training the Undercomplete Autoencoder, we typically discard the Decoder and only use the Encoder part.
- An objective of the undercomplete autoencoder is to capture the most important features present in the data. It minimizes the loss function by penalizing the reconstruction g(f(x)) for being different from the input x.

Fig. 3.1.7 : Undercomplete autoencoder Architecture

The figure below shows the I/P and O/P of an undercomplete autoencoder.

Fig. 3.1.8 : Undercomplete autoencoder I/P and O/P

3.1.4 Overcomplete Autoencoder

GQ. What is the need of an overcomplete autoencoder? (2 Marks)
GQ. Draw the architecture of an overcomplete autoencoder. (2 Marks)

- An Overcomplete Autoencoder has more nodes (dimensions) in the middle compared to the Input and Output layers.
- While poor generalization can happen even in undercomplete autoencoders, it is an even more serious problem with overcomplete autoencoders. To avoid poor generalization, we need to introduce regularization.

Fig. 3.1.9 : Overcomplete autoencoder Architecture

3.2.1 Denoising Autoencoder

- One of the simpler variations of the autoencoder is the denoising autoencoder, where the inputs are corrupted and the outputs are clean; the autoencoder basically learns to clean corrupted samples. Such denoising autoencoders can generate more robust representations, which improves classification.
- The autoencoder learns useful features by adding random noise to its inputs and making it recover the original noise-free data. This way the autoencoder can't simply copy the input to its output without learning the features in the data, because the input also contains random noise.
- We are asking it to subtract the noise and produce the underlying meaningful data. This is called a denoising autoencoder.
- In the figure below, the top row contains the original images. We add random Gaussian noise to them and feed the noisy versions to the autoencoder, which tries to recover the original image. The bottom row is the autoencoder output. We can do better by using a more complex autoencoder architecture, such as convolutional autoencoders.

Fig. 3.2.1 : Original image, noisy I/P and O/P

- A basic autoencoder trains to minimize the loss between x and the reconstruction g(f(x)). Denoising autoencoders train to minimize the loss between x and g(f(x + w)), where w is random noise (see the sketch at the end of this section). Denoising autoencoders can't simply memorize the input-output relationship.
- Intuitively, a denoising autoencoder learns a projection from a neighborhood of our training data back onto the training data. In the figure below, noise is added to the original image; it is encoded into the code, which is decoded to return the original input image.

Fig. 3.2.2 : Denoising Autoencoder

Advantages of denoising autoencoder :
- It is simpler to implement; it requires adding just one or two lines of code to a regular autoencoder.
- There is no need to compute the Jacobian of the hidden layer.
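To make the description above concrete, here is a minimal denoising autoencoder sketch. PyTorch and the layer sizes are assumptions, not taken from the notes. The network itself is an ordinary undercomplete autoencoder with a bottleneck code; the denoising behaviour comes entirely from the training step, where Gaussian noise w is added to the input while the loss is still measured against the clean target, i.e. the loss between x and g(f(x + w)).

```python
import torch
import torch.nn as nn

class DenoisingAutoencoder(nn.Module):
    """Plain undercomplete autoencoder: 784 -> 32 -> 784 (sizes are illustrative)."""
    def __init__(self, input_dim=784, code_dim=32):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(input_dim, 128), nn.ReLU(),
                                     nn.Linear(128, code_dim))            # bottleneck
        self.decoder = nn.Sequential(nn.Linear(code_dim, 128), nn.ReLU(),
                                     nn.Linear(128, input_dim), nn.Sigmoid())

    def forward(self, x):
        return self.decoder(self.encoder(x))

def train_step(model, optimizer, x_clean, noise_std=0.3):
    """One denoising step: corrupt the input, reconstruct, compare with the clean target."""
    x_noisy = x_clean + noise_std * torch.randn_like(x_clean)   # x + w, w ~ Gaussian noise
    x_recon = model(x_noisy)
    loss = nn.functional.mse_loss(x_recon, x_clean)             # loss between x and g(f(x + w))
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```

Dropping the noise line (feeding x_clean directly to the model) recovers the plain undercomplete autoencoder of Section 3.1.3.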
3.2.2 Sparse Autoencoder

- Apart from keeping the code size small (as in undercomplete autoencoders) and denoising, regularization is also used to learn useful features.
- We can regularize the autoencoder by using a sparsity constraint such that only a fraction of the nodes have nonzero values, called active nodes.
- In particular, we add a penalty term to the loss function such that only a fraction of the nodes become active. This forces the autoencoder to represent each input as a combination of a small number of nodes, and demands that it discover interesting structure in the data. This method works even if the code size is large, since only a small subset of the nodes will be active at any time.

Fig. 3.2.3 : Sparse Autoencoder Architecture

- Sparsity is introduced in terms of firing neurons: if a neuron's value is high (near about 1), it is allowed to fire; the rest are not. Sparse autoencoders construct a loss function that penalizes the activations within a layer rather than the weights of the network, which is the opposite of the usual form of regularization.
- The individual nodes that activate are data-dependent: different inputs will result in activations of different nodes through the network. The network selectively activates regions depending on the input data. E.g. L1 Regularization : penalize the absolute value of the vector of activations a in layer h for observation i, as shown in the sketch after this list.
- A neuron with a sigmoid activation function will have values between 0 and 1. We say the neuron is activated when its output is close to 1, and not activated when its output is close to 0. A sparse autoencoder tries to ensure that a neuron is inactive most of the time. Sparse autoencoders have more hidden nodes than input nodes.
- Sparsity may be introduced by additional terms in the loss function during the training process, either by comparing the probability distribution of the hidden unit activations with some low desired value, or by manually zeroing all but the strongest hidden unit activations.
- Advantage of sparse autoencoder : We can achieve an information bottleneck (the same information with fewer active neurons) without reducing the number of neurons in the hidden layers.
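The L1 activation penalty mentioned above can be added to a standard autoencoder in a few lines. This sketch assumes PyTorch; the code size of 1024 (larger than the 784-dimensional input, i.e. overcomplete) and the penalty weight are illustrative. The penalty is applied to the code activations, not to the weights, so the sparsity is data-dependent: different inputs drive different subsets of units towards 1.

```python
import torch
import torch.nn as nn

class SparseAutoencoder(nn.Module):
    """Autoencoder with a (possibly large) code; sparsity comes from the loss, not the size."""
    def __init__(self, input_dim=784, code_dim=1024):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(input_dim, code_dim), nn.Sigmoid())
        self.decoder = nn.Sequential(nn.Linear(code_dim, input_dim), nn.Sigmoid())

    def forward(self, x):
        code = self.encoder(x)          # activations in [0, 1]; "active" means close to 1
        return self.decoder(code), code

def sparse_loss(x, x_recon, code, sparsity_weight=1e-3):
    recon = nn.functional.mse_loss(x_recon, x)
    # L1 penalty on the activations (not the weights) pushes most units towards 0
    l1_penalty = code.abs().mean()
    return recon + sparsity_weight * l1_penalty
```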
3.2.3 Contractive Autoencoder

- The objective of a contractive autoencoder is to provide a robust learned representation. We can achieve this by adding a penalty term or regularizer to whatever cost or objective function the algorithm is trying to minimize. The result reduces the learned representation's sensitivity towards the training input.
- This regularizer is the Frobenius norm of the Jacobian matrix of the encoder activations with respect to the input.
- The Frobenius norm of the Jacobian matrix for the hidden layer is calculated with respect to the input and is basically the sum of squares of all its elements, as in the formula below. If this value is zero, then we don't observe any change in the learned hidden representations as we change the input values. But if the value is very large, then the learned model is unstable as the input values change.
- We generally employ contractive autoencoders as one of several other autoencoder nodes. It is in active mode only when other encoding schemes fail to label a data point.
- Frobenius Norm : the matrix analogue of the L2 vector norm.
- Jacobian matrix : the matrix of all first-order partial derivatives of a vector-valued function.

Regularizing term : ||J_f(x)||_F^2 = Σ_ij (∂h_j(x) / ∂x_i)^2  (see the sketch at the end of this section)

- Contractive autoencoders arrange for similar inputs to have similar activations, i.e., the derivatives of the hidden layer are small with respect to the input.
- Denoising autoencoders make the reconstruction function (encoder + decoder) resist small, finite-sized perturbations of the input, while contractive autoencoders make the feature extraction function (i.e., the encoder) resist infinitesimal perturbations of the input.

Fig. 3.2.4 : Contractive Autoencoder

Advantage of contractive autoencoder :
- Since the gradient is deterministic, we can use second-order optimizers, e.g. conjugate gradient, L-BFGS, etc., which might be more stable than the denoising autoencoder, which uses a sampled gradient.

Fig. 3.2.5 : Contractive Autoencoder
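For a single sigmoid encoder layer h = sigmoid(Wx + b), the Jacobian has the closed form ∂h_j/∂x_i = h_j(1 - h_j) W_ji, so the regularizing term above can be computed without automatic differentiation. The sketch below assumes PyTorch, with illustrative layer sizes and penalty weight.

```python
import torch
import torch.nn as nn

class ContractiveAutoencoder(nn.Module):
    """Single-hidden-layer autoencoder with a contractive penalty (sizes are illustrative)."""
    def __init__(self, input_dim=784, code_dim=64):
        super().__init__()
        self.enc = nn.Linear(input_dim, code_dim)
        self.dec = nn.Linear(code_dim, input_dim)

    def forward(self, x):
        h = torch.sigmoid(self.enc(x))            # hidden representation
        x_recon = torch.sigmoid(self.dec(h))
        return x_recon, h

def contractive_loss(model, x, x_recon, h, lam=1e-4):
    recon = nn.functional.mse_loss(x_recon, x)
    # For a sigmoid layer, dh_j/dx_i = h_j * (1 - h_j) * W_ji, so the squared Frobenius
    # norm of the Jacobian factorizes into the two sums combined below.
    W = model.enc.weight                          # shape: (code_dim, input_dim)
    dh = (h * (1.0 - h)) ** 2                     # shape: (batch, code_dim)
    w_sq = (W ** 2).sum(dim=1)                    # shape: (code_dim,)
    jacobian_frob_sq = (dh * w_sq).sum(dim=1).mean()
    return recon + lam * jacobian_frob_sq
```

This mirrors the comparison above: the penalty pushes the encoder's derivatives towards zero, so infinitesimal input perturbations barely move the code.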
