
Introduction to

Generative Adversarial Networks

Luke de Oliveira
Vai Technologies
Lawrence Berkeley National Laboratory

@lukede0
@lukedeo
[email protected]
https://ldo.io
Outline

• Why Generative Modeling?

• Taxonomy of Generative Models

• Generative Adversarial Networks

• Pitfalls

• Modifications

• Questions

Generative Models

Generative Modeling

• Asks the question: can we build a model to approximate a data
  distribution?

• Formally, we are given a data distribution p_data(x) and a finite sample
  {x_1, …, x_n} drawn from this distribution

• Problem: can we find a model p_θ such that p_θ ≈ p_data?

• Why might this be useful?
Why care about Generative Models?

Oft over-used quote:

“What I cannot create, I do not understand”

-R. Feynman

Why care about Generative Models?

• Classic uses:

  • Through maximum likelihood, we can fit interpretable parameters θ for a
    hand-designed model p_θ

  • Learn a joint distribution p(x, y) with labels y, and transform it to
    the conditional p(y | x)

• More interesting uses:

  • Fast simulation of compute-heavy tasks

  • Interpolation between distributions
Traditional MLE Approach

• We are given a finite sample {x_1, …, x_n} from a data distribution p_data

• We construct a parametric model p_θ for the distribution, and build a
  likelihood L(θ) = ∏_i p_θ(x_i)

• In practice, we optimize through MCMC or other means, and obtain
  θ̂ = argmax_θ L(θ) (a toy numeric sketch follows)
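To make the recipe concrete, a minimal sketch that fits a Gaussian model
p_θ = N(μ, σ²) by numerically maximizing the log-likelihood; the data,
optimizer, and initialization are assumed for illustration. For a Gaussian
the MLE also has the familiar closed form (sample mean and standard
deviation), which doubles as a check:

```python
import numpy as np
from scipy.optimize import minimize

# Finite sample from the (unknown) data distribution.
rng = np.random.default_rng(0)
x = rng.normal(loc=2.0, scale=0.5, size=1000)

def neg_log_likelihood(params):
    """Negative log-likelihood of the Gaussian model p_theta = N(mu, sigma^2)."""
    mu, log_sigma = params  # optimize log(sigma) so sigma stays positive
    sigma = np.exp(log_sigma)
    return -np.sum(
        -0.5 * ((x - mu) / sigma) ** 2 - np.log(sigma) - 0.5 * np.log(2 * np.pi)
    )

result = minimize(neg_log_likelihood, x0=[0.0, 0.0])
mu_hat, sigma_hat = result.x[0], np.exp(result.x[1])

print(mu_hat, sigma_hat)  # ~2.0, ~0.5
print(x.mean(), x.std())  # closed-form Gaussian MLE gives the same answer
```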
Generative Model Taxonomy

Maximum likelihood

• Explicit density

  • Tractable density: fully visible belief nets (NADE, MADE, PixelRNN);
    change of variables models (nonlinear ICA)

  • Approximate density

    • Variational: VAE

    • Markov chain: Boltzmann machine

• Implicit density

  • Markov chain: GSN

  • Direct: GAN

(Taxonomy figure from I. Goodfellow)
Generative
Adversarial Networks

Generative Adversarial Networks

• As before, we have a data distribution p_data(x)

• We cast the process of building a model of the data distribution as a
  two-player game between a generator and a discriminator

• Our generator G has a latent prior z ~ p(z) and maps this to sample
  space: x = G(z)

• This implicitly defines a distribution p_G(x)

• Our discriminator tells how fake or real a sample looks via a score D(x)
  (a probability; here we follow the original paper and read D(x) as
  Prob[Real], though implementations sometimes flip it to Prob[Fake])
Generative Adversarial Networks

[Architecture diagram: the generator transforms noise into a realistic
sample; the discriminator receives real data and generated samples, and
must distinguish real samples from fake samples.]
Vanilla GAN formulation

• How can we jointly optimize G and D?

• Construct a two-person zero-sum minimax game with a value V:

  min_G max_D V(D, G) = E_{x ~ p_data}[log D(x)] + E_{z ~ p(z)}[log(1 − D(G(z)))]

• We have an inner maximization by D and an outer minimization by G
  (a training-loop sketch follows)
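To make the alternating optimization concrete, a minimal PyTorch sketch of
the minimax game under assumed toy choices (1-D Gaussian data, small MLPs,
Adam, 128-sample batches); it illustrates the formulation, not a recipe
that reliably converges:

```python
import torch
import torch.nn as nn

# Toy setup: p_data = N(2, 0.5^2) on the real line, latent prior p(z) = N(0, 1).
def sample_data(n):
    return 2.0 + 0.5 * torch.randn(n, 1)

G = nn.Sequential(nn.Linear(1, 32), nn.ReLU(), nn.Linear(32, 1))
D = nn.Sequential(nn.Linear(1, 32), nn.ReLU(), nn.Linear(32, 1), nn.Sigmoid())

opt_G = torch.optim.Adam(G.parameters(), lr=1e-3)
opt_D = torch.optim.Adam(D.parameters(), lr=1e-3)
eps = 1e-8  # numerical safety inside the logs

for step in range(5000):
    # Inner maximization: push D(x_real) -> 1 and D(G(z)) -> 0.
    x_real = sample_data(128)
    x_fake = G(torch.randn(128, 1)).detach()  # block gradients into G
    loss_D = -(torch.log(D(x_real) + eps).mean()
               + torch.log(1 - D(x_fake) + eps).mean())
    opt_D.zero_grad()
    loss_D.backward()
    opt_D.step()

    # Outer minimization: G minimizes log(1 - D(G(z))).
    loss_G = torch.log(1 - D(G(torch.randn(128, 1))) + eps).mean()
    opt_G.zero_grad()
    loss_G.backward()
    opt_G.step()
```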
Theoretical Guarantees

• Let's step through the proof for equilibrium and implicit minimization
  of the JSD
Theoretical Guarantees

• From the original paper, we know the optimal discriminator for a fixed G:

  D*_G(x) = p_data(x) / (p_data(x) + p_G(x))

• Define the generator objective, solving for the infinite-capacity
  discriminator: C(G) = max_D V(G, D)

• We can rewrite the value as

  C(G) = E_{x ~ p_data}[log D*_G(x)] + E_{x ~ p_G}[log(1 − D*_G(x))]

• Simplifying notation, and applying some algebra,

  C(G) = −log 4 + KL(p_data || (p_data + p_G)/2) + KL(p_G || (p_data + p_G)/2)

• But we recognize this as a summation of two KL divergences

• And can combine these into the Jensen–Shannon divergence:

  C(G) = −log 4 + 2 · JSD(p_data || p_G)

• This yields a unique global minimum, C(G) = −log 4, attained precisely
  when p_G = p_data (a numeric sanity check follows)
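A quick numerical sanity check of the algebra above, on two assumed
discrete distributions: computing C(G) directly by plugging in D*_G, and
via the JSD identity, gives the same number.

```python
import numpy as np

# Two arbitrary discrete distributions standing in for p_data and p_G.
p_data = np.array([0.5, 0.3, 0.2])
p_g    = np.array([0.2, 0.2, 0.6])

def kl(p, q):
    return np.sum(p * np.log(p / q))

# C(G) computed directly: plug the optimal discriminator
# D*(x) = p_data(x) / (p_data(x) + p_g(x)) into the value function.
d_star = p_data / (p_data + p_g)
c_direct = np.sum(p_data * np.log(d_star)) + np.sum(p_g * np.log(1 - d_star))

# C(G) computed via the Jensen-Shannon divergence identity.
m = 0.5 * (p_data + p_g)
jsd = 0.5 * kl(p_data, m) + 0.5 * kl(p_g, m)
c_jsd = -np.log(4) + 2 * jsd

print(c_direct, c_jsd)  # identical up to floating-point error
```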
Theoretical Guarantees

• TL;DR from the previous proof is as follows

• If D and G are allowed to come from the space of all continuous functions,
  then we have:

  • A unique equilibrium

  • The discriminator admits a flat posterior, i.e., D*(x) = 1/2 everywhere

  • The implicit distribution defined by the generator exactly recovers the
    data distribution: p_G = p_data
Pitfalls

GANs in Practice

• This minimax formulation saturates quickly, causing the gradients
  propagating from the discriminator to vanish when the generator does
  poorly. Non-saturating formulation (gradient comparison below):

  • Before: G minimizes E_{z ~ p(z)}[log(1 − D(G(z)))]

  • After: G instead maximizes E_{z ~ p(z)}[log D(G(z))]
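Why the heuristic helps, in a tiny autograd sketch (the logit values are
assumed for illustration): when D confidently rejects a fake, the
saturating loss has a vanishing gradient with respect to the
discriminator's logit, while the non-saturating loss does not.

```python
import torch

# a is the discriminator's pre-sigmoid logit for a generated sample;
# a very negative a means D confidently calls the sample fake.
for logit in (-6.0, 0.0, 2.0):
    a = torch.tensor(logit, requires_grad=True)
    d = torch.sigmoid(a)  # D(G(z))
    # Saturating generator loss: minimize log(1 - D(G(z))).
    (g_sat,) = torch.autograd.grad(torch.log(1 - d), a)

    a2 = torch.tensor(logit, requires_grad=True)
    d2 = torch.sigmoid(a2)
    # Non-saturating loss: minimize -log D(G(z)), i.e., maximize log D.
    (g_ns,) = torch.autograd.grad(-torch.log(d2), a2)

    print(f"D(G(z))={d.item():.3f}  saturating grad={g_sat.item():+.4f}  "
          f"non-saturating grad={g_ns.item():+.4f}")
```

At D(G(z)) ≈ 0 the saturating gradient is ≈ 0 while the non-saturating one
stays near −1, which is exactly the regime where the generator needs a
learning signal.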
Failure Modes (ways in which GANs fail)

• Mode collapse: the generator learns to produce a single mode of the data
  distribution and stops there

• Vanishing/exploding gradients from the discriminator to the generator

• The generator produces garbage that nonetheless fools the discriminator
Introspection

• GANs do not naturally have a metric for convergence

• Ideally, all losses settle at their theoretical equilibrium values
  (the discriminator outputting 1/2 everywhere)

• Often does not happen in practice
Modifications

GANs in Practice

• Even when using the "non-saturating" heuristic, convergence is still
  difficult

• "Tricks" are needed to make things work on real data

• Two major (pre-2017) categories:

  • Architectural guidelines

  • Side information / information-theoretic methods
DCGAN

• Deep Convolutional Generative Adversarial Networks provide a set of
  ad-hoc guidelines for building architectures for images

• Enabled a lot of current progress in GAN research
DCGAN

[DCGAN architecture figure; a generator sketch in this spirit follows]
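In the spirit of those guidelines, a minimal sketch of a DCGAN-style
generator for 64×64 RGB images (the latent size, channel widths, and
output resolution are assumptions, loosely following Radford et al.):

```python
import torch
import torch.nn as nn

# DCGAN-style generator: project a latent vector z up to a 64x64 RGB image
# with strided transposed convolutions (no pooling, no fully connected
# hidden layers), batch norm, ReLU activations, and a tanh output.
def dcgan_generator(z_dim=100, feat=64):
    return nn.Sequential(
        nn.ConvTranspose2d(z_dim, feat * 8, 4, 1, 0, bias=False),   # 1x1 -> 4x4
        nn.BatchNorm2d(feat * 8), nn.ReLU(True),
        nn.ConvTranspose2d(feat * 8, feat * 4, 4, 2, 1, bias=False),  # -> 8x8
        nn.BatchNorm2d(feat * 4), nn.ReLU(True),
        nn.ConvTranspose2d(feat * 4, feat * 2, 4, 2, 1, bias=False),  # -> 16x16
        nn.BatchNorm2d(feat * 2), nn.ReLU(True),
        nn.ConvTranspose2d(feat * 2, feat, 4, 2, 1, bias=False),      # -> 32x32
        nn.BatchNorm2d(feat), nn.ReLU(True),
        nn.ConvTranspose2d(feat, 3, 4, 2, 1, bias=False),             # -> 64x64
        nn.Tanh(),
    )

G = dcgan_generator()
z = torch.randn(16, 100, 1, 1)  # batch of latent vectors
print(G(z).shape)               # torch.Size([16, 3, 64, 64])
```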
Side Information

• Conditional GAN, Auxiliary Classifier GAN, InfoGAN, etc.

• Key idea: can we leverage side information (a label, description, trait,
  etc.) to produce either better-quality or conditional samples?

• The discriminator can either be shown the side information or tasked with
  reconstructing it
Conditional GAN (CGAN)

• The side-information variable (e.g., a class label) is passed to both the
  generator and the discriminator

• The generator learns side-information-conditional distributions, as it is
  able to disentangle this from the overall latent space (wiring sketch
  below)
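A minimal sketch of the CGAN wiring under assumed toy sizes: a one-hot
label y is concatenated with z at the generator input and with x at the
discriminator input, so both players see the side information.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

Z_DIM, N_CLASSES, X_DIM = 16, 10, 2  # illustrative sizes

# Generator sees [z, one_hot(y)]; discriminator sees [x, one_hot(y)].
G = nn.Sequential(nn.Linear(Z_DIM + N_CLASSES, 64), nn.ReLU(),
                  nn.Linear(64, X_DIM))
D = nn.Sequential(nn.Linear(X_DIM + N_CLASSES, 64), nn.ReLU(),
                  nn.Linear(64, 1), nn.Sigmoid())

z = torch.randn(8, Z_DIM)
y = F.one_hot(torch.randint(0, N_CLASSES, (8,)), N_CLASSES).float()

x_fake = G(torch.cat([z, y], dim=1))      # sample conditioned on y
score = D(torch.cat([x_fake, y], dim=1))  # D judges the (sample, label) pair
print(x_fake.shape, score.shape)          # (8, 2), (8, 1)
```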
Auxiliary Classifier GAN (ACGAN)

• Similar to CGAN, the label variable is passed to the generator

• The discriminator is tasked with jointly learning real-vs-fake and
  reconstructing the label variable being passed in (two-head sketch below)
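A minimal sketch of the ACGAN discriminator under assumed toy sizes: a
shared trunk with two heads, one scoring real-vs-fake and one classifying
the label, so the label is reconstructed rather than shown.

```python
import torch
import torch.nn as nn

class ACGANDiscriminator(nn.Module):
    """Shared trunk, two heads: real-vs-fake score and class logits."""
    def __init__(self, x_dim=2, n_classes=10):
        super().__init__()
        self.trunk = nn.Sequential(nn.Linear(x_dim, 64), nn.ReLU())
        self.adv_head = nn.Linear(64, 1)           # real vs. fake (logit)
        self.cls_head = nn.Linear(64, n_classes)   # reconstruct the label

    def forward(self, x):
        h = self.trunk(x)
        return self.adv_head(h), self.cls_head(h)

D = ACGANDiscriminator()
x = torch.randn(8, 2)
adv_logit, cls_logits = D(x)
# Training combines both heads, e.g.:
#   loss_D = bce_with_logits(adv_logit, real_or_fake) + cross_entropy(cls_logits, y)
print(adv_logit.shape, cls_logits.shape)  # (8, 1), (8, 10)
```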
InfoGAN

• Instead of the latent variables being known a priori from a dataset, make
  parts of the latent space randomly drawn from different distributions

  • Bernoulli, Normal, multiclass, etc.

• Make the discriminator reconstruct these arbitrary elements of the latent
  space that are passed into the generator

• Learns disentangled features (maximizes mutual information; a
  latent-sampling sketch follows)
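A minimal sketch of how an InfoGAN latent vector can be assembled (the
code types and dimensions are assumed for illustration): structured codes
drawn from categorical, Bernoulli, and Normal distributions, concatenated
with unstructured noise; a Q head on the discriminator is then trained to
recover the codes.

```python
import torch
import torch.nn.functional as F

def sample_infogan_latent(batch, noise_dim=62, n_cat=10, n_bern=2, n_cont=2):
    """Unstructured noise plus structured codes c that a Q-network must recover."""
    z = torch.randn(batch, noise_dim)                                    # incompressible noise
    c_cat = F.one_hot(torch.randint(0, n_cat, (batch,)), n_cat).float()  # categorical code
    c_bern = torch.bernoulli(torch.full((batch, n_bern), 0.5))           # Bernoulli codes
    c_cont = torch.randn(batch, n_cont)                                  # continuous codes
    codes = torch.cat([c_cat, c_bern, c_cont], dim=1)
    return torch.cat([z, codes], dim=1), codes

latent, codes = sample_infogan_latent(8)
print(latent.shape, codes.shape)  # (8, 76), (8, 14)
```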
Conclusion

• Showed theoretical guarantees of GANs (in unrealistic settings) and
  convergence properties

• Discussed pitfalls of GANs

• Explored a few basic methods with ~no theory that try to improve GANs

  • Architecture improvements, side information

• I didn't talk about Minibatch Discrimination or Feature Matching
Questions?

Thanks!

References

(1) I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair,
    A. Courville, and Y. Bengio. Generative Adversarial Nets. NIPS, 2014.

(2) A. Radford, L. Metz, and S. Chintala. Unsupervised Representation Learning
    with Deep Convolutional Generative Adversarial Networks. 2015.

(3) M. Mirza and S. Osindero. Conditional Generative Adversarial Nets. 2014.

(4) A. Odena, C. Olah, and J. Shlens. Conditional Image Synthesis with
    Auxiliary Classifier GANs. ICML, 2017.

(5) X. Chen, Y. Duan, R. Houthooft, J. Schulman, I. Sutskever, and P. Abbeel.
    InfoGAN: Interpretable Representation Learning by Information Maximizing
    Generative Adversarial Nets. 2016.

(6) T. Salimans, I. Goodfellow, W. Zaremba, V. Cheung, A. Radford, and X. Chen.
    Improved Techniques for Training GANs. NIPS, 2016.
