Papers Notes_5_ GAN--Generative Adversarial Nets

最新推荐文章于 2021-11-01 20:45:42 发布

原创最新推荐文章于 2021-11-01 20:45:42 发布 · 237 阅读

0 ·

CC 4.0 BY-SA版权

paper 专栏收录该内容

14 篇文章

订阅专栏

这篇博客深入探讨了生成对抗网络（GANs）的原理和训练过程。GAN由一个判别器（D）和一个生成器（G）组成，其中G试图捕捉数据分布，而D则估计样本是否来自训练数据。训练过程中，D和G通过优化价值函数V(D,G)交替进行，初期着重优化G的输出。博客还提到了激活函数的选择以及训练策略，并提供了相关代码和超参数链接。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

Papers Notes_5_ GAN--Generative Adversarial Nets

Adversarial Nets
Training
References

Adversarial Nets

a discriminative model G + a generative model D
G captures the data distribution
(the generative model generates samples by passing random noise through a multilayer perceptron)
D estimates the probability that a sample came from the training data rather than G
unique solution→
G recovers the training data distribution
D is equal to 1/2 everwhere

to learn the generator’s distribution $p_g$ over data $x$
define a prior on input noise variable $p_z(z)$
represent a mapping to data sapce as $G(z;\theta_g)$
define $D(x;\theta_d)$ outputs a single scalar, the probability that $x$ came from the data rather than $p_g$

train $D$ to maximize the probability of assigning the correct label to both training examples and examples ffrom $G$
simultaneously train $G$ to minimize $l o g (1 - D (G (z)))$
value function $V (D, G)$ :
$\min \limits_G \max \limits_D(D,G)=\mathbb E_{x\sim p_{data}(x)}[logD(x)]+\mathbb E_{z\sim p_z(z)}[log(1-D(G(z)))]$

Training

Early in learning, maximize $l o g (G (z))$ rather than minimize $l o g (1 - D (G (z)))$ provides much more stronger gradients

Optimizing D to completion in the inner loop of training is computationally prohibtive, and on finite datasets would result in overfitting.
Instead, we alternate between k steps of optimizing D and one step of optimizing G

在这里插入图片描述
the generator net used a mixture of rectifier linear activations and sigmoid activations
the discriminator used maxout activations and dropout
use noise as the input to only the bootommost layer of the generator network