High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs
[Figure: semantic label map; classes: Building, Tree, Car, Road, Sidewalk]
Introduction
Outline
• Introduction
• Related work
• Method
• Results
• Applications
• Conclusion
Related Work
Generative Adversarial Network (GAN) Image-to-Image Translation
• Low resolution: Goodfellow et al. [2014], Radford et al. [2015], Arjovsky et al. [2017], Johnson et al. [2016], Isola et al. [2017]
• Lack fine details: Chen and Koltun [2017]
• Low resolution: Zhu et al. [2016], Zhang et al. [2017]
Our Work
Generative Adversarial Network (GAN) Image-to-Image Translation
High Resolution Image Semantic Manipulation
• Low resolution: Goodfellow et al. [2014], Radford et al. [2015], Arjovsky et al. [2017], Johnson et al. [2016], Isola et al. [2017]
• Lack fine details: Chen and Koltun [2017]
• Low resolution: Zhu et al. [2016], Zhang et al. [2017]
Outline
• Introduction
• Related work
• Method
  • Baseline method
  • Our method
• Results
• Applications
• Conclusion
Baseline Method: pix2pix
• Discriminator training
[Figure: discriminator training diagram; labels: input, fake]
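The discriminator training step named above can be illustrated with a minimal sketch. It assumes the usual conditional-GAN setup, in which the discriminator scores (input, real) pairs as real and (input, fake) pairs as fake. The function names, the binary cross-entropy form, and the 0.5 averaging factor are illustrative assumptions, not details taken from the slides.

```python
import math

def bce(prob, target):
    """Binary cross-entropy for a single probability (clamped away from 0 and 1)."""
    eps = 1e-12
    prob = min(max(prob, eps), 1.0 - eps)
    return -(target * math.log(prob) + (1.0 - target) * math.log(1.0 - prob))

def discriminator_loss(d_real_probs, d_fake_probs):
    """Hypothetical helper: average loss over discriminator outputs.

    d_real_probs: outputs for (input, real image) pairs, target 1.
    d_fake_probs: outputs for (input, generated image) pairs, target 0.
    """
    real_term = sum(bce(p, 1.0) for p in d_real_probs) / len(d_real_probs)
    fake_term = sum(bce(p, 0.0) for p in d_fake_probs) / len(d_fake_probs)
    return 0.5 * (real_term + fake_term)  # the 0.5 weighting is an assumption
```

A discriminator that separates the two kinds of pairs well gets a lower loss than one that outputs 0.5 everywhere.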
Coarse-to-fine Generator Multi-scale Discriminators Robust Objective
[Figure: architecture diagram; labels: Residual blocks, Di, D1, D2, D3, real, synthesized, match]
*Similar ideas in Denton et al. 2015, Huang et al. 2017, Chen et al. 2017, Zhang et al. 2017
Coarse-to-fine Generator Multi-scale Discriminators Robust Objective
[Figure: architecture diagram; labels: global, local, Residual blocks, Di, D1, D2, D3, real, synthesized, match]
*Similar ideas in Durugkar et al. 2016, Iizuka et al. 2017, Zhang et al. 2017
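One way to picture the multi-scale discriminators is as an image pyramid, with one discriminator per resolution. The sketch below builds such a pyramid. The three levels follow the D1, D2, D3 labels in the figure, while the factor of 2 and the use of 2x2 average pooling are assumptions made for illustration. Images are plain lists of rows.

```python
def downsample2x(img):
    """Halve a 2-D image by averaging non-overlapping 2x2 blocks."""
    h = len(img) // 2 * 2
    w = len(img[0]) // 2 * 2
    return [
        [(img[y][x] + img[y][x + 1] + img[y + 1][x] + img[y + 1][x + 1]) / 4.0
         for x in range(0, w, 2)]
        for y in range(0, h, 2)
    ]

def image_pyramid(img, num_scales=3):
    """Return [full, half, quarter, ...] resolutions, one per discriminator."""
    levels = [img]
    for _ in range(num_scales - 1):
        levels.append(downsample2x(levels[-1]))
    return levels
```

Under this reading, the discriminator at the coarsest level sees the most global context, and the one at full resolution focuses on fine detail.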
Coarse-to-fine Generator Multi-scale Discriminators Robust Objective
[Figure: architecture diagram; labels: Residual blocks, Di, D1, D2, D3, real, synthesized, match]
*Similar ideas in Larsen et al. 2016
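Assuming the "match" labels in the figure refer to matching intermediate discriminator features between real and synthesized images, a feature-matching term could be sketched as follows. The layer structure, the L1 distance, and the equal per-layer weighting are illustrative assumptions.

```python
def l1_mean(a, b):
    """Mean absolute difference between two equally long feature vectors."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)

def feature_matching_loss(real_feats, fake_feats):
    """Hypothetical helper: average L1 distance over discriminator layers.

    real_feats / fake_feats: one flat list of activations per layer,
    extracted from the same discriminator for a real and a synthesized image.
    """
    assert len(real_feats) == len(fake_feats)
    per_layer = [l1_mean(r, f) for r, f in zip(real_feats, fake_feats)]
    return sum(per_layer) / len(per_layer)
```

The term is zero when the synthesized image produces the same activations as the real one, which gives the generator a training signal that does not depend only on the discriminator's final decision.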
Our Method
• Extending to high resolution
• Using instance-level segmentation maps
• Boundary improvement
• Multi-modal results using feature embedding
Our Method
• Boundary improvement
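One plausible reading of the boundary improvement is that an instance-level segmentation map is converted into a binary boundary map that is fed to the network alongside the labels. The sketch below marks a pixel as boundary when any 4-connected neighbor has a different instance ID. The function name and the choice of 4-neighborhood are assumptions.

```python
def boundary_map(inst):
    """inst: 2-D list of instance IDs. Returns a 0/1 map of the same size."""
    h, w = len(inst), len(inst[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            for dy, dx in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                ny, nx = y + dy, x + dx
                inside = 0 <= ny < h and 0 <= nx < w
                if inside and inst[ny][nx] != inst[y][x]:
                    out[y][x] = 1  # neighbor belongs to another instance
                    break
    return out
```

Two adjacent objects of the same class share a label in a semantic map, but they still produce a boundary here because their instance IDs differ.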
[Figure: feature embedding diagram; labels: Labels, Features, Image generation network, Feature encoder network, Instance-wise average pooling]
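The instance-wise average pooling step named in the figure can be sketched as follows: every pixel's encoder feature is replaced by the mean feature of the instance it belongs to. A single feature channel is assumed to keep the example short, and the function name is hypothetical.

```python
def instance_average_pool(features, inst):
    """features: 2-D list of floats; inst: 2-D list of instance IDs (same shape)."""
    sums, counts = {}, {}
    for feat_row, inst_row in zip(features, inst):
        for f, i in zip(feat_row, inst_row):
            sums[i] = sums.get(i, 0.0) + f
            counts[i] = counts.get(i, 0) + 1
    # Broadcast each instance's mean back to all of its pixels.
    return [[sums[i] / counts[i] for i in inst_row] for inst_row in inst]
```

After pooling, each instance carries one feature value, so changing that value is a compact way to change the appearance of a single object.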
Results
• Comparisons with
  • pix2pix [Isola et al. 2017]
  • CRN [Chen and Koltun 2017]
• Datasets
  • Cityscapes [Cordts et al. 2016]
  • NYU [Silberman et al. 2012]
  • ADE20K [Zhou et al. 2017]
  • Helen Face [Smith et al. 2013]
  • CelebA-HQ [Karras et al. 2017]
Results
• Quantitative comparisons (Cityscapes)
• Semantic segmentation scores
Method        Pixel Acc (%)   Mean IoU
pix2pix       78.34           0.39
CRN           70.55           0.35
Ours          83.78           0.64
Oracle (GT)   84.29           0.69
[Figure: pairwise comparison charts of Ours against pix2pix (6.2) and CRN (13.8)]
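For reference, the two segmentation scores reported above can be computed as in the sketch below. It assumes `pred` holds per-pixel class predictions (for example, from a segmentation network run on a synthesized image) and `gt` holds the label map used as input. Both are flat lists, and the function name is hypothetical.

```python
def segmentation_scores(pred, gt, num_classes):
    """Return (pixel_accuracy, mean_iou) for two flat lists of class indices."""
    pairs = list(zip(pred, gt))
    correct = sum(p == g for p, g in pairs)
    ious = []
    for c in range(num_classes):
        inter = sum(p == c and g == c for p, g in pairs)
        union = sum(p == c or g == c for p, g in pairs)
        if union > 0:  # skip classes absent from both prediction and ground truth
            ious.append(inter / union)
    return correct / len(gt), sum(ious) / len(ious)
```

Pixel accuracy counts every pixel equally, so large classes such as road dominate it. Mean IoU averages over classes, which makes it more sensitive to small objects.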
Results
• Qualitative comparisons
Results on NYU dataset
[Figure panels: Edges, Synthesized, Ground truth]
Results on CelebA-HQ
[Figure panels: Edges, Synthesized, Ground truth]
Applications: style changing
Applications: label changing
Applications: adding objects
Applications: adding strokes
Applications: face-to-painting