Recoloring, Stylization, Retexturing, Appearance Editing
(Meka et al., SIGGRAPH 2016; Bonneel et al., Eurographics STAR, 2017; Beigpour et al., JPI, 2018)
Design Objectives
• Accessibility – Possible on readily available smartphones.
• Speed – All post-capture processing takes at most a few seconds.
• Interaction – Editing is possible at interactive rates.
• Quality – Rendered output looks plausible.

Feasibility
• Built-In Depth Sensors
• High-End GPU
• Graphics API
Optimization-based Methods and Learning-based Methods
-- Mostly not real-time or interactive
-- Resource intensive
-- Limited to low image resolutions
-- Both classes of methods assume only diffuse reflection
[Figure: Input and Albedo, Bell et al., SIGGRAPH 2014; Albedo and Input, Lettry et al., Pacific Graphics 2018]
[Figure: pipeline overview. RGB-D Input → Specularity Removal (Diffuse, Specular) → Intrinsic Decomposition of Diffuse + Depth (Albedo, Shading) → Image Editing → Output]
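The layers named in the pipeline above can be recombined after editing. The following is a minimal sketch, assuming the common image formation model in which a pixel equals albedo times shading plus a specular term, with a scalar (single-color) shading value per pixel. The function names `recompose_pixel`, `recolor`, and `render` are hypothetical and not taken from the slides, and plain Python lists stand in for what would be per-pixel GPU work.

```python
from typing import List, Tuple

Pixel = Tuple[float, float, float]


def recompose_pixel(albedo: Pixel, shading: float, specular: Pixel) -> Pixel:
    """Rebuild one RGB pixel as albedo * shading + specular, clamped to [0, 1].

    Assumption: multiplicative albedo/shading model with additive specularity.
    """
    r, g, b = (min(1.0, max(0.0, a * shading + sp)) for a, sp in zip(albedo, specular))
    return (r, g, b)


def recolor(albedo: List[Pixel], tint: Pixel) -> List[Pixel]:
    """Edit only the albedo layer by multiplying each pixel with an RGB tint."""
    return [(r * tint[0], g * tint[1], b * tint[2]) for r, g, b in albedo]


def render(albedo: List[Pixel], shading: List[float], specular: List[Pixel]) -> List[Pixel]:
    """Recombine the three layers pixel by pixel."""
    return [recompose_pixel(a, s, sp) for a, s, sp in zip(albedo, shading, specular)]


if __name__ == "__main__":
    albedo = [(0.5, 0.5, 0.5)]
    shading = [0.8]
    specular = [(0.1, 0.1, 0.1)]
    print(render(albedo, shading, specular))
    print(render(recolor(albedo, (1.0, 0.5, 0.5)), shading, specular))
```

Because the tint touches only the albedo, shading and highlights survive the edit unchanged, which is the practical reason for separating the layers before editing.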
Existing specularity removal methods work quite well for images in a controlled setting.
[Figure: Input, Fu et al., 2019, Akashi et al., 2016, Shen et al., 2009, Ground Truth]
However, these methods do not generalize well to real-world images.
[Figure: Input, Fu et al., 2019, Akashi et al., 2016, Shen et al., 2009]
Specularity Removal
• Reducing positive coefficients of sub-bands (high and low frequency) in the masked region.
• Inpainting missing information by iterative bilateral hole-filling.
[Figure: Input split into Intensity (In) and Chroma (In); Specular Mask; High Freq. and Low Freq. Sub-bands; Intensity (Out) and Chroma (Out); resulting Diffuse and Specular layers]
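The sub-band step above can be illustrated on a single scanline of intensity values. This is only a sketch under stated assumptions: the slides do not name the band decomposition, so a difference of two box blurs stands in for it, and the names `box_blur` and `suppress_highlights` as well as the gains are hypothetical.

```python
from typing import List


def box_blur(signal: List[float], radius: int) -> List[float]:
    """Mean filter with edge clamping."""
    n = len(signal)
    out = []
    for i in range(n):
        window = [signal[min(n - 1, max(0, j))] for j in range(i - radius, i + radius + 1)]
        out.append(sum(window) / len(window))
    return out


def suppress_highlights(intensity: List[float], mask: List[bool],
                        gain_high: float = 0.2, gain_low: float = 0.5) -> List[float]:
    """Scale down positive band coefficients where the specular mask is set.

    Assumption: bands are differences of box blurs (high = I - blur1,
    low = blur1 - blur2, residual = blur2); the gains are illustrative.
    """
    smooth_fine = box_blur(intensity, 1)
    smooth_coarse = box_blur(intensity, 4)
    high = [i - s for i, s in zip(intensity, smooth_fine)]
    low = [f - c for f, c in zip(smooth_fine, smooth_coarse)]

    def damp(band: List[float], gain: float) -> List[float]:
        # Only positive coefficients inside the mask are reduced.
        return [c * gain if m and c > 0.0 else c for c, m in zip(band, mask)]

    high = damp(high, gain_high)
    low = damp(low, gain_low)
    # Summing all bands and the residual reconstructs the signal.
    return [h + l + r for h, l, r in zip(high, low, smooth_coarse)]


if __name__ == "__main__":
    scanline = [0.2] * 4 + [1.0] + [0.2] * 4
    mask = [False] * 4 + [True] + [False] * 4
    print(suppress_highlights(scanline, mask))
```

Pixels outside the mask are reconstructed unchanged, because their coefficients are untouched and the bands sum back to the input.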
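Intrinsic Decomposition

$$\operatorname*{arg\,min}_{a,\,s}\; E(a, s), \qquad E(a, s) = \frac{1}{2}\left(\lambda_d E_d + \lambda_{ra} E_{ra} + \lambda_{rs} E_{rs}\right) + \lambda_{sp} \lVert a \rVert_1$$

Here $E_d$ is the data term, $E_{ra}$ the Retinex-albedo term, $E_{rs}$ the Retinex-shading term, and $\lVert a \rVert_1$ the $L_1$ regularization of the albedo.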
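To make the structure of the energy concrete, the sketch below combines three already-computed term values with the L1 penalty on the albedo. It assumes the factor 1/2 applies to the weighted sum of the data and Retinex terms; the slides do not define the individual terms or the solver, so the term values are plain inputs here. The default weights and the names `total_energy` and `soft_threshold` are hypothetical. Soft-thresholding is included only because it is the standard proximal step for an L1 penalty, not because the slides state that it is used.

```python
from typing import Sequence


def total_energy(e_d: float, e_ra: float, e_rs: float, albedo: Sequence[float],
                 lam_d: float = 1.0, lam_ra: float = 1.0,
                 lam_rs: float = 1.0, lam_sp: float = 0.1) -> float:
    """0.5 * (weighted data and Retinex terms) + lam_sp * ||albedo||_1.

    Assumption: the weights are placeholders, not values from the slides.
    """
    weighted_terms = 0.5 * (lam_d * e_d + lam_ra * e_ra + lam_rs * e_rs)
    sparsity = lam_sp * sum(abs(a) for a in albedo)
    return weighted_terms + sparsity


def soft_threshold(value: float, threshold: float) -> float:
    """Proximal operator of threshold * |x|: shrink towards zero."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


if __name__ == "__main__":
    print(total_energy(1.0, 2.0, 3.0, [0.5, -0.5], lam_sp=2.0))
    print(soft_threshold(0.3, 0.1), soft_threshold(-0.05, 0.1))
```

The shrinkage step shows why an L1 term favors sparse solutions: small entries are set exactly to zero rather than merely reduced.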
[Figure: Intrinsic Decomposition of Diffuse + Depth into Shading and Albedo]
[Figure: Input, Ours, Fu et al., 2019, Akashi et al., 2016, Shen et al., 2009]
[Figure: Input, Only Low Freq. Reduced, Only High Freq. Reduced, Both Freq. Reduced, Both Freq. Reduced + Chroma Inpainting]
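The chroma inpainting in the last panel refers to iterative bilateral hole-filling. A minimal one-dimensional sketch of that idea follows, assuming that missing chroma values are marked as `None` and that an intensity scanline serves as the guide for the range weight. The function name `bilateral_fill`, the choice of guide, and all parameter values are hypothetical.

```python
import math
from typing import List, Optional


def bilateral_fill(chroma: List[Optional[float]], guide: List[float],
                   radius: int = 2, sigma_s: float = 1.5, sigma_r: float = 0.1,
                   max_iters: int = 50) -> List[Optional[float]]:
    """Fill None entries from known neighbors, weighted by distance and guide similarity.

    Each pass fills holes that have at least one known neighbor in range,
    so the filled region grows inwards from the hole boundary.
    """
    values = list(chroma)
    for _ in range(max_iters):
        holes = [i for i, v in enumerate(values) if v is None]
        if not holes:
            break
        updated = list(values)
        for i in holes:
            num = 0.0
            den = 0.0
            for j in range(max(0, i - radius), min(len(values), i + radius + 1)):
                if values[j] is None:
                    continue
                spatial = math.exp(-((i - j) ** 2) / (2.0 * sigma_s ** 2))
                similarity = math.exp(-((guide[i] - guide[j]) ** 2) / (2.0 * sigma_r ** 2))
                num += spatial * similarity * values[j]
                den += spatial * similarity
            if den > 0.0:
                updated[i] = num / den
        values = updated
    return values


if __name__ == "__main__":
    chroma = [0.3, 0.3, None, None, None, 0.7, 0.7]
    guide = [0.2, 0.2, 0.2, 0.2, 0.8, 0.8, 0.8]
    print(bilateral_fill(chroma, guide))
```

The range weight keeps filled values from bleeding across the edge in the guide, so each side of the hole inherits chroma from its own surface.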
[Figure: Input, Only Low Freq. Specularity Removal, Only High Freq. Specularity Removal, Specularity Enhanced]
[Figure: Input, Ours (RGB-D), Bell et al., 2014 (RGB), Jeon et al., 2014 (RGB-D), Lettry et al., 2018 (RGB)]
Our results are comparable to existing techniques.
[Figure: Input, Ours (RGB-D), Bell et al., 2014 (RGB), Jeon et al., 2014 (RGB-D), Lettry et al., 2018 (RGB)]
Existing methods do not perform comparably well on outdoor scenes.
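$$\operatorname*{arg\,min}_{a,\,s}\; E(a, s), \qquad E(a, s) = \frac{1}{2}\left(\lambda_{ra} E_{ra} + \lambda_d E_d + \lambda_{rs} E_{rs}\right) + \lambda_{sp} \lVert a \rVert_1$$

Here $E_d$ is the data term, $E_{ra}$ the Retinex-albedo term, $E_{rs}$ the Retinex-shading term, and $\lVert a \rVert_1$ the $L_1$ regularization of the albedo. The labels $E_1$, $E_2$, $E_3$ mark the terms that are added step by step in the comparison.
[Figure: Input, Albedo – Only $E_1$, Albedo – $E_1 + E_2$, Albedo – $E_1 + E_2 + E_3$]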
[Figure: Input separated into Diffuse and Specular; Diffuse separated into Albedo and Shading; edited results: Silk, Mystique, Glass, God Rays, Toon]
[Figure: Input, Mystique/Hulk, Glass, Silk, Tattoo]
[Figure: Input, Low-density Fog, High-density Fog]
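A depth map makes a fog effect like the one above straightforward to sketch. The slides do not state which fog model is used; the example below assumes the widely used exponential fog, in which transmittance falls off as exp(-density * depth). The function name `apply_fog`, the fog color, and the density values are hypothetical.

```python
import math
from typing import Tuple

Pixel = Tuple[float, float, float]


def apply_fog(color: Pixel, depth: float, density: float,
              fog_color: Pixel = (0.8, 0.8, 0.8)) -> Pixel:
    """Blend a pixel towards the fog color based on its depth.

    Assumption: exponential fog, transmittance = exp(-density * depth).
    """
    transmittance = math.exp(-density * depth)
    r, g, b = (c * transmittance + f * (1.0 - transmittance)
               for c, f in zip(color, fog_color))
    return (r, g, b)


if __name__ == "__main__":
    pixel = (0.2, 0.4, 0.6)
    print(apply_fog(pixel, depth=5.0, density=0.05))  # low-density fog
    print(apply_fog(pixel, depth=5.0, density=0.5))   # high-density fog
```

Raising the density moves distant pixels closer to the fog color while pixels at zero depth stay unchanged, which matches the qualitative difference between the low-density and high-density panels.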
[Figure: Input, God Rays, God Rays + Fog]
[Figure: Input, Comic, Quickshift, Halftone]
Albedo initialization with chromaticity leads to color shifts in the edited results.
[Figure: Input, Ground Truth, Ours; Albedo, Edited]
Our specular pixel identification step is not robust and can therefore make the diffuse output darker.
[Figure: Input, Diffuse]
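For reference, the sketch below shows what a chromaticity-based albedo initialization computes. It assumes the sum-normalized definition of chromaticity (each channel divided by the channel sum); the slides do not state which definition is used, and the function name `chromaticity` is hypothetical.

```python
from typing import Tuple

Pixel = Tuple[float, float, float]


def chromaticity(rgb: Pixel, eps: float = 1e-6) -> Pixel:
    """Normalize an RGB pixel by its channel sum, discarding brightness.

    Assumption: sum-normalized chromaticity; near-black pixels map to neutral gray.
    """
    total = sum(rgb)
    if total < eps:
        return (1.0 / 3.0, 1.0 / 3.0, 1.0 / 3.0)
    r, g, b = (c / total for c in rgb)
    return (r, g, b)


if __name__ == "__main__":
    print(chromaticity((0.2, 0.4, 0.4)))
    print(chromaticity((0.1, 0.2, 0.2)))
```

Both example pixels produce the same chromaticity although one is twice as bright, so an initialization of this kind carries no information about how light or dark a surface is.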
• A novel specularity removal method that works for real-world images.
• A system for intrinsic decomposition of images on smartphones.
• Our GPU-based implementation allows for interactive image editing.
• We showcase a variety of applications.

Future Work
• Relax some of the existing assumptions, such as a single-color illuminant and only direct illumination.
• Perform super-resolution of the depth map, thereby enhancing the result quality.
• Make the specular mask generation more robust.





Interactive Photo Editing on Smartphones via Intrinsic Decomposition
