TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models

Deutch, Gilad; Gal, Rinon; Garibi, Daniel; Patashnik, Or; Cohen-Or, Daniel

Computer Science > Computer Vision and Pattern Recognition

arXiv:2408.00735 (cs)

[Submitted on 1 Aug 2024]

Title:TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models

Authors:Gilad Deutch, Rinon Gal, Daniel Garibi, Or Patashnik, Daniel Cohen-Or

View PDF HTML (experimental)

Abstract:Diffusion models have opened the path to a wide range of text-based image editing frameworks. However, these typically build on the multi-step nature of the diffusion backwards process, and adapting them to distilled, fast-sampling methods has proven surprisingly challenging. Here, we focus on a popular line of text-based editing frameworks - the ``edit-friendly'' DDPM-noise inversion approach. We analyze its application to fast sampling methods and categorize its failures into two classes: the appearance of visual artifacts, and insufficient editing strength. We trace the artifacts to mismatched noise statistics between inverted noises and the expected noise schedule, and suggest a shifted noise schedule which corrects for this offset. To increase editing strength, we propose a pseudo-guidance approach that efficiently increases the magnitude of edits without introducing new artifacts. All in all, our method enables text-based image editing with as few as three diffusion steps, while providing novel insights into the mechanisms behind popular text-based editing approaches.

Comments:	Project page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
Cite as:	arXiv:2408.00735 [cs.CV]
	(or arXiv:2408.00735v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2408.00735

Submission history

From: Rinon Gal [view email]
[v1] Thu, 1 Aug 2024 17:27:28 UTC (9,194 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators