Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation

Lee, Junsung; Kang, Minsoo; Han, Bohyung

arXiv:2409.08077 (cs)

[Submitted on 12 Sep 2024]

Abstract: We propose a simple but effective training-free approach tailored to diffusion-based image-to-image translation. Our approach revises the original noise prediction network of a pretrained diffusion model by introducing a noise correction term. We formulate the noise correction term as the difference between two noise predictions: one is computed by the denoising network from a progressive interpolation of the source and target prompt embeddings, while the other is the noise prediction conditioned on the source prompt embedding. The final noise prediction is given by a linear combination of the standard denoising term and the noise correction term, where the former is designed to reconstruct the regions that must be preserved while the latter aims to effectively edit the regions of interest relevant to the target prompt. Our approach can be easily incorporated into existing image-to-image translation methods based on diffusion models. Extensive experiments verify that the proposed technique achieves outstanding performance with low latency and consistently improves existing frameworks when combined with them.
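
The combination described in the abstract can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' implementation: the abstract does not give the interpolation schedule, the combination weights, or which prediction serves as the standard denoising term, so the linear schedule, the scalar weight `gamma`, and the use of the source-prompt prediction as the standard term are all assumptions. The names `interpolate_prompts`, `corrected_noise`, and `toy_eps` are hypothetical, and `toy_eps` is a stand-in for a real pretrained denoising network.

```python
from typing import Callable, List

Vector = List[float]
# Hypothetical signature of a denoiser: (noisy latent, timestep, prompt embedding) -> noise.
EpsFn = Callable[[Vector, int, Vector], Vector]


def interpolate_prompts(src_emb: Vector, tgt_emb: Vector,
                        step: int, num_steps: int) -> Vector:
    """Blend source and target prompt embeddings.

    Assumption: a linear schedule whose target weight rises from 0 at the
    first step to 1 at the last step. The abstract only says "progressive".
    """
    beta = step / max(num_steps - 1, 1)
    return [(1.0 - beta) * s + beta * g for s, g in zip(src_emb, tgt_emb)]


def corrected_noise(eps_fn: EpsFn, x_t: Vector, t: int,
                    src_emb: Vector, tgt_emb: Vector,
                    step: int, num_steps: int, gamma: float = 1.0) -> Vector:
    """Standard denoising term plus a weighted noise correction term."""
    mixed_emb = interpolate_prompts(src_emb, tgt_emb, step, num_steps)
    eps_src = eps_fn(x_t, t, src_emb)    # prediction with the source prompt
    eps_mix = eps_fn(x_t, t, mixed_emb)  # prediction with the interpolated prompt
    # Correction term: difference between the two noise predictions.
    # Assumption: the source-prompt prediction plays the role of the
    # standard denoising term, and gamma is a single scalar weight.
    return [e_s + gamma * (e_m - e_s) for e_s, e_m in zip(eps_src, eps_mix)]


def toy_eps(x_t: Vector, t: int, emb: Vector) -> Vector:
    """Toy stand-in for a pretrained noise prediction network (ignores t)."""
    bias = sum(emb) / len(emb)
    return [0.1 * x + bias for x in x_t]


if __name__ == "__main__":
    x = [1.0, 2.0, 3.0]
    src, tgt = [0.0, 0.0], [1.0, 1.0]
    for step in range(5):
        print(step, corrected_noise(toy_eps, x, t=10, src_emb=src,
                                    tgt_emb=tgt, step=step, num_steps=5))
```

Under these assumptions the correction term vanishes at the first step, where the interpolated embedding equals the source embedding, and with `gamma = 1` the result at the last step coincides with the target-prompt prediction.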

Comments:	16 pages, 5 figures, 6 tables
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2409.08077 [cs.CV]
	(or arXiv:2409.08077v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2409.08077

Submission history

From: Junsung Lee
[v1] Thu, 12 Sep 2024 14:30:45 UTC (11,124 KB)
