Energy-Guided Optimization for Personalized Image Editing with Pretrained Text-to-Image Diffusion Models

Jiang, Rui; Fu, Xinghe; Zheng, Guangcong; Li, Teng; Yao, Taiping; Li, Xi

Computer Science > Computer Vision and Pattern Recognition

arXiv:2503.04215 (cs)

[Submitted on 6 Mar 2025]

Title:Energy-Guided Optimization for Personalized Image Editing with Pretrained Text-to-Image Diffusion Models

Authors:Rui Jiang, Xinghe Fu, Guangcong Zheng, Teng Li, Taiping Yao, Xi Li

View PDF HTML (experimental)

Abstract:The rapid advancement of pretrained text-driven diffusion models has significantly enriched applications in image generation and editing. However, as the demand for personalized content editing increases, new challenges emerge especially when dealing with arbitrary objects and complex scenes. Existing methods usually mistakes mask as the object shape prior, which struggle to achieve a seamless integration result. The mostly used inversion noise initialization also hinders the identity consistency towards the target object. To address these challenges, we propose a novel training-free framework that formulates personalized content editing as the optimization of edited images in the latent space, using diffusion models as the energy function guidance conditioned by reference text-image pairs. A coarse-to-fine strategy is proposed that employs text energy guidance at the early stage to achieve a natural transition toward the target class and uses point-to-point feature-level image energy guidance to perform fine-grained appearance alignment with the target object. Additionally, we introduce the latent space content composition to enhance overall identity consistency with the target. Extensive experiments demonstrate that our method excels in object replacement even with a large domain gap, highlighting its potential for high-quality, personalized image editing.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2503.04215 [cs.CV]
	(or arXiv:2503.04215v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2503.04215

Submission history

From: Xi Li [view email]
[v1] Thu, 6 Mar 2025 08:52:29 UTC (22,241 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Energy-Guided Optimization for Personalized Image Editing with Pretrained Text-to-Image Diffusion Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Energy-Guided Optimization for Personalized Image Editing with Pretrained Text-to-Image Diffusion Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators