LogoSticker: Inserting Logos into Diffusion Models for Customized Generation

Zhu, Mingkang; Chen, Xi; Wang, Zhongdao; Zhao, Hengshuang; Jia, Jiaya

Computer Science > Computer Vision and Pattern Recognition

arXiv:2407.13752 (cs)

[Submitted on 18 Jul 2024]

Title:LogoSticker: Inserting Logos into Diffusion Models for Customized Generation

Authors:Mingkang Zhu, Xi Chen, Zhongdao Wang, Hengshuang Zhao, Jiaya Jia

View PDF HTML (experimental)

Abstract:Recent advances in text-to-image model customization have underscored the importance of integrating new concepts with a few examples. Yet, these progresses are largely confined to widely recognized subjects, which can be learned with relative ease through models' adequate shared prior knowledge. In contrast, logos, characterized by unique patterns and textual elements, are hard to establish shared knowledge within diffusion models, thus presenting a unique challenge. To bridge this gap, we introduce the task of logo insertion. Our goal is to insert logo identities into diffusion models and enable their seamless synthesis in varied contexts. We present a novel two-phase pipeline LogoSticker to tackle this task. First, we propose the actor-critic relation pre-training algorithm, which addresses the nontrivial gaps in models' understanding of the potential spatial positioning of logos and interactions with other objects. Second, we propose a decoupled identity learning algorithm, which enables precise localization and identity extraction of logos. LogoSticker can generate logos accurately and harmoniously in diverse contexts. We comprehensively validate the effectiveness of LogoSticker over customization methods and large models such as DALLE~3. \href{this https URL}{Project page}.

Comments:	ECCV2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2407.13752 [cs.CV]
	(or arXiv:2407.13752v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2407.13752

Submission history

From: Mingkang Zhu [view email]
[v1] Thu, 18 Jul 2024 17:54:49 UTC (9,419 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:LogoSticker: Inserting Logos into Diffusion Models for Customized Generation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:LogoSticker: Inserting Logos into Diffusion Models for Customized Generation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators