IPDreamer: Appearance-Controllable 3D Object Generation with Complex Image Prompts

Zeng, Bohan; Li, Shanglin; Feng, Yutang; Yang, Ling; Li, Hong; Gao, Sicheng; Liu, Jiaming; He, Conghui; Zhang, Wentao; Liu, Jianzhuang; Zhang, Baochang; Yan, Shuicheng

Computer Science > Computer Vision and Pattern Recognition

arXiv:2310.05375 (cs)

[Submitted on 9 Oct 2023 (v1), last revised 22 Oct 2024 (this version, v6)]

Title:IPDreamer: Appearance-Controllable 3D Object Generation with Complex Image Prompts

Authors:Bohan Zeng, Shanglin Li, Yutang Feng, Ling Yang, Hong Li, Sicheng Gao, Jiaming Liu, Conghui He, Wentao Zhang, Jianzhuang Liu, Baochang Zhang, Shuicheng Yan

View PDF HTML (experimental)

Abstract:Recent advances in 3D generation have been remarkable, with methods such as DreamFusion leveraging large-scale text-to-image diffusion-based models to guide 3D object generation. These methods enable the synthesis of detailed and photorealistic textured objects. However, the appearance of 3D objects produced by such text-to-3D models is often unpredictable, and it is hard for single-image-to-3D methods to deal with images lacking a clear subject, complicating the generation of appearance-controllable 3D objects from complex images. To address these challenges, we present IPDreamer, a novel method that captures intricate appearance features from complex $\textbf{I}$mage $\textbf{P}$rompts and aligns the synthesized 3D object with these extracted features, enabling high-fidelity, appearance-controllable 3D object generation. Our experiments demonstrate that IPDreamer consistently generates high-quality 3D objects that align with both the textual and complex image prompts, highlighting its promising capability in appearance-controlled, complex 3D object generation. Our code is available at this https URL.

Comments:	20 pages, 12 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2310.05375 [cs.CV]
	(or arXiv:2310.05375v6 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2310.05375

Submission history

From: Shanglin Li [view email]
[v1] Mon, 9 Oct 2023 03:11:08 UTC (2,851 KB)
[v2] Mon, 13 Nov 2023 13:14:50 UTC (4,050 KB)
[v3] Wed, 13 Mar 2024 02:56:47 UTC (9,572 KB)
[v4] Thu, 23 May 2024 15:45:48 UTC (11,747 KB)
[v5] Fri, 24 May 2024 09:17:09 UTC (11,747 KB)
[v6] Tue, 22 Oct 2024 09:52:42 UTC (19,322 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:IPDreamer: Appearance-Controllable 3D Object Generation with Complex Image Prompts

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:IPDreamer: Appearance-Controllable 3D Object Generation with Complex Image Prompts

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators