SceneGenAgent: Precise Industrial Scene Generation with Coding Agent

Xia, Xiao; Zhang, Dan; Liao, Zibo; Hou, Zhenyu; Sun, Tianrui; Li, Jing; Fu, Ling; Dong, Yuxiao

Computer Science > Computation and Language

arXiv:2410.21909 (cs)

[Submitted on 29 Oct 2024 (v1), last revised 15 May 2025 (this version, v2)]

Title:SceneGenAgent: Precise Industrial Scene Generation with Coding Agent

Authors:Xiao Xia, Dan Zhang, Zibo Liao, Zhenyu Hou, Tianrui Sun, Jing Li, Ling Fu, Yuxiao Dong

View PDF

Abstract:The modeling of industrial scenes is essential for simulations in industrial manufacturing. While large language models (LLMs) have shown significant progress in generating general 3D scenes from textual descriptions, generating industrial scenes with LLMs poses a unique challenge due to their demand for precise measurements and positioning, requiring complex planning over spatial arrangement. To address this challenge, we introduce SceneGenAgent, an LLM-based agent for generating industrial scenes through C# code. SceneGenAgent ensures precise layout planning through a structured and calculable format, layout verification, and iterative refinement to meet the quantitative requirements of industrial scenarios. Experiment results demonstrate that LLMs powered by SceneGenAgent exceed their original performance, reaching up to 81.0% success rate in real-world industrial scene generation tasks and effectively meeting most scene generation requirements. To further enhance accessibility, we construct SceneInstruct, a dataset designed for fine-tuning open-source LLMs to integrate into SceneGenAgent. Experiments show that fine-tuning open-source LLMs on SceneInstruct yields significant performance improvements, with Llama3.1-70B approaching the capabilities of GPT-4o. Our code and data are available at this https URL .

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG); Software Engineering (cs.SE)
Cite as:	arXiv:2410.21909 [cs.CL]
	(or arXiv:2410.21909v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2410.21909

Submission history

From: Xiao Xia [view email]
[v1] Tue, 29 Oct 2024 10:01:40 UTC (1,750 KB)
[v2] Thu, 15 May 2025 16:40:39 UTC (1,750 KB)

Computer Science > Computation and Language

Title:SceneGenAgent: Precise Industrial Scene Generation with Coding Agent

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:SceneGenAgent: Precise Industrial Scene Generation with Coding Agent

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators