CLoSD: Closing the Loop between Simulation and Diffusion for multi-task character control

Tevet, Guy; Raab, Sigal; Cohan, Setareh; Reda, Daniele; Luo, Zhengyi; Peng, Xue Bin; Bermano, Amit H.; van de Panne, Michiel

Computer Science > Computer Vision and Pattern Recognition

arXiv:2410.03441 (cs)

[Submitted on 4 Oct 2024]

Title:CLoSD: Closing the Loop between Simulation and Diffusion for multi-task character control

Authors:Guy Tevet, Sigal Raab, Setareh Cohan, Daniele Reda, Zhengyi Luo, Xue Bin Peng, Amit H. Bermano, Michiel van de Panne

View PDF HTML (experimental)

Abstract:Motion diffusion models and Reinforcement Learning (RL) based control for physics-based simulations have complementary strengths for human motion generation. The former is capable of generating a wide variety of motions, adhering to intuitive control such as text, while the latter offers physically plausible motion and direct interaction with the environment. In this work, we present a method that combines their respective strengths. CLoSD is a text-driven RL physics-based controller, guided by diffusion generation for various tasks. Our key insight is that motion diffusion can serve as an on-the-fly universal planner for a robust RL controller. To this end, CLoSD maintains a closed-loop interaction between two modules -- a Diffusion Planner (DiP), and a tracking controller. DiP is a fast-responding autoregressive diffusion model, controlled by textual prompts and target locations, and the controller is a simple and robust motion imitator that continuously receives motion plans from DiP and provides feedback from the environment. CLoSD is capable of seamlessly performing a sequence of different tasks, including navigation to a goal location, striking an object with a hand or foot as specified in a text prompt, sitting down, and getting up. this https URL

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2410.03441 [cs.CV]
	(or arXiv:2410.03441v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2410.03441

Submission history

From: Guy Tevet [view email]
[v1] Fri, 4 Oct 2024 13:56:48 UTC (987 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:CLoSD: Closing the Loop between Simulation and Diffusion for multi-task character control

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:CLoSD: Closing the Loop between Simulation and Diffusion for multi-task character control

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators