Guiding Pretraining in Reinforcement Learning with Large Language Models

Du, Yuqing; Watkins, Olivia; Wang, Zihan; Colas, Cédric; Darrell, Trevor; Abbeel, Pieter; Gupta, Abhishek; Andreas, Jacob

Computer Science > Machine Learning

arXiv:2302.06692 (cs)

[Submitted on 13 Feb 2023 (v1), last revised 15 Sep 2023 (this version, v2)]

Title:Guiding Pretraining in Reinforcement Learning with Large Language Models

Authors:Yuqing Du, Olivia Watkins, Zihan Wang, Cédric Colas, Trevor Darrell, Pieter Abbeel, Abhishek Gupta, Jacob Andreas

View PDF

Abstract:Reinforcement learning algorithms typically struggle in the absence of a dense, well-shaped reward function. Intrinsically motivated exploration methods address this limitation by rewarding agents for visiting novel states or transitions, but these methods offer limited benefits in large environments where most discovered novelty is irrelevant for downstream tasks. We describe a method that uses background knowledge from text corpora to shape exploration. This method, called ELLM (Exploring with LLMs) rewards an agent for achieving goals suggested by a language model prompted with a description of the agent's current state. By leveraging large-scale language model pretraining, ELLM guides agents toward human-meaningful and plausibly useful behaviors without requiring a human in the loop. We evaluate ELLM in the Crafter game environment and the Housekeep robotic simulator, showing that ELLM-trained agents have better coverage of common-sense behaviors during pretraining and usually match or improve performance on a range of downstream tasks. Code available at this https URL.

Comments:	ICML 2023
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2302.06692 [cs.LG]
	(or arXiv:2302.06692v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2302.06692

Submission history

From: Yuqing Du [view email]
[v1] Mon, 13 Feb 2023 21:16:03 UTC (6,111 KB)
[v2] Fri, 15 Sep 2023 02:42:40 UTC (6,919 KB)

Computer Science > Machine Learning

Title:Guiding Pretraining in Reinforcement Learning with Large Language Models

Submission history

Access Paper:

References & Citations

1 blog link

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Guiding Pretraining in Reinforcement Learning with Large Language Models

Submission history

Access Paper:

References & Citations

1 blog link

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators