Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning

被引:0
|
作者
Carta, Thomas [1 ]
Romac, Clement [1 ,2 ]
Wolf, Thomas [2 ]
Lamprier, Sylvain [3 ]
Sigaud, Olivier [4 ]
Oudeyer, Pierre-Yves [1 ]
机构
[1] Univ Bordeaux, Inria Flowers, Bordeaux, France
[2] Hugging Face, Paris, France
[3] Univ Angers, LERIA, SFR MATHSTIC, F-49000 Angers, France
[4] Sorbonne Univ, ISIR, Paris, France
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent works successfully leveraged Large Language Models' (LLM) abilities to capture abstract knowledge about world's physics to solve decision-making problems. Yet, the alignment between LLMs' knowledge and the environment can be wrong and limit functional competence due to lack of grounding. In this paper, we study an approach (named GLAM) to achieve this alignment through functional grounding: we consider an agent using an LLM as a policy that is progressively updated as the agent interacts with the environment, leveraging online Reinforcement Learning to improve its performance to solve goals. Using an interactive textual environment designed to study higher-level forms of functional grounding, and a set of spatial and navigation tasks, we study several scientific questions: 1) Can LLMs boost sample efficiency for online learning of various RL tasks? 2) How can it boost different forms of generalization? 3) What is the impact of online learning? We study these questions by functionally grounding several variants (size, architecture) of FLAN-T5.
引用
收藏
页数:38
相关论文
共 50 条
  • [41] InteraRec: Interactive Recommendations Using Multimodal Large Language Models
    Karra, Saketh Reddy
    Tulabandhula, Theja
    TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2024 WORKSHOPS, RAFDA AND IWTA, 2024, 14658 : 32 - 43
  • [42] Developing an Interactive OpenMP Programming Book with Large Language Models
    Yi, Xinyao
    Wang, Anjia
    Yan, Yonghong
    Liao, Chunhua
    ADVANCING OPENMP FOR FUTURE ACCELERATORS, IWOMP 2024, 2024, 15195 : 176 - 194
  • [43] Recent Advances in Interactive Machine Translation With Large Language Models
    Wang, Yanshu
    Zhang, Jinyi
    Shi, Tianrong
    Deng, Dashuai
    Tian, Ye
    Matsumoto, Tadahiro
    IEEE ACCESS, 2024, 12 : 179353 - 179382
  • [44] Interactive learning environments?
    Greener, Sue
    INTERACTIVE LEARNING ENVIRONMENTS, 2012, 20 (02) : 101 - 102
  • [45] De novo drug design as GPT language modeling: large chemistry models with supervised and reinforcement learning
    Ye, Gavin
    JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2024, 38 (01)
  • [46] Industrial Internet of Things With Large Language Models (LLMs): An Intelligence-Based Reinforcement Learning Approach
    Ren, Yuzheng
    Zhang, Haijun
    Yu, Fei Richard
    Li, Wei
    Zhao, Pincan
    He, Ying
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (05) : 4136 - 4152
  • [47] ePUZSOLVED: Learning communication models through interactive online puzzles
    Bangero, Hyacinth Balediata
    COMMUNICATION TEACHER, 2024, 38 (02) : 95 - 101
  • [48] Automated speech therapy through personalized pronunciation correction using reinforcement learning and large language models
    Lakshminarayanan, Ritika
    Shaik, Ayesha
    Balasundaram, Ananthakrishnan
    RESULTS IN ENGINEERING, 2025, 25
  • [49] Aligning large language models with radiologists by reinforcement learning from AI feedback for chest CT reports
    Yang, Lingrui
    Zhou, Yuxing
    Qi, Jun
    Zhen, Xiantong
    Sun, Li
    Shi, Shan
    Su, Qinghua
    Yang, Xuedong
    EUROPEAN JOURNAL OF RADIOLOGY, 2025, 184
  • [50] A working model for intercultural learning and engagement in collaborative online language learning environments
    Lawrence, Geoff
    INTERCULTURAL EDUCATION, 2013, 24 (04) : 303 - 314