Learning to Follow Instructions in Text-Based Games

Citations: 0
Authors
Tuli, Mathieu [1]
Li, Andrew C.
Vaezipoor, Pashootan
Klassen, Toryn Q. [2]
Sanner, Scott
McIlraith, Sheila A. [2]
Affiliations
[1] Univ Toronto, Toronto, ON, Canada
[2] Schwartz Reisman Inst Technol & Soc, Toronto, ON, Canada
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Text-based games present a unique class of sequential decision-making problems in which agents interact with a partially observable, simulated environment via actions and observations conveyed through natural language. Such observations typically include instructions that, in a reinforcement learning (RL) setting, can directly or indirectly guide a player towards completing reward-worthy tasks. In this work, we study the ability of RL agents to follow such instructions. We conduct experiments showing that the performance of state-of-the-art text-based game agents is largely unaffected by the presence or absence of such instructions, and that these agents are typically unable to execute tasks to completion. To further study and address the task of instruction following, we equip RL agents with an internal structured representation of natural language instructions in the form of Linear Temporal Logic (LTL), a formal language that is increasingly used for temporally extended reward specification in RL. Our framework both supports and highlights the benefit of understanding the temporal semantics of instructions and of measuring progress towards achieving such temporally extended behaviour. Experiments with 500+ games in TextWorld demonstrate the superior performance of our approach.
Pages: 15