Learning to Follow Instructions in Text-Based Games

Cited by: 0

Authors:

Tuli, Mathieu [1]

Li, Andrew C.

Vaezipoor, Pashootan

Klassen, Toryn Q. [2]

Sanner, Scott

McIlraith, Sheila A. [2]

Affiliations:

[1] Univ Toronto, Toronto, ON, Canada

[2] Schwartz Reisman Inst Technol & Soc, Toronto, ON, Canada

Source:

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022 | 2022

Funding:

Natural Sciences and Engineering Research Council of Canada (NSERC)

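Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes: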

081104; 0812; 0835; 1405

Abstract:

Text-based games present a unique class of sequential decision-making problems in which agents interact with a partially observable, simulated environment via actions and observations conveyed through natural language. Such observations typically include instructions that, in a reinforcement learning (RL) setting, can directly or indirectly guide a player towards completing reward-worthy tasks. In this work, we study the ability of RL agents to follow such instructions. We conduct experiments that show that the performance of state-of-the-art text-based game agents is largely unaffected by the presence or absence of such instructions, and that these agents are typically unable to execute tasks to completion. To further study and address the task of instruction following, we equip RL agents with an internal structured representation of natural language instructions in the form of Linear Temporal Logic (LTL), a formal language that is increasingly used for temporally extended reward specification in RL. Our framework both supports and highlights the benefit of understanding the temporal semantics of instructions and of measuring progress towards achievement of such temporally extended behaviour. Experiments with 500+ games in TextWorld demonstrate the superior performance of our approach.

Pages: 15

