Learning to Follow Instructions in Text-Based Games

Authors
Tuli, Mathieu [1]
Li, Andrew C.
Vaezipoor, Pashootan
Klassen, Toryn Q. [2]
Sanner, Scott
McIlraith, Sheila A. [2]
Affiliations
[1] Univ Toronto, Toronto, ON, Canada
[2] Schwartz Reisman Inst Technol & Soc, Toronto, ON, Canada
Funding
Natural Sciences and Engineering Research Council of Canada;
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Text-based games present a unique class of sequential decision-making problem in which agents interact with a partially observable, simulated environment via actions and observations conveyed through natural language. Such observations typically include instructions that, in a reinforcement learning (RL) setting, can directly or indirectly guide a player towards completing reward-worthy tasks. In this work, we study the ability of RL agents to follow such instructions. We conduct experiments that show that the performance of state-of-the-art text-based game agents is largely unaffected by the presence or absence of such instructions, and that these agents are typically unable to execute tasks to completion. To further study and address the task of instruction following, we equip RL agents with an internal structured representation of natural language instructions in the form of Linear Temporal Logic (LTL), a formal language that is increasingly used for temporally extended reward specification in RL. Our framework both supports and highlights the benefit of understanding the temporal semantics of instructions and of measuring progress towards achieving such temporally extended behaviour. Experiments with 500+ games in TextWorld demonstrate the superior performance of our approach.
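The abstract's idea of measuring progress towards a temporally extended instruction can be illustrated with LTL formula progression, a standard technique that rewrites a formula after each step into what remains to be satisfied. The following is a minimal sketch only, not the authors' implementation: the tuple encoding, the function name `progress`, and the proposition names `carrot_taken` and `carrot_cooked` are assumptions made for illustration.

```python
# Minimal LTL progression sketch (illustrative; not the paper's code).
# Encoding assumption: atoms are strings, True/False are constants,
# and compound formulas are tuples such as ("eventually", "p").

def _and(a, b):
    # Conjunction with constant simplification.
    if a is False or b is False:
        return False
    if a is True:
        return b
    if b is True:
        return a
    return ("and", a, b)

def _or(a, b):
    # Disjunction with constant simplification.
    if a is True or b is True:
        return True
    if a is False:
        return b
    if b is False:
        return a
    return ("or", a, b)

def _not(a):
    # Negation with constant simplification.
    if isinstance(a, bool):
        return not a
    return ("not", a)

def progress(formula, true_props):
    """Return what remains of `formula` after observing one step in which
    exactly the propositions in `true_props` hold."""
    if isinstance(formula, bool):
        return formula
    if isinstance(formula, str):
        return formula in true_props
    op = formula[0]
    if op == "not":
        return _not(progress(formula[1], true_props))
    if op == "and":
        return _and(progress(formula[1], true_props),
                    progress(formula[2], true_props))
    if op == "or":
        return _or(progress(formula[1], true_props),
                   progress(formula[2], true_props))
    if op == "next":
        return formula[1]
    if op == "eventually":
        return _or(progress(formula[1], true_props), formula)
    if op == "always":
        return _and(progress(formula[1], true_props), formula)
    if op == "until":
        return _or(progress(formula[2], true_props),
                   _and(progress(formula[1], true_props), formula))
    raise ValueError(f"unknown operator: {op!r}")

# Hypothetical instruction: eventually take the carrot and eventually cook it.
task = ("and", ("eventually", "carrot_taken"), ("eventually", "carrot_cooked"))
after_take = progress(task, {"carrot_taken"})
after_cook = progress(after_take, {"carrot_cooked"})
print(after_take)
print(after_cook)
```

In this sketch, a formula that progresses to `True` marks a completed instruction and one that progresses to `False` marks a violated one, so an agent could in principle be rewarded or penalised at those points. How the paper itself maps instructions to LTL and shapes rewards is not specified in this record.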
Pages: 15