Algorithmic Improvements for Deep Reinforcement Learning Applied to Interactive Fiction

Cited by: 0
Authors
Jain, Vishal [1, 3]
Fedus, William [1, 2]
Larochelle, Hugo [1, 2, 5]
Precup, Doina [1, 3, 4, 5]
Bellemare, Marc G. [1, 2, 3, 5]
Affiliations
[1] Mila, Montreal, PQ, Canada
[2] Google Brain, Mountain View, CA, USA
[3] McGill Univ, Montreal, PQ, Canada
[4] DeepMind, London, England
[5] CIFAR, Toronto, ON, Canada
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Text-based games are a natural challenge domain for deep reinforcement learning algorithms. Their state and action spaces are combinatorially large, their reward function is sparse, and they are partially observable: the agent is informed of the consequences of its actions through textual feedback. In this paper we emphasize this latter point and consider the design of a deep reinforcement learning agent that can play from feedback alone. Our design recognizes and takes advantage of the structural characteristics of text-based games. We first propose a contextualisation mechanism, based on accumulated reward, which simplifies the learning problem and mitigates partial observability. We then study different methods that rely on the notion that most actions are ineffectual in any given situation, following Zahavy et al.'s idea of an admissible action. We evaluate these techniques in a series of text-based games of increasing difficulty based on the TextWorld framework, as well as the iconic game ZORK. Empirically, we find that these techniques improve the performance of a baseline deep reinforcement learning agent applied to text-based games.
Pages: 4328-4336
Number of pages: 9