Algorithmic Improvements for Deep Reinforcement Learning Applied to Interactive Fiction

Cited by: 0
Authors
Jain, Vishal [1 ,3 ]
Fedus, William [1 ,2 ]
Larochelle, Hugo [1 ,2 ,5 ]
Precup, Doina [1 ,3 ,4 ,5 ]
Bellemare, Marc G. [1 ,2 ,3 ,5 ]
Affiliations
[1] Mila, Montreal, PQ, Canada
[2] Google Brain, Mountain View, CA USA
[3] McGill Univ, Montreal, PQ, Canada
[4] DeepMind, London, England
[5] CIFAR, Toronto, ON, Canada
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Text-based games are a natural challenge domain for deep reinforcement learning algorithms. Their state and action spaces are combinatorially large, their reward function is sparse, and they are partially observable: the agent is informed of the consequences of its actions through textual feedback. In this paper we emphasize this latter point and consider the design of a deep reinforcement learning agent that can play from feedback alone. Our design recognizes and takes advantage of the structural characteristics of text-based games. We first propose a contextualisation mechanism, based on accumulated reward, which simplifies the learning problem and mitigates partial observability. We then study different methods that rely on the notion that most actions are ineffectual in any given situation, following Zahavy et al.'s idea of an admissible action. We evaluate these techniques in a series of text-based games of increasing difficulty based on the TextWorld framework, as well as the iconic game ZORK. Empirically, we find that these techniques improve the performance of a baseline deep reinforcement learning agent applied to text-based games.
Pages: 4328 - 4336
Page count: 9
Related papers
50 in total
  • [1] An application of deep reinforcement learning to algorithmic trading
    Theate, Thibaut
    Ernst, Damien
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 173
  • [2] Rainbow: Combining Improvements in Deep Reinforcement Learning
    Hessel, Matteo
    Modayil, Joseph
    van Hasselt, Hado
    Schaul, Tom
    Ostrovski, Georg
    Dabney, Will
    Horgan, Dan
    Piot, Bilal
    Azar, Mohammad
    Silver, David
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 3215 - 3222
  • [3] Overview of Deep Reinforcement Learning Improvements and Applications
    Zhang, Junjie
    Zhang, Cong
    Chien, Wei-Che
    [J]. JOURNAL OF INTERNET TECHNOLOGY, 2021, 22 (02): : 239 - 255
  • [4] Deep Robust Reinforcement Learning for Practical Algorithmic Trading
    Li, Yang
    Zheng, Wanshan
    Zheng, Zibin
    [J]. IEEE ACCESS, 2019, 7 : 108014 - 108022
  • [5] Interactive Narrative Personalization with Deep Reinforcement Learning
    Wang, Pengcheng
    Rowe, Jonathan
    Min, Wookhee
    Mott, Bradford
    Lester, James
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3852 - 3858
  • [6] Sentiment and Knowledge Based Algorithmic Trading with Deep Reinforcement Learning
    Nan, Abhishek
    Perumal, Anandh
    Zaiane, Osmar R.
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2022, PT I, 2022, 13426 : 167 - 180
  • [7] Algorithmic fairness and bias mitigation for clinical machine learning with deep reinforcement learning
    Jenny Yang
    Andrew A. S. Soltan
    David W. Eyre
    David A. Clifton
    [J]. Nature Machine Intelligence, 2023, 5 : 884 - 894
  • [8] Algorithmic fairness and bias mitigation for clinical machine learning with deep reinforcement learning
    Yang, Jenny
    Soltan, Andrew A. S.
    Eyre, David W.
    Clifton, David A.
    [J]. NATURE MACHINE INTELLIGENCE, 2023, 5 (08) : 884 - +
  • [9] An Approach to Interactive Deep Reinforcement Learning for Serious Games
    Dobrovsky, Aline
    Borghoff, Uwe M.
    Hofmann, Marko
    [J]. 2016 7TH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2016, : 85 - 90
  • [10] Interactive Spoken Content Retrieval by Deep Reinforcement Learning
    Wu, Yen-Chen
    Lin, Tzu-Hsiang
    Chen, Yang-De
    Lee, Hung-Yi
    Lee, Lin-Shan
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 943 - 947