Algorithmic Improvements for Deep Reinforcement Learning Applied to Interactive Fiction

Cited by: 0

Authors:

Jain, Vishal [1,3]

Fedus, William [1,2]

Larochelle, Hugo [1,2,5]

Precup, Doina [1,3,4,5]

Bellemare, Marc G. [1,2,3,5]

Affiliations:

[1] Mila, Montreal, PQ, Canada

[2] Google Brain, Mountain View, CA, USA

[3] McGill Univ, Montreal, PQ, Canada

[4] DeepMind, London, England

[5] CIFAR, Toronto, ON, Canada

Source:

THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2020 / Vol. 34

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Text-based games are a natural challenge domain for deep reinforcement learning algorithms. Their state and action spaces are combinatorially large, their reward function is sparse, and they are partially observable: the agent is informed of the consequences of its actions through textual feedback. In this paper we emphasize this latter point and consider the design of a deep reinforcement learning agent that can play from feedback alone. Our design recognizes and takes advantage of the structural characteristics of text-based games. We first propose a contextualisation mechanism, based on accumulated reward, which simplifies the learning problem and mitigates partial observability. We then study different methods that rely on the notion that most actions are ineffectual in any given situation, following Zahavy et al.'s idea of an admissible action. We evaluate these techniques in a series of text-based games of increasing difficulty based on the TextWorld framework, as well as the iconic game ZORK. Empirically, we find that these techniques improve the performance of a baseline deep reinforcement learning agent applied to text-based games.
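The two ideas named in the abstract can be illustrated with a small tabular sketch. This is not the authors' agent, which is a deep reinforcement learning model; it is a minimal Q-learning stand-in, assuming that "contextualisation" can be approximated by keying values on the pair (observation, accumulated reward) and that ineffectual actions can be pruned once they are seen to have no effect. The class name `ScoreContextAgent`, its parameters, and the `had_effect` flag are all hypothetical names introduced for illustration.

```python
import random
from collections import defaultdict


class ScoreContextAgent:
    """Toy tabular Q-learner keyed on (observation, accumulated reward).

    Illustrative only: a hypothetical stand-in for a deep RL agent.
    """

    def __init__(self, actions, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
        self.actions = list(actions)
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.rng = random.Random(seed)
        self.q = defaultdict(float)       # (context, action) -> value estimate
        self.rejected = defaultdict(set)  # context -> actions seen to do nothing

    @staticmethod
    def context(observation, score):
        # The accumulated reward separates situations whose text looks identical.
        return (observation, score)

    def candidates(self, ctx):
        # Skip actions previously observed to be ineffectual in this context;
        # fall back to the full action list if everything was rejected.
        kept = [a for a in self.actions if a not in self.rejected[ctx]]
        return kept or self.actions

    def act(self, ctx):
        options = self.candidates(ctx)
        if self.rng.random() < self.epsilon:
            return self.rng.choice(options)
        return max(options, key=lambda a: self.q[(ctx, a)])

    def update(self, ctx, action, reward, next_ctx, done, had_effect):
        # had_effect is an assumed signal, e.g. derived from the game's feedback text.
        if not had_effect:
            self.rejected[ctx].add(action)
        if done:
            best_next = 0.0
        else:
            best_next = max(self.q[(next_ctx, a)] for a in self.candidates(next_ctx))
        target = reward + self.gamma * best_next
        self.q[(ctx, action)] += self.alpha * (target - self.q[(ctx, action)])
```

In this sketch, the same room description reached with a score of 0 and with a score of 1 yields two distinct contexts, so values learned before and after collecting a reward do not interfere, and an action rejected in one context remains available in the other.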


Pages: 4328-4336

Page count: 9

