Algorithmic Improvements for Deep Reinforcement Learning Applied to Interactive Fiction

被引：0

作者：

Jain, Vishal ^{[1
,3
]}

Fedus, William ^{[1
,2
]}

Larochelle, Hugo ^{[1
,2
,5
]}

Precup, Doina ^{[1
,3
,4
,5
]}

Bellemare, Marc G. ^{[1
,2
,3
,5
]}

机构：

[1] Mila, Montreal, PQ, Canada

[2] Google Brain, Mountain View, CA USA

[3] McGill Univ, Montreal, PQ, Canada

[4] DeepMind, London, England

[5] CIFAR, Toronto, ON, Canada

来源：

THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2020年 / 34卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Text-based games are a natural challenge domain for deep reinforcement learning algorithms. Their state and action spaces are combinatorially large, their reward function is sparse. and they are partially observable: the agent is informed of the consequences of its actions through textual feedback. In this paper we emphasize this latter point and consider the design of a deep reinforcement learning agent that can play from feedback alone. Our design recognizes and takes advantage of the structural characteristics of text-based games. We first propose a contextualisation mechanism, based on accumulated reward, which simplifies the learning problem and mitigates partial observability. We then study different methods that rely on the notion that most actions are ineffectual in any given situation, following Zahavy et al.'s idea of an admissible action. We evaluate these techniques in a series of text-based games of increasing difficulty based on the TextWorld framework, as well as the iconic game ZORK. Empirically, we find that these techniques improve the performance of a baseline deep reinforcement learning agent applied to text-based games.

引用

页码：4328 / 4336

页数：9

共 50 条

[31] Session-based Interactive Recommendation via Deep Reinforcement Learning
Shi, Longxiang
Zhang, Zilin
Wang, Shoujin
Zhang, Qi
Wu, Minghui
Yang, Cheng
Li, Shijian
[J]. 23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, ICDM 2023, 2023, : 1319 - 1324
[32] Learning to Engage with Interactive Systems: A Field Study on Deep Reinforcement Learning in a Public Museum
Meng, Lingheng
Lin, Daiwei
Francey, Adam
Gorbet, Rob
Beesley, Philip
Kulic, Dana
[J]. ACM TRANSACTIONS ON HUMAN-ROBOT INTERACTION, 2020, 10 (01)
[33] A Mean-VaR Based Deep Reinforcement Learning Framework for Practical Algorithmic Trading
Jin, Boyi
[J]. IEEE ACCESS, 2023, 11 : 28920 - 28933
[34] Optimal Action Space Search: an Effective Deep Reinforcement Learning Method for Algorithmic Trading
Duan, Zhongjie
Chen, Cen
Cheng, Dawei
Liang, Yuqi
Qian, Weining
[J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 406 - 415
[35] A multi-agent deep reinforcement learning framework for algorithmic trading in financial markets
Shavandi, Ali
Khedmati, Majid
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 208
[36] A novel deep reinforcement learning framework with BiLSTM-Attention networks for algorithmic trading
Huang, Yuling
Wan, Xiaoxiao
Zhang, Lin
Lu, Xiaoping
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 240
[37] A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning
Morales, Eduardo F.
Murrieta-Cid, Rafael
Becerra, Israel
Esquivel-Basaldua, Marco A.
[J]. INTELLIGENT SERVICE ROBOTICS, 2021, 14 (05) : 773 - 805
[38] A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning
Eduardo F. Morales
Rafael Murrieta-Cid
Israel Becerra
Marco A. Esquivel-Basaldua
[J]. Intelligent Service Robotics, 2021, 14 : 773 - 805
[39] Deep Reinforcement Learning Applied to a Robotic Pick-and-Place Application
Gomes, Natanael Magno
Martins, Felipe N.
Lima, Jose
Wortche, Heinrich
[J]. OPTIMIZATION, LEARNING ALGORITHMS AND APPLICATIONS, OL2A 2021, 2021, 1488 : 251 - 265
[40] Deep reinforcement learning applied to statistical arbitrage investment strategy on cryptomarket
Vergara, Gabriel
Kristjanpoller, Werner
[J]. APPLIED SOFT COMPUTING, 2024, 153

← 1 2 3 4 5 →