Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning

被引：0

作者：

Zahavy, Tom ^{[1
,2
]}

Haroush, Matan ^{[1
]}

Merlis, Nadav ^{[1
]}

Mankowitz, Daniel J. ^{[3
]}

Mannor, Shie ^{[1
]}

机构：

[1] Technion Israel Inst Technol, Haifa, Israel

[2] Google Res, Haifa, Israel

[3] Deepmind, London, England

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018) | 2018年 / 31卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Learning how to act when there are many available actions in each state is a challenging task for Reinforcement Learning (RL) agents, especially when many of the actions are redundant or irrelevant. In such cases, it is sometimes easier to learn which actions not to take. In this work, we propose the Action-Elimination Deep Q-Network (AE-DQN) architecture that combines a Deep RL algorithm with an Action Elimination Network (AEN) that eliminates sub-optimal actions. The AEN is trained to predict invalid actions, supervised by an external elimination signal provided by the environment. Simulations demonstrate a considerable speedup and added robustness over vanilla DQN in text-based games with over a thousand discrete actions.

引用

页数：12

共 50 条

[1] Learn to Steer through Deep Reinforcement Learning
Wu, Keyu
Esfahani, Mahdi Abolfazli
Yuan, Shenghai
Wang, Han
SENSORS, 2018, 18 (11)
[2] Learn to Navigate Autonomously Through Deep Reinforcement Learning
Wu, Keyu
Wang, Han
Esfahani, Mahdi Abolfazli
Yuan, Shenghai
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2022, 69 (05) : 5342 - 5352
[3] Reinforcement learning in swarms that learn
Peters, JF
Henry, C
Ramanna, S
2005 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2005, : 400 - 406
[4] Smart Magnetic Microrobots Learn to Swim with Deep Reinforcement Learning
Behrens, Michael R.
Ruder, Warren C.
ADVANCED INTELLIGENT SYSTEMS, 2022, 4 (10)
[5] What to Learn How to Learn What not to Learn
Li Jinda
山东师范大学外国语学院学报(基础英语教育), 2006, (01) : 95 - 96
[6] Track-to-Learn: A general framework for tractography with deep reinforcement learning
Theberge, Antoine
Desrosiers, Christian
Descoteaux, Maxime
Jodoin, Pierre-Marc
MEDICAL IMAGE ANALYSIS, 2021, 72
[7] Deep reinforcement learning enabling a BCFbot to learn various undulatory patterns
Hameed, Imran
Chao, Xu
Navarro-Alarcon, David
Jing, Xingjian
OCEAN ENGINEERING, 2025, 320
[8] BND*-DDQN: Learn to Steer Autonomously Through Deep Reinforcement Learning
Wu, Keyu
Wang, Han
Abolfazli Esfahani, Mahdi
Yuan, Shenghai
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2021, 13 (02) : 249 - 261
[9] Learning to learn in the development of action
Adolph, KE
Action as an Organizer of Learning and Development, 2005, 33 : 91 - 122
[10] Learn deep before deep learning
Mayorga, Karina Martinez
Gomez Jimenez, Gabriela
Madariaga-Mazon, Abraham
ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 257

← 1 2 3 4 5 →