Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning

被引:0
|
作者
Zahavy, Tom [1 ,2 ]
Haroush, Matan [1 ]
Merlis, Nadav [1 ]
Mankowitz, Daniel J. [3 ]
Mannor, Shie [1 ]
机构
[1] Technion Israel Inst Technol, Haifa, Israel
[2] Google Res, Haifa, Israel
[3] Deepmind, London, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning how to act when there are many available actions in each state is a challenging task for Reinforcement Learning (RL) agents, especially when many of the actions are redundant or irrelevant. In such cases, it is sometimes easier to learn which actions not to take. In this work, we propose the Action-Elimination Deep Q-Network (AE-DQN) architecture that combines a Deep RL algorithm with an Action Elimination Network (AEN) that eliminates sub-optimal actions. The AEN is trained to predict invalid actions, supervised by an external elimination signal provided by the environment. Simulations demonstrate a considerable speedup and added robustness over vanilla DQN in text-based games with over a thousand discrete actions.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Learn to Steer through Deep Reinforcement Learning
    Wu, Keyu
    Esfahani, Mahdi Abolfazli
    Yuan, Shenghai
    Wang, Han
    SENSORS, 2018, 18 (11)
  • [2] Learn to Navigate Autonomously Through Deep Reinforcement Learning
    Wu, Keyu
    Wang, Han
    Esfahani, Mahdi Abolfazli
    Yuan, Shenghai
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2022, 69 (05) : 5342 - 5352
  • [3] Reinforcement learning in swarms that learn
    Peters, JF
    Henry, C
    Ramanna, S
    2005 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2005, : 400 - 406
  • [4] Smart Magnetic Microrobots Learn to Swim with Deep Reinforcement Learning
    Behrens, Michael R.
    Ruder, Warren C.
    ADVANCED INTELLIGENT SYSTEMS, 2022, 4 (10)
  • [6] Track-to-Learn: A general framework for tractography with deep reinforcement learning
    Theberge, Antoine
    Desrosiers, Christian
    Descoteaux, Maxime
    Jodoin, Pierre-Marc
    MEDICAL IMAGE ANALYSIS, 2021, 72
  • [7] Deep reinforcement learning enabling a BCFbot to learn various undulatory patterns
    Hameed, Imran
    Chao, Xu
    Navarro-Alarcon, David
    Jing, Xingjian
    OCEAN ENGINEERING, 2025, 320
  • [8] BND*-DDQN: Learn to Steer Autonomously Through Deep Reinforcement Learning
    Wu, Keyu
    Wang, Han
    Abolfazli Esfahani, Mahdi
    Yuan, Shenghai
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2021, 13 (02) : 249 - 261
  • [9] Learning to learn in the development of action
    Adolph, KE
    Action as an Organizer of Learning and Development, 2005, 33 : 91 - 122
  • [10] Learn deep before deep learning
    Mayorga, Karina Martinez
    Gomez Jimenez, Gabriela
    Madariaga-Mazon, Abraham
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 257