Towards Explainable Reinforcement Learning Using Scoring Mechanism Augmented Agents

被引:3
|
作者
Liu, Yang [1 ]
Wang, Xinzhi [1 ]
Chang, Yudong [1 ]
Jiang, Chao [1 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai, Peoples R China
关键词
Deep reinforcement learning; Explainable AI; Adaptive region scoring mechanism;
D O I
10.1007/978-3-031-10986-7_44
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep reinforcement learning (DRL) is increasingly used in application areas such as medicine and finance. However, the direct mapping from state to action in DRL makes it challenging to explain why decisions are made. Existing algorithms for explaining DRL policy are posteriori, explaining to an agent after it has been trained. As a common limitation, these posteriori methods fail to improve training with the deduced knowledge. Face with that, an end-to-end trainable explanation method is proposed, in which an Adaptive Region Scoring Mechanism (ARS) is embedded into DRL system. The ARS explains the agent's action by evaluating the features of the input state that are most relevant action before DRL re-learn from task-related regions. The proposed method is validated on Atari games. Experiments demonstrate that agent using the explainable proposed mechanism outperforms the original models.
引用
收藏
页码:547 / 558
页数:12
相关论文
共 50 条
  • [1] Towards Interpretable Reinforcement Learning Using Attention Augmented Agents
    Mott, Alex
    Zoran, Daniel
    Chrzanowski, Mike
    Wierstra, Daan
    Rezende, Danilo J.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [2] Explainable Agency in Reinforcement Learning Agents
    Madumal, Prashan
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13724 - 13725
  • [3] Reinforcement Learning over Sentiment-Augmented Knowledge Graphs towards Accurate and Explainable Recommendation
    Park, Sung-Jun
    Chae, Dong-Kyu
    Bae, Hong-Kyun
    Park, Sumin
    Kim, Sang-Wook
    WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 784 - 793
  • [4] Towards Explainable Shared Control using Augmented Reality
    Zolotas, Mark
    Demiris, Yiannis
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 3020 - 3026
  • [5] Imagination-Augmented Agents for Deep Reinforcement Learning
    Racaniere, Sebastien
    Weber, Theophane
    Reichert, David P.
    Buesing, Lars
    Guez, Arthur
    Rezende, Danilo
    Badia, Adria Puigdomenech
    Vinyals, Oriol
    Heess, Nicolas
    Li, Yujia
    Pascanu, Razvan
    Battaglia, Peter
    Hassabis, Demis
    Silver, David
    Wierstra, Daan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [6] Portfolio construction using explainable reinforcement learning
    Cortes, Daniel Gonzalez
    Onieva, Enrique
    Pastor, Iker
    Trinchera, Laura
    Wu, Jian
    EXPERT SYSTEMS, 2024, 41 (11)
  • [7] Interestingness elements for explainable reinforcement learning: Understanding agents' capabilities and limitations
    Sequeira, Pedro
    Gervasio, Melinda
    ARTIFICIAL INTELLIGENCE, 2020, 288 (288)
  • [8] Towards Explainable Reinforcement Learning in Optical Networks: The RMSA Use Case
    Ayoub, Omran
    Natalino, Carlos
    Monti, Paolo
    2024 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXHIBITION, OFC, 2024,
  • [9] Explainable navigation system using fuzzy reinforcement learning
    Bautista-Montesano, Rolando
    Bustamante-Bello, Rogelio
    Ramirez-Mendoza, Ricardo A.
    INTERNATIONAL JOURNAL OF INTERACTIVE DESIGN AND MANUFACTURING - IJIDEM, 2020, 14 (04): : 1411 - 1428
  • [10] Explainable navigation system using fuzzy reinforcement learning
    Rolando Bautista-Montesano
    Rogelio Bustamante-Bello
    Ricardo A. Ramirez-Mendoza
    International Journal on Interactive Design and Manufacturing (IJIDeM), 2020, 14 : 1411 - 1428