Towards Explainable Reinforcement Learning Using Scoring Mechanism Augmented Agents

被引:3
|
作者
Liu, Yang [1 ]
Wang, Xinzhi [1 ]
Chang, Yudong [1 ]
Jiang, Chao [1 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai, Peoples R China
关键词
Deep reinforcement learning; Explainable AI; Adaptive region scoring mechanism;
D O I
10.1007/978-3-031-10986-7_44
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep reinforcement learning (DRL) is increasingly used in application areas such as medicine and finance. However, the direct mapping from state to action in DRL makes it challenging to explain why decisions are made. Existing algorithms for explaining DRL policy are posteriori, explaining to an agent after it has been trained. As a common limitation, these posteriori methods fail to improve training with the deduced knowledge. Face with that, an end-to-end trainable explanation method is proposed, in which an Adaptive Region Scoring Mechanism (ARS) is embedded into DRL system. The ARS explains the agent's action by evaluating the features of the input state that are most relevant action before DRL re-learn from task-related regions. The proposed method is validated on Atari games. Experiments demonstrate that agent using the explainable proposed mechanism outperforms the original models.
引用
收藏
页码:547 / 558
页数:12
相关论文
共 50 条
  • [31] Reinforcement Learning Agents
    C. Ribeiro
    Artificial Intelligence Review, 2002, 17 : 223 - 250
  • [32] Memory-Based Explainable Reinforcement Learning
    Cruz, Francisco
    Dazeley, Richard
    Vamplew, Peter
    AI 2019: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11919 : 66 - 77
  • [33] Causal State Distillation for Explainable Reinforcement Learning
    Lu, Wenhao
    Zhao, Xufeng
    Fryen, Thilo
    Lee, Jae Hee
    Li, Mengdi
    Magg, Sven
    Wermter, Stefan
    CAUSAL LEARNING AND REASONING, VOL 236, 2024, 236 : 106 - 142
  • [34] Towards mental time travel: a hierarchical memory for reinforcement learning agents
    Lampinen, Andrew K.
    Chan, Stephanie C. Y.
    Banino, Andrea
    Hill, Felix
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [35] Fuzzy Centered Explainable Network for Reinforcement Learning
    Ou, Liang
    Chang, Yu-Chen
    Wang, Yu-Kai
    Lin, Chin-Teng
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2024, 32 (01) : 203 - 213
  • [36] Inherently Explainable Reinforcement Learning in Natural Language
    Peng, Xiangyu
    Xing, Chen
    Choubey, Prafulla Kumar
    Wu, Chien-Sheng
    Xiong, Caiming
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [37] Agents and reinforcement learning
    Harlequin's Adaptive Systems Group
    Dr Dobb's J Software Tools Prof Program, 3 (3pp):
  • [38] Reinforcement learning agents
    Ribeiro, C
    ARTIFICIAL INTELLIGENCE REVIEW, 2002, 17 (03) : 223 - 250
  • [39] Agents and reinforcement learning
    Singh, S
    Norvig, P
    Cohn, D
    DR DOBBS JOURNAL, 1997, 22 (03): : 28 - +
  • [40] Parallel reinforcement learning systems using exploration agents
    Tateyama, Takeshi
    Kawata, Seiichi
    Shimomura, Yoshiki
    Nihon Kikai Gakkai Ronbunshu, C Hen/Transactions of the Japan Society of Mechanical Engineers, Part C, 2008, 74 (03): : 692 - 701