Explainability in deep reinforcement learning

被引:151
|
作者
Heuillet, Alexandre [1 ]
Couthouis, Fabien [2 ]
Diaz-Rodriguez, Natalia [3 ]
机构
[1] Bordeaux INP, ENSEIRB MATMECA, 1 Ave Docteur Albert Schweitzer, F-33400 Talence, France
[2] Bordeaux INP, ENSC, 109 Ave Roul, F-33400 Talence, France
[3] Inst Polytech Paris, Inria Flowers Team, ENSTA Paris, 828 Blvd Marechaux, F-91762 Palaiseau, France
关键词
Reinforcement Learning; Explainable artificial intelligence; Machine Learning; Deep Learning; Responsible artificial intelligence; Representation learning;
D O I
10.1016/j.knosys.2020.106685
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A large set of the explainable Artificial Intelligence (XAI) literature is emerging on feature relevance techniques to explain a deep neural network (DNN) output or explaining models that ingest image source data. However, assessing how XAI techniques can help understand models beyond classification tasks, e.g. for reinforcement learning (RL), has not been extensively studied. We review recent works in the direction to attain Explainable Reinforcement Learning (XRL), a relatively new subfield of Explainable Artificial Intelligence, intended to be used in general public applications, with diverse audiences, requiring ethical, responsible and trustable algorithms. In critical situations where it is essential to justify and explain the agent's behaviour, better explainability and interpretability of RL models could help gain scientific insight on the inner workings of what is still considered a black box. We evaluate mainly studies directly linking explainability to RL, and split these into two categories according to the way the explanations are generated: transparent algorithms and post-hoc explainability. We also review the most prominent XAI works from the lenses of how they could potentially enlighten the further deployment of the latest advances in RL, in the demanding present and future of everyday problems. (C) 2020 Elsevier B.V. All rights reserved.
引用
下载
收藏
页数:14
相关论文
共 50 条
  • [21] Transparent AI: Explainability of deep learning based load disaggregation
    Murray, David
    Stankovic, Lina
    Stankovic, Vladimir
    BUILDSYS'21: PROCEEDINGS OF THE 2021 ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILT ENVIRONMENTS, 2021, : 268 - 271
  • [22] Machine learning and deep analytics for biocomputing: call for better explainability
    Petkovic, Dragutin
    Kobzik, Lester
    Re, Christopher
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2018 (PSB), 2018, : 623 - 627
  • [23] Reproducibility and Explainability of Deep Learning in Mammography: A Systematic Review of Literature
    Bhalla, Deeksha
    Rangarajan, Krithika
    Chandra, Tany
    Banerjee, Subhashis
    Arora, Chetan
    INDIAN JOURNAL OF RADIOLOGY AND IMAGING, 2024, 34 (03): : 469 - 487
  • [24] Explainability of deep learning models in medical video analysis: a survey
    Kolarik M.
    Sarnovsky M.
    Paralic J.
    Babic F.
    PeerJ Computer Science, 2023, 9 : 1 - 39
  • [25] Deep learning generates synthetic cancer histology for explainability and education
    Dolezal, James M.
    Wolk, Rachelle
    Hieromnimon, Hanna M.
    Howard, Frederick M.
    Srisuwananukorn, Andrew
    Karpeyev, Dmitry
    Ramesh, Siddhi
    Kochanny, Sara
    Kwon, Jung Woo
    Agni, Meghana
    Simon, Richard C.
    Desai, Chandni
    Kherallah, Raghad
    Nguyen, Tung D.
    Schulte, Jefree J.
    Cole, Kimberly
    Khramtsova, Galina
    Garassino, Marina Chiara
    Husain, Aliya N.
    Li, Huihua
    Grossman, Robert
    Cipriani, Nicole A.
    Pearson, Alexander T.
    NPJ PRECISION ONCOLOGY, 2023, 7 (01)
  • [26] Explainability of deep learning models in medical video analysis: a survey
    Kolarik, Michal
    Sarnovsky, Martin
    Paralic, Jan
    Babic, Frantisek
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [27] Improving Explainability of Deep Learning for Polarimetric Radar Rainfall Estimation
    Li, Wenyuan
    Chen, Haonan
    Han, Lei
    GEOPHYSICAL RESEARCH LETTERS, 2024, 51 (11)
  • [28] Deep learning generates synthetic cancer histology for explainability and education
    James M. Dolezal
    Rachelle Wolk
    Hanna M. Hieromnimon
    Frederick M. Howard
    Andrew Srisuwananukorn
    Dmitry Karpeyev
    Siddhi Ramesh
    Sara Kochanny
    Jung Woo Kwon
    Meghana Agni
    Richard C. Simon
    Chandni Desai
    Raghad Kherallah
    Tung D. Nguyen
    Jefree J. Schulte
    Kimberly Cole
    Galina Khramtsova
    Marina Chiara Garassino
    Aliya N. Husain
    Huihua Li
    Robert Grossman
    Nicole A. Cipriani
    Alexander T. Pearson
    npj Precision Oncology, 7
  • [29] Deep Reinforcement Learning: An Overview
    Mousavi, Seyed Sajad
    Schukat, Michael
    Howley, Enda
    PROCEEDINGS OF SAI INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS) 2016, VOL 2, 2018, 16 : 426 - 440
  • [30] Deep Reinforcement Learning with Adjustments
    Khorasgani, Hamed
    Wang, Haiyan
    Gupta, Chetan
    Serita, Susumu
    2021 IEEE 19TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2021,