Explainable Reinforcement Learning through a Causal Lens

Cited by: 0
Authors
Madumal, Prashan
Miller, Tim
Sonenberg, Liz
Vetere, Frank
Institutions
Funding
Australian Research Council;
Keywords
EXPLANATIONS;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Prominent theories in cognitive science propose that humans understand and represent the knowledge of the world through causal relationships. In making sense of the world, we build causal models in our mind to encode cause-effect relations of events and use these to explain why new events happen by referring to counterfactuals - things that did not happen. In this paper, we use causal models to derive causal explanations of the behaviour of model-free reinforcement learning agents. We present an approach that learns a structural causal model during reinforcement learning and encodes causal relationships between variables of interest. This model is then used to generate explanations of behaviour based on counterfactual analysis of the causal model. We computationally evaluate the model in 6 domains and measure performance and task prediction accuracy. We report on a study with 120 participants who observe agents playing a real-time strategy game (Starcraft II) and then receive explanations of the agents' behaviour. We investigate: 1) participants' understanding gained by explanations through task prediction; 2) explanation satisfaction and 3) trust. Our results show that causal model explanations perform better on these measures compared to two other baseline explanation models.
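The abstract's idea of explaining an action by contrasting it with a counterfactual can be illustrated with a minimal sketch. This is not the authors' implementation: the class `ToySCM`, the helper `contrast`, the variable names (`build_worker`, `workers`, `minerals`, `workers0`) and the hand-written structural equations are all hypothetical, chosen only to show how a structural causal model can be evaluated under the action taken and under an alternative action. In the paper the model is learned during reinforcement learning; here the equations are simply fixed by hand.

```python
from typing import Callable, Dict, List, Optional, Tuple

# A structural equation maps the values computed so far to one variable's value.
Equation = Callable[[Dict[str, float]], float]


class ToySCM:
    """Hypothetical toy structural causal model (illustration only)."""

    def __init__(self, order: List[str], equations: Dict[str, Equation]):
        self.order = order          # variables in topological (cause-before-effect) order
        self.equations = equations  # one structural equation per variable

    def predict(
        self,
        exogenous: Dict[str, float],
        interventions: Optional[Dict[str, float]] = None,
    ) -> Dict[str, float]:
        """Evaluate every variable, overriding any that are intervened on."""
        values = dict(exogenous)
        interventions = interventions or {}
        for name in self.order:
            if name in interventions:
                values[name] = interventions[name]
            else:
                values[name] = self.equations[name](values)
        return values


def contrast(
    scm: ToySCM,
    exogenous: Dict[str, float],
    actual: Dict[str, float],
    counterfactual: Dict[str, float],
) -> Dict[str, Tuple[float, float]]:
    """Return (actual, counterfactual) pairs for every variable that differs."""
    a = scm.predict(exogenous, actual)
    b = scm.predict(exogenous, counterfactual)
    return {k: (a[k], b[k]) for k in scm.order if a[k] != b[k]}


# Assumed toy domain: building a worker raises the worker count,
# and each worker yields 5 minerals.
scm = ToySCM(
    order=["build_worker", "workers", "minerals"],
    equations={
        "build_worker": lambda v: 0.0,
        "workers": lambda v: v["workers0"] + v["build_worker"],
        "minerals": lambda v: 5.0 * v["workers"],
    },
)

diff = contrast(
    scm,
    exogenous={"workers0": 4.0},
    actual={"build_worker": 1.0},
    counterfactual={"build_worker": 0.0},
)
for name, (did, did_not) in diff.items():
    print(f"{name}: {did} (action taken) vs {did_not} (counterfactual)")
```

The variables that differ between the two evaluations are the raw material for a contrastive "why this action rather than that one" explanation; how such differences are selected and phrased for a human reader is outside the scope of this sketch.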
Pages: 2493 - 2500
Page count: 8
Related papers
50 entries in total
  • [1] Causal State Distillation for Explainable Reinforcement Learning
    Lu, Wenhao
    Zhao, Xufeng
    Fryen, Thilo
    Lee, Jae Hee
    Li, Mengdi
    Magg, Sven
    Wermter, Stefan
    [J]. CAUSAL LEARNING AND REASONING, VOL 236, 2024, 236 : 106 - 142
  • [2] Explainable Reinforcement Learning via a Causal World Model
    Yu, Zhongwei
    Ruan, Jingqing
    Xing, Dengpeng
    [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4540 - 4548
  • [3] ReCCoVER: Detecting Causal Confusion for Explainable Reinforcement Learning
    Gajcin, Jasmina
    Dusparic, Ivana
    [J]. EXPLAINABLE AND TRANSPARENT AI AND MULTI-AGENT SYSTEMS, EXTRAAMAS 2022, 2022, 13283 : 38 - 56
  • [4] Neurofeedback through the lens of reinforcement learning
    Lubianiker, Nitzan
    Paret, Christian
    Dayan, Peter
    Hendler, Talma
    [J]. TRENDS IN NEUROSCIENCES, 2022, 45 (08) : 579 - 593
  • [5] Explainable Federated Medical Image Analysis Through Causal Learning and Blockchain
    Mu, Junsheng
    Kadoch, Michel
    Yuan, Tongtong
    Lv, Wenzhe
    Liu, Qiang
    Li, Bohan
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (06) : 3206 - 3218
  • [6] Improving Human-Robot Interaction through Explainable Reinforcement Learning
    Tabrez, Aaquib
    Hayes, Bradley
    [J]. HRI '19: 2019 14TH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2019, : 751 - 753
  • [7] Explainable Reinforcement Learning for Longitudinal Control
    Liessner, Roman
    Dohmen, Jan
    Wiering, Marco
    [J]. ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2021, : 874 - 881
  • [8] Explainable Agency in Reinforcement Learning Agents
    Madumal, Prashan
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13724 - 13725
  • [9] A Reinforcement Learning Framework for Explainable Recommendation
    Wang, Xiting
    Chen, Yiru
    Yang, Jie
    Wu, Le
    Wu, Zhengtao
    Xie, Xing
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 587 - 596
  • [10] Strategic Tasks for Explainable Reinforcement Learning
    Pocius, Rey
    Neal, Lawrence
    Fern, Alan
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 10007 - 10008