Verbal Explanations for Deep Reinforcement Learning Neural Networks with Attention on Extracted Features

被引:6
|
作者
Wang, Xinzhi [1 ]
Yuan, Shengcheng [2 ]
Zhang, Hui [3 ]
Lewis, Michael [4 ]
Sycara, Katia [5 ]
机构
[1] Shanghai Univ, Shanghai, Peoples R China
[2] LazyComposer Inc, Beijing, Peoples R China
[3] Tsinghua Univ, Beijing Key Lab City Integrated Emergency Respons, Beijing, Peoples R China
[4] Univ Pittsburgh, Pittsburgh, PA USA
[5] Carnegie Mellon Univ, Robot Inst, Pittsburgh, PA 15213 USA
基金
国家重点研发计划;
关键词
D O I
10.1109/ro-man46459.2019.8956301
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, there has been increasing interest in transparency in Deep Neural Networks. Most of the works on transparency have been done for image classification. In this paper, we report on work of transparency in Deep Reinforcement Learning Networks (DRLNs). Such networks have been extremely successful in learning action control in Atari games. In this paper, we focus on generating verbal (natural language) descriptions and explanations of deep reinforcement learning policies. Successful generation of verbal explanations would allow better understanding by people (e.g., users, debuggers) of the inner workings of DRLNs which could ultimately increase trust in these systems. We present a generation model which consists of three parts: an encoder on feature extraction, an attention structure on selecting features from the output of the encoder, and a decoder on generating the explanation in natural language. Four variants of the attention structure full attention, global attention, adaptive attention and object attention-are designed and compared. The adaptive attention structure performs the best among all the variants, even though the object attention structure is given additional information on object locations. Additionally, our experiment results showed that the proposed encoder outperforms two baseline encoders (Resnet and VGG) on the capability of distinguishing the game state images.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] On Correlation of Features Extracted by Deep Neural Networks
    Ayinde, Babajide O.
    Inane, Tamer
    Zurada, Jacek M.
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [2] On the Expressivity of Neural Networks for Deep Reinforcement Learning
    Dong, Kefan
    Luo, Yuping
    Yu, Tianhe
    Finn, Chelsea
    Ma, Tengyu
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [3] On the Expressivity of Neural Networks for Deep Reinforcement Learning
    Dong, Kefan
    Luo, Yuping
    Yu, Tianhe
    Finn, Chelsea
    Ma, Tengyu
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [4] Attention Map-Guided Visual Explanations for Deep Neural Networks
    An, Junkang
    Joe, Inwhee
    APPLIED SCIENCES-BASEL, 2022, 12 (08):
  • [5] Transparency and Explanation in Deep Reinforcement Learning Neural Networks
    Iyer, Rahul
    Li, Yuezhang
    Li, Huao
    Lewis, Michael
    Sundar, Ramitha
    Sycara, Katia
    PROCEEDINGS OF THE 2018 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY (AIES'18), 2018, : 144 - 150
  • [6] Temporal Explanations of Deep Reinforcement Learning Agents
    Towers, Mark
    Du, Yali
    Freeman, Christopher
    Norman, Tim
    EXPLAINABLE AND TRANSPARENT AI AND MULTI-AGENT SYSTEMS, EXTRAAMAS 2024, 2024, 14847 : 99 - 115
  • [7] Reinforcement Learning and Deep Neural Networks for PI Controller Tuning
    Shipman, William J.
    Coetzee, Loutjie C.
    IFAC PAPERSONLINE, 2019, 52 (14): : 111 - 116
  • [8] Deep Auto-Encoder Neural Networks in Reinforcement Learning
    Lange, Sascha
    Riedmiller, Martin
    2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [9] Simultaneously Learning Architectures and Features of Deep Neural Networks
    Wang, Tinghuai
    Fan, Lixin
    Wang, Huiling
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: DEEP LEARNING, PT II, 2019, 11728 : 275 - 287
  • [10] Selective particle attention: Rapidly and flexibly selecting features for deep reinforcement learning
    Blakeman, Sam
    Mareschal, Denis
    NEURAL NETWORKS, 2022, 150 : 408 - 421