Verbal Explanations for Deep Reinforcement Learning Neural Networks with Attention on Extracted Features

被引：6

作者：

Wang, Xinzhi ^{[1
]}

Yuan, Shengcheng ^{[2
]}

Zhang, Hui ^{[3
]}

Lewis, Michael ^{[4
]}

Sycara, Katia ^{[5
]}

机构：

[1] Shanghai Univ, Shanghai, Peoples R China

[2] LazyComposer Inc, Beijing, Peoples R China

[3] Tsinghua Univ, Beijing Key Lab City Integrated Emergency Respons, Beijing, Peoples R China

[4] Univ Pittsburgh, Pittsburgh, PA USA

[5] Carnegie Mellon Univ, Robot Inst, Pittsburgh, PA 15213 USA

来源：

2019 28TH IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN) | 2019年

基金：

国家重点研发计划;

关键词：

D O I：

10.1109/ro-man46459.2019.8956301

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In recent years, there has been increasing interest in transparency in Deep Neural Networks. Most of the works on transparency have been done for image classification. In this paper, we report on work of transparency in Deep Reinforcement Learning Networks (DRLNs). Such networks have been extremely successful in learning action control in Atari games. In this paper, we focus on generating verbal (natural language) descriptions and explanations of deep reinforcement learning policies. Successful generation of verbal explanations would allow better understanding by people (e.g., users, debuggers) of the inner workings of DRLNs which could ultimately increase trust in these systems. We present a generation model which consists of three parts: an encoder on feature extraction, an attention structure on selecting features from the output of the encoder, and a decoder on generating the explanation in natural language. Four variants of the attention structure full attention, global attention, adaptive attention and object attention-are designed and compared. The adaptive attention structure performs the best among all the variants, even though the object attention structure is given additional information on object locations. Additionally, our experiment results showed that the proposed encoder outperforms two baseline encoders (Resnet and VGG) on the capability of distinguishing the game state images.

引用

页数：7

共 50 条

[1] On Correlation of Features Extracted by Deep Neural Networks
Ayinde, Babajide O.
Inane, Tamer
Zurada, Jacek M.
2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
[2] On the Expressivity of Neural Networks for Deep Reinforcement Learning
Dong, Kefan
Luo, Yuping
Yu, Tianhe
Finn, Chelsea
Ma, Tengyu
25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
[3] On the Expressivity of Neural Networks for Deep Reinforcement Learning
Dong, Kefan
Luo, Yuping
Yu, Tianhe
Finn, Chelsea
Ma, Tengyu
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
[4] Attention Map-Guided Visual Explanations for Deep Neural Networks
An, Junkang
Joe, Inwhee
APPLIED SCIENCES-BASEL, 2022, 12 (08):
[5] Transparency and Explanation in Deep Reinforcement Learning Neural Networks
Iyer, Rahul
Li, Yuezhang
Li, Huao
Lewis, Michael
Sundar, Ramitha
Sycara, Katia
PROCEEDINGS OF THE 2018 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY (AIES'18), 2018, : 144 - 150
[6] Temporal Explanations of Deep Reinforcement Learning Agents
Towers, Mark
Du, Yali
Freeman, Christopher
Norman, Tim
EXPLAINABLE AND TRANSPARENT AI AND MULTI-AGENT SYSTEMS, EXTRAAMAS 2024, 2024, 14847 : 99 - 115
[7] Reinforcement Learning and Deep Neural Networks for PI Controller Tuning
Shipman, William J.
Coetzee, Loutjie C.
IFAC PAPERSONLINE, 2019, 52 (14): : 111 - 116
[8] Deep Auto-Encoder Neural Networks in Reinforcement Learning
Lange, Sascha
Riedmiller, Martin
2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
[9] Simultaneously Learning Architectures and Features of Deep Neural Networks
Wang, Tinghuai
Fan, Lixin
Wang, Huiling
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: DEEP LEARNING, PT II, 2019, 11728 : 275 - 287
[10] Selective particle attention: Rapidly and flexibly selecting features for deep reinforcement learning
Blakeman, Sam
Mareschal, Denis
NEURAL NETWORKS, 2022, 150 : 408 - 421

← 1 2 3 4 5 →