Generating Visual Explanations

被引：271

作者：

Hendricks, Lisa Anne ^{[1
]}

Akata, Zeynep ^{[2
]}

Rohrbach, Marcus ^{[1
,3
]}

Donahue, Jeff ^{[1
]}

Schiele, Bernt ^{[2
]}

Darrell, Trevor ^{[1
]}

机构：

[1] UC Berkeley EECS, Berkeley, CA 94720 USA

[2] Max Planck Inst Informat, Saarbrucken, Germany

[3] ICSI, Berkeley, CA USA

来源：

COMPUTER VISION - ECCV 2016, PT IV | 2016年 / 9908卷

关键词：

Visual explanation; Image description; Language and vision;

D O I：

10.1007/978-3-319-46493-0_1

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Clearly explaining a rationale for a classification decision to an end user can be as important as the decision itself. Existing approaches for deep visual recognition are generally opaque and do not output any justification text; contemporary vision-language models can describe image content but fail to take into account class-discriminative image aspects which justify visual predictions. We propose a new model that focuses on the discriminating properties of the visible object, jointly predicts a class label, and explains why the predicted label is appropriate for the image. Through a novel loss function based on sampling and reinforcement learning, our model learns to generate sentences that realize a global sentence property, such as class specificity. Our results on the CUB dataset show that our model is able to generate explanations which are not only consistent with an image but also more discriminative than descriptions produced by existing captioning methods.

引用

页码：3 / 19

页数：17

共 50 条

[1] Generating Natural Counterfactual Visual Explanations
Zhao, Wenqi
Oyama, Satoshi
Kurihara, Masahito
PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 5204 - 5205
[2] Answering Questions about Charts and Generating Visual Explanations
Kim, Dae Hyun
Hoque, Enamul
Agrawala, Maneesh
PROCEEDINGS OF THE 2020 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI'20), 2020,
[3] Generating Explanations for Embodied Action Decision from Visual Observation
Wang, Xiaohan
Liu, Yuehu
Song, Xinhang
Wang, Beibei
Jiang, Shuqiang
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2838 - 2846
[4] Towards a Unified Model for Generating Answers and Explanations in Visual Question Answering
Whitehouse, Chenxi
Weyde, Tillman
Madhyastha, Pranava
17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1693 - 1705
[5] On generating trustworthy counterfactual explanations
Del Ser, Javier
Barredo-Arrieta, Alejandro
Diaz-Rodriguez, Natalia
Herrera, Francisco
Saranti, Anna
Holzinger, Andreas
INFORMATION SCIENCES, 2024, 655
[6] GENERATING EXPLANATIONS OF GEOMETRICAL CONCEPTS
MITKOV, R
COMPUTERS AND ARTIFICIAL INTELLIGENCE, 1990, 9 (06): : 589 - 598
[7] What Do You MEME? Generating Explanations for Visual Semantic Role Labelling in Memes
Sharma, Shivam
Agarwal, Siddhant
Suresh, Tharun
Nakov, Preslav
Akhtar, Md. Shad
Chakraborty, Tanmoy
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9763 - 9771
[8] Software components for generating explanations
Devedzic, V
Radovic, D
Jerinic, L
KNOWLEDGE-BASED SOFTWARE ENGINEERING, 1998, 48 : 291 - 294
[9] Generating deductive database explanations
Mallet, S
Ducassé, M
LOGIC PROGRAMMING: PROCEEDINGS OF THE 1999 INTERNATIONAL CONFERENCE ON LOGIC PROGRAMMING, 1999, : 154 - 168
[10] Generating explanations for biomedical queries
Erdem, Esra
Oztok, Umut
THEORY AND PRACTICE OF LOGIC PROGRAMMING, 2015, 15 : 35 - 78

← 1 2 3 4 5 →