Generating Visual Explanations

被引：271

作者：

Hendricks, Lisa Anne ^{[1
]}

Akata, Zeynep ^{[2
]}

Rohrbach, Marcus ^{[1
,3
]}

Donahue, Jeff ^{[1
]}

Schiele, Bernt ^{[2
]}

Darrell, Trevor ^{[1
]}

机构：

[1] UC Berkeley EECS, Berkeley, CA 94720 USA

[2] Max Planck Inst Informat, Saarbrucken, Germany

[3] ICSI, Berkeley, CA USA

来源：

COMPUTER VISION - ECCV 2016, PT IV | 2016年 / 9908卷

关键词：

Visual explanation; Image description; Language and vision;

D O I：

10.1007/978-3-319-46493-0_1

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Clearly explaining a rationale for a classification decision to an end user can be as important as the decision itself. Existing approaches for deep visual recognition are generally opaque and do not output any justification text; contemporary vision-language models can describe image content but fail to take into account class-discriminative image aspects which justify visual predictions. We propose a new model that focuses on the discriminating properties of the visible object, jointly predicts a class label, and explains why the predicted label is appropriate for the image. Through a novel loss function based on sampling and reinforcement learning, our model learns to generate sentences that realize a global sentence property, such as class specificity. Our results on the CUB dataset show that our model is able to generate explanations which are not only consistent with an image but also more discriminative than descriptions produced by existing captioning methods.

引用

页码：3 / 19

页数：17

共 50 条

[31] Adversarial Counterfactual Visual Explanations
Jeanneret, Guillaume
Simon, Loic
Jurie, Frederic
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 16425 - 16435
[32] Exploring Coherence in Visual Explanations
Alikhani, Malihe
Stone, Matthew
IEEE 1ST CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2018), 2018, : 272 - 277
[33] Visual Explanations of Probabilistic Reasoning
Erwig, Martin
Walkingshaw, Eric
2009 IEEE SYMPOSIUM ON VISUAL LANGUAGES AND HUMAN-CENTRIC COMPUTING, PROCEEDINGS, 2009, : 23 - 27
[34] Diffusion Visual Counterfactual Explanations
Augustin, Maximilian
Boreiko, Valentyn
Croce, Francesco
Hein, Matthias
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[35] A MODELING TECHNIQUE FOR GENERATING CAUSAL EXPLANATIONS OF PHYSICAL SYSTEMS
HEWETT, R
LECTURE NOTES IN COMPUTER SCIENCE, 1991, 497 : 657 - 668
[36] On Generating Explanations for Reinforcement Learning Policies: An Empirical Study
Yuasa, Mikihisa
Tran, Huy T.
Sreenivas, Ramavarapu S.
IEEE Control Systems Letters, 2024, 8 : 3027 - 3032
[37] Generating Explanations for an Emergent Process: The Movement of Sand Dunes
Barth-Cohen, Lauren
2010 PHYSICS EDUCATION RESEARCH CONFERENCE, 2010, 1289 : 77 - 80
[38] Generating Explanations for Internet-based Business Games
Fischer, M.
Lusti, M.
INTERNATIONAL JOURNAL OF EMERGING TECHNOLOGIES IN LEARNING, 2007, 2 (02):
[39] Generating and Understanding Personalized Explanations in Hybrid Recommender Systems
Kouki, Pigi
Schaffer, James
Pujara, Jay
O'Donovan, John
Getoor, Lise
ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2020, 10 (04)
[40] Generating Explanations for Conceptual Validation of Graph Neural Networks
Finzel, Bettina
Saranti, Anna
Angerschmid, Alessa
Tafler, David
Pfeifer, Bastian
Holzinger, Andreas
KUNSTLICHE INTELLIGENZ, 2022, 36 (3-4): : 271 - 285

← 1 2 3 4 5 →