Generating Visual Explanations

Cited by: 271
Authors
Hendricks, Lisa Anne [1]
Akata, Zeynep [2]
Rohrbach, Marcus [1, 3]
Donahue, Jeff [1]
Schiele, Bernt [2]
Darrell, Trevor [1]
Affiliations
[1] UC Berkeley EECS, Berkeley, CA 94720 USA
[2] Max Planck Inst Informat, Saarbrucken, Germany
[3] ICSI, Berkeley, CA USA
Source
COMPUTER VISION - ECCV 2016, PT IV | 2016 / Vol. 9908
Keywords
Visual explanation; Image description; Language and vision;
DOI
10.1007/978-3-319-46493-0_1
Chinese Library Classification
TP18 [Theory of artificial intelligence];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Clearly explaining a rationale for a classification decision to an end user can be as important as the decision itself. Existing approaches for deep visual recognition are generally opaque and do not output any justification text; contemporary vision-language models can describe image content but fail to take into account class-discriminative image aspects which justify visual predictions. We propose a new model that focuses on the discriminating properties of the visible object, jointly predicts a class label, and explains why the predicted label is appropriate for the image. Through a novel loss function based on sampling and reinforcement learning, our model learns to generate sentences that realize a global sentence property, such as class specificity. Our results on the CUB dataset show that our model is able to generate explanations which are not only consistent with an image but also more discriminative than descriptions produced by existing captioning methods.
Pages: 3-19
Page count: 17