Generating Visual Explanations

被引:271
|
作者
Hendricks, Lisa Anne [1 ]
Akata, Zeynep [2 ]
Rohrbach, Marcus [1 ,3 ]
Donahue, Jeff [1 ]
Schiele, Bernt [2 ]
Darrell, Trevor [1 ]
机构
[1] UC Berkeley EECS, Berkeley, CA 94720 USA
[2] Max Planck Inst Informat, Saarbrucken, Germany
[3] ICSI, Berkeley, CA USA
来源
关键词
Visual explanation; Image description; Language and vision;
D O I
10.1007/978-3-319-46493-0_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clearly explaining a rationale for a classification decision to an end user can be as important as the decision itself. Existing approaches for deep visual recognition are generally opaque and do not output any justification text; contemporary vision-language models can describe image content but fail to take into account class-discriminative image aspects which justify visual predictions. We propose a new model that focuses on the discriminating properties of the visible object, jointly predicts a class label, and explains why the predicted label is appropriate for the image. Through a novel loss function based on sampling and reinforcement learning, our model learns to generate sentences that realize a global sentence property, such as class specificity. Our results on the CUB dataset show that our model is able to generate explanations which are not only consistent with an image but also more discriminative than descriptions produced by existing captioning methods.
引用
收藏
页码:3 / 19
页数:17
相关论文
共 50 条
  • [1] Generating Natural Counterfactual Visual Explanations
    Zhao, Wenqi
    Oyama, Satoshi
    Kurihara, Masahito
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 5204 - 5205
  • [2] Answering Questions about Charts and Generating Visual Explanations
    Kim, Dae Hyun
    Hoque, Enamul
    Agrawala, Maneesh
    PROCEEDINGS OF THE 2020 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI'20), 2020,
  • [3] Generating Explanations for Embodied Action Decision from Visual Observation
    Wang, Xiaohan
    Liu, Yuehu
    Song, Xinhang
    Wang, Beibei
    Jiang, Shuqiang
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2838 - 2846
  • [4] Towards a Unified Model for Generating Answers and Explanations in Visual Question Answering
    Whitehouse, Chenxi
    Weyde, Tillman
    Madhyastha, Pranava
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1693 - 1705
  • [5] On generating trustworthy counterfactual explanations
    Del Ser, Javier
    Barredo-Arrieta, Alejandro
    Diaz-Rodriguez, Natalia
    Herrera, Francisco
    Saranti, Anna
    Holzinger, Andreas
    INFORMATION SCIENCES, 2024, 655
  • [6] GENERATING EXPLANATIONS OF GEOMETRICAL CONCEPTS
    MITKOV, R
    COMPUTERS AND ARTIFICIAL INTELLIGENCE, 1990, 9 (06): : 589 - 598
  • [7] What Do You MEME? Generating Explanations for Visual Semantic Role Labelling in Memes
    Sharma, Shivam
    Agarwal, Siddhant
    Suresh, Tharun
    Nakov, Preslav
    Akhtar, Md. Shad
    Chakraborty, Tanmoy
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9763 - 9771
  • [8] Software components for generating explanations
    Devedzic, V
    Radovic, D
    Jerinic, L
    KNOWLEDGE-BASED SOFTWARE ENGINEERING, 1998, 48 : 291 - 294
  • [9] Generating deductive database explanations
    Mallet, S
    Ducassé, M
    LOGIC PROGRAMMING: PROCEEDINGS OF THE 1999 INTERNATIONAL CONFERENCE ON LOGIC PROGRAMMING, 1999, : 154 - 168
  • [10] Generating explanations for biomedical queries
    Erdem, Esra
    Oztok, Umut
    THEORY AND PRACTICE OF LOGIC PROGRAMMING, 2015, 15 : 35 - 78