CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations

被引:8
|
作者
Salewski, Leonard [1 ]
Koepke, A. Sophia [1 ]
Lensch, Hendrik P. A. [1 ]
Akata, Zeynep [1 ,2 ,3 ]
机构
[1] Univ Tubingen, Tubingen, Germany
[2] MPI Informat, Saarbrucken, Germany
[3] MPI Intelligent Syst, Tubingen, Germany
关键词
Visual question answering; Natural language explanations;
D O I
10.1007/978-3-031-04083-2_5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Providing explanations in the context of Visual Question Answering (VQA) presents a fundamental problem in machine learning. To obtain detailed insights into the process of generating natural language explanations for VQA, we introduce the large-scale CLEVR-X dataset that extends the CLEVR dataset with natural language explanations. For each image-question pair in the CLEVR dataset, CLEVR-X contains multiple structured textual explanations which are derived from the original scene graphs. By construction, the CLEVR-X explanations are correct and describe the reasoning and visual information that is necessary to answer a given question. We conducted a user study to confirm that the ground-truth explanations in our proposed dataset are indeed complete and relevant. We present baseline results for generating natural language explanations in the context of VQA using two state-of-the-art frameworks on the CLEVR-X dataset. Furthermore, we provide a detailed analysis of the explanation generation quality for different question and answer types. Additionally, we study the influence of using different numbers of ground-truth explanations on the convergence of natural language generation (NLG) metrics. The CLEVR-X dataset is publicly available at https://github.com/ExplainableML/CLEVR-X.
引用
收藏
页码:69 / 88
页数:20
相关论文
共 50 条
  • [21] A Dataset and Architecture for Visual Reasoning with a Working Memory
    Yang, Guangyu Robert
    Ganichev, Igor
    Wang, Xiao-Jing
    Shlens, Jonathon
    Sussillo, David
    COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 729 - 745
  • [22] FACILITATING STUDENTS REASONING WITH CAUSAL EXPLANATIONS AND VISUAL REPRESENTATIONS
    REISER, BJ
    RANNEY, M
    LOVETT, MC
    KIMBERG, DY
    ARTIFICIAL INTELLIGENCE AND EDUCATION /: SYNTHESIS AND REFLECTION, 1989, : 228 - 235
  • [23] Chunk-aware Alignment and Lexical Constraint for Visual Entailment with Natural Language Explanations
    Yang, Qian
    Li, Yunxin
    Hu, Baotian
    Ma, Lin
    Ding, Yuxin
    Zhang, Min
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3587 - 3597
  • [24] Recommendation with Dynamic Natural Language Explanations
    Li, Xi
    Zhang, Jingsen
    Bo, Xiaohe
    Wang, Lei
    Chen, Xu
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [25] Faithfulness Tests for Natural Language Explanations
    Atanasova, Pepa
    Camburu, Oana-Maria
    Lioma, Christina
    Lukasiewicz, Thomas
    Simonsen, Jakob Grue
    Augenstein, Isabelle
    61ST CONFERENCE OF THE THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 2, 2023, : 283 - 294
  • [26] ShortcutLens: A Visual Analytics Approach for Exploring Shortcuts in Natural Language Understanding Dataset
    Jin, Zhihua
    Wang, Xingbo
    Cheng, Furui
    Sun, Chunhui
    Liu, Qun
    Qu, Huamin
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (07) : 3594 - 3608
  • [27] Generating Natural Counterfactual Visual Explanations
    Zhao, Wenqi
    Oyama, Satoshi
    Kurihara, Masahito
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 5204 - 5205
  • [28] A survey on XAI and natural language explanations
    Cambria, Erik
    Malandri, Lorenzo
    Mercorio, Fabio
    Mezzanzanica, Mario
    Nobani, Navid
    INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (01)
  • [29] Natural Language Explanations of Classifier Behavior
    de Aquino, Rodrigo Monteiro
    Cozman, Fabio Gagliardi
    2019 IEEE SECOND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND KNOWLEDGE ENGINEERING (AIKE), 2019, : 239 - 242
  • [30] Training Classifiers with Natural Language Explanations
    Hancock, Braden
    Varma, Paroma
    Wang, Stephanie
    Bringmann, Martin
    Liang, Percy
    Re, Christopher
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 1884 - 1895