COIN: Counterfactual Image Generation for Visual Question Answering Interpretation

被引:3
|
作者
Boukhers, Zeyd [1 ]
Hartmann, Timo [1 ]
Juerjens, Jan [1 ,2 ]
机构
[1] Univ Koblenz Landau, Fac Comp Sci, D-56070 Koblenz, Germany
[2] Fraunhofer Inst Software & Syst Engn ISST, D-44227 Dortmund, Germany
关键词
ML interpretability; VQA; GAN; UXE; LANGUAGE;
D O I
10.3390/s22062245
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Due to the significant advancement of Natural Language Processing and Computer Vision-based models, Visual Question Answering (VQA) systems are becoming more intelligent and advanced. However, they are still error-prone when dealing with relatively complex questions. Therefore, it is important to understand the behaviour of the VQA models before adopting their results. In this paper, we introduce an interpretability approach for VQA models by generating counterfactual images. Specifically, the generated image is supposed to have the minimal possible change to the original image and leads the VQA model to give a different answer. In addition, our approach ensures that the generated image is realistic. Since quantitative metrics cannot be employed to evaluate the interpretability of the model, we carried out a user study to assess different aspects of our approach. In addition to interpreting the result of VQA models on single images, the obtained results and the discussion provides an extensive explanation of VQA models' behaviour.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Efficient Counterfactual Debiasing for Visual Question Answering
    Kolling, Camila
    More, Martin
    Gavenski, Nathan
    Pooch, Eduardo
    Parraga, Otavio
    Barros, Rodrigo C.
    [J]. 2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2572 - 2581
  • [2] Counterfactual Mix-Up for Visual Question Answering
    Cho, Jae Won
    Kim, Dong-Jin
    Jung, Yunjae
    Kweon, In So
    [J]. IEEE ACCESS, 2023, 11 : 95201 - 95212
  • [3] Customized Image Narrative Generation via Interactive Visual Question Generation and Answering
    Shin, Andrew
    Ushiku, Yoshitaka
    Harada, Tatsuya
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8925 - 8933
  • [4] Learning to Contrast the Counterfactual Samples for Robust Visual Question Answering
    Liang, Zujie
    Jiang, Weitao
    Hu, Haifeng
    Zhu, Jiaying
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 3285 - 3292
  • [5] Debiasing Medical Visual Question Answering via Counterfactual Training
    Zhan, Chenlu
    Peng, Peng
    Zhang, Hanrong
    Sun, Haiyue
    Shang, Chunnan
    Chen, Tao
    Wang, Hongsen
    Wang, Gaoang
    Wang, Hongwei
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT II, 2023, 14221 : 382 - 393
  • [6] Overcoming Language Priors with Counterfactual Inference for Visual Question Answering
    Ren, Zhibo
    Wang, Huizhen
    Zhu, Muhua
    Wang, Yichao
    Xiao, Tong
    Zhu, Jingbo
    [J]. CHINESE COMPUTATIONAL LINGUISTICS, CCL 2023, 2023, 14232 : 58 - 71
  • [7] Counterfactual Samples Synthesizing and Training for Robust Visual Question Answering
    Chen, Long
    Zheng, Yuhang
    Niu, Yulei
    Zhang, Hanwang
    Xiao, Jun
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13218 - 13234
  • [8] Visual Question Generation as Dual Task of Visual Question Answering
    Li, Yikang
    Duan, Nan
    Zhou, Bolei
    Chu, Xiao
    Ouyang, Wanli
    Wang, Xiaogang
    Zhou, Ming
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6116 - 6124
  • [9] Robust Visual Question Answering Based on Counterfactual Samples and Relationship Perception
    Qin, Hong
    An, Gaoyun
    Ruan, Qiuqi
    [J]. IMAGE AND GRAPHICS TECHNOLOGIES AND APPLICATIONS, IGTA 2021, 2021, 1480 : 145 - 158
  • [10] HCCL: Hierarchical Counterfactual Contrastive Learning for Robust Visual Question Answering
    Laboratory of Cognition and Decision Intelligence for Complex Systems, Institute of Automation, Chinese Academy of Sciences, China
    不详
    [J]. ACM Trans. Multimedia Comput. Commun. Appl, 2024, 10