Explicit Bias Discovery in Visual Question Answering Models

被引:34
|
作者
Manjunatha, Varun [1 ]
Saini, Nirat [2 ]
Davis, Larry S. [2 ]
机构
[1] Adobe Res, San Jose 95110, Costa Rica
[2] Univ Maryland, College Pk, MD 20742 USA
关键词
D O I
10.1109/CVPR.2019.00979
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Researchers have observed that Visual Question Answering (VQA) models tend to answer questions by learning statistical biases in the data. For example, their answer to the question "What is the color of the grass?" is usually "Green", whereas a question like "What is the title of the book?" cannot be answered by inferring statistical biases. It is of interest to the community to explicitly discover such biases, both for understanding the behavior of such models, and towards debugging them. Our work address this problem. In a database, we store the words of the question, answer and visual words corresponding to regions of interest in attention maps. By running simple rule mining algorithms on this database, we discover human -interpretable rules which give us unique insight into the behavior of such models. Our results also show examples of unusual behaviors learned by models in attempting VQA tasks.
引用
收藏
页码:9554 / 9563
页数:10
相关论文
共 50 条
  • [21] Explicit Reasoning over End-to-End Neural Architectures for Visual Question Answering
    Aditya, Somak
    Yang, Yezhou
    Baral, Chitta
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 629 - 637
  • [22] Visual Question Generation as Dual Task of Visual Question Answering
    Li, Yikang
    Duan, Nan
    Zhou, Bolei
    Chu, Xiao
    Ouyang, Wanli
    Wang, Xiaogang
    Zhou, Ming
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6116 - 6124
  • [23] Explicit History Selection for Conversational Question Answering
    Zhang Zhiyuan
    Feng Qiaoqiao
    Wang Yujie
    2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 1405 - 1410
  • [24] Answer Them All! Toward Universal Visual Question Answering Models
    Shrestha, Robik
    Kafle, Kushal
    Kanan, Christopher
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10464 - 10473
  • [25] A Critical Analysis of Benchmarks, Techniques, and Models in Medical Visual Question Answering
    Al-Hadhrami, Suheer
    Menai, Mohamed El Bachir
    Al-Ahmadi, Saad
    Alnafessah, Ahmed
    IEEE ACCESS, 2023, 11 : 136507 - 136540
  • [26] Question answering as a knowledge discovery technique
    Chali, Yllias
    WEBIST 2008: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 2, 2008, : 339 - 342
  • [27] Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering
    Vedantam, Ramakrishna
    Desai, Karan
    Lee, Stefan
    Rohrbach, Marcus
    Batra, Dhruv
    Parikh, Devi
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [28] RSAdapter: Adapting Multimodal Models for Remote Sensing Visual Question Answering
    Wang, Yuduo
    Ghamisi, Pedram
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [29] Sequential Visual Reasoning for Visual Question Answering
    Liu, Jinlai
    Wu, Chenfei
    Wang, Xiaojie
    Dong, Xuan
    PROCEEDINGS OF 2018 5TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2018, : 410 - 415
  • [30] Overcoming Language Priors via Shuffling Language Bias for Robust Visual Question Answering
    Zhao, J.
    Yu, Z.
    Zhang, X.
    Yang, Y.
    IEEE ACCESS, 2023, 11 : 85980 - 85989