Answer-Type Prediction for Visual Question Answering

被引:65
|
作者
Kafle, Kushal [1 ]
Kanan, Christopher [1 ]
机构
[1] Rochester Inst Technol, Chester F Carlson Ctr Imaging Sci, Rochester, NY 14623 USA
关键词
D O I
10.1109/CVPR.2016.538
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, algorithms for object recognition and related tasks have become sufficiently proficient that new vision tasks can now be pursued. In this paper, we build a system capable of answering open-ended text-based questions about images, which is known as Visual Question Answering (VQA). Our approach's key insight is that we can predict the form of the answer from the question. We formulate our solution in a Bayesian framework. When our approach is combined with a discriminative model, the combined model achieves state-of-the-art results on four benchmark datasets for open-ended VQA: DAQUAR, COCO-QA, The VQA Dataset, and Visual7W.
引用
收藏
页码:4976 / 4984
页数:9
相关论文
共 50 条
  • [1] Extreme Classification for Answer Type Prediction in Question Answering
    Setty, Vinay
    2023 ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES, JCDL, 2023, : 232 - 236
  • [2] Question-aware prediction with candidate answer recommendation for visual question answering
    Kim, B.
    Kim, J.
    ELECTRONICS LETTERS, 2017, 53 (18) : 1244 - 1245
  • [3] Answer Distillation for Visual Question Answering
    Fang, Zhiwei
    Liu, Jing
    Tang, Qu
    Li, Yong
    Lu, Hanqing
    COMPUTER VISION - ACCV 2018, PT I, 2019, 11361 : 72 - 87
  • [4] Learning Answer Embeddings for Visual Question Answering
    Hu, Hexiang
    Chao, Wei-Lun
    Sha, Fei
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5428 - 5436
  • [5] An Answer FeedBack Network for Visual Question Answering
    Tian, Weidong
    Tian, Ruihua
    Zhao, Zhongqiu
    Ren, Quan
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [6] Verification of the Expected Answer Type for Biomedical Question Answering
    Kamath, Sanjay
    Grau, Brigitte
    Ma, Yue
    COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 1093 - 1097
  • [7] RANKVQA: ANSWER RE-RANKING FOR VISUAL QUESTION ANSWERING
    Qiao, Yanyuan
    Yu, Zheng
    Liu, Jing
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [8] Descriptive Question Answering with Answer Type Independent Features
    Yoon, Yeo-Chan
    Lee, Chang-Ki
    Kim, Hyun-Ki
    Jang, Myung-Gil
    Ryu, Pum Mo
    Park, So-Young
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (07) : 2009 - 2012
  • [9] Answer type validation in a question-answering system
    Grappy, Arnaud
    Grau, Brigitte
    CORIA 2010: Actes de la COnference en Recherche d'Information et Applications - Proceedings of the Conference on Information Retrieval and Applications, 2010, : 131 - 146
  • [10] Question Type Guided Attention in Visual Question Answering
    Shi, Yang
    Furlanello, Tommaso
    Zha, Sheng
    Anandkumar, Animashree
    COMPUTER VISION - ECCV 2018, PT IV, 2018, 11208 : 158 - 175