Answer-Type Prediction for Visual Question Answering

被引:65
|
作者
Kafle, Kushal [1 ]
Kanan, Christopher [1 ]
机构
[1] Rochester Inst Technol, Chester F Carlson Ctr Imaging Sci, Rochester, NY 14623 USA
关键词
D O I
10.1109/CVPR.2016.538
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, algorithms for object recognition and related tasks have become sufficiently proficient that new vision tasks can now be pursued. In this paper, we build a system capable of answering open-ended text-based questions about images, which is known as Visual Question Answering (VQA). Our approach's key insight is that we can predict the form of the answer from the question. We formulate our solution in a Bayesian framework. When our approach is combined with a discriminative model, the combined model achieves state-of-the-art results on four benchmark datasets for open-ended VQA: DAQUAR, COCO-QA, The VQA Dataset, and Visual7W.
引用
收藏
页码:4976 / 4984
页数:9
相关论文
共 50 条
  • [31] Answer Category-Aware Answer Selection for Question Answering
    Wu, Weijing
    Deng, Yang
    Liang, Yuzhi
    Lei, Kai
    IEEE ACCESS, 2021, 9 : 126357 - 126365
  • [32] Tell-and-Answer: Towards Explainable Visual Question Answering using Attributes and Captions
    Li, Qing
    Fu, Jianlong
    Yu, Dongfei
    Mei, Tao
    Luo, Jiebo
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 1338 - 1346
  • [33] Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering
    Agrawal, Aishwarya
    Batra, Dhruv
    Parikh, Devi
    Kembhavi, Aniruddha
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4971 - 4980
  • [34] Improving Answer Type Classification Quality Through Combined Question Answering Datasets
    Perevalov, Aleksandr
    Both, Andreas
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2021, PT II, 2021, 12816 : 191 - 204
  • [36] Question Condensing Networks for Answer Selection in Community Question Answering
    Wu, Wei
    Sun, Xu
    Wang, Houfeng
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 1746 - 1755
  • [37] Efficient Question Answering with Question Decomposition and Multiple Answer Streams
    Hartrumpf, Sven
    Gloeckner, Ingo
    Leveling, Johannes
    EVALUATING SYSTEMS FOR MULTILINGUAL AND MULTIMODAL INFORMATION ACCESS, 2009, 5706 : 421 - +
  • [38] Word/Phrase based Answer Type Classification for Bengali Question Answering System
    Islam, Md. Aminul
    Kabir, Md. Fasihul
    Abdullah-Al-Mamun, Khandaker
    Huda, Mohammad Nurul
    2016 5TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS AND VISION (ICIEV), 2016, : 445 - 448
  • [39] A proposal of Expected Answer Type and Named Entity annotation in a Question Answering context
    Boldrini, E.
    Ferrandez, S.
    Izquierdo, R.
    Ferrandez O., Tomas D.
    Vicedo, J. L.
    HSI: 2009 2ND CONFERENCE ON HUMAN SYSTEM INTERACTIONS, 2009, : 315 - 319
  • [40] Exploring Answer Information for Question Classification in Community Question Answering
    Wang, Jian
    Lin, Hongfei
    Dong, Hualei
    Xiong, Daping
    Yang, Zhihao
    JOURNAL OF MULTIPLE-VALUED LOGIC AND SOFT COMPUTING, 2018, 31 (1-2) : 67 - 84