Answering knowledge-based visual questions via the exploration of Question Purpose

Cited by: 7
Authors
Song, Lingyun [1, 2]
Li, Jianao [1, 2]
Liu, Jun [3, 4]
Yang, Yang [5]
Shang, Xuequn [1, 2]
Sun, Mingxuan [6]
Affiliations
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710129, Peoples R China
[2] Northwestern Polytech Univ, Key Lab Big Data Storage & Management, Minist Ind & Informat Technol, Xian 710129, Peoples R China
[3] SPKLSTN Lab, Dept Comp Sci & Technol, Xian 710049, Peoples R China
[4] Jiaotong Univ, Xian 710049, Peoples R China
[5] Univ Elect Sci & Technol, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[6] Louisiana State Univ, Sch Elect Engn & Comp Sci, Div Comp Sci & Engn, Baton Rouge, LA 70803 USA
Keywords
Visual question answering; DNN; Question Purpose;
DOI
10.1016/j.patcog.2022.109015
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Visual question answering has been greatly advanced by deep learning technologies, but it remains an open problem limited by two factors. First, previous works estimate the correctness of each candidate answer mainly by its semantic correlation with the visual question, overlooking the fact that some questions and their answers are semantically inconsistent. Second, previous works that require external knowledge mainly use the knowledge facts retrieved by keywords or visual objects. However, the retrieved knowledge facts may be related only to the semantics of the question while being useless or even misleading for answer prediction. To address these issues, we investigate how to capture the purpose of visual questions and propose a Purpose Guided Visual Question Answering model, called PGVQA. It has two main appealing properties: (1) It can estimate the correctness of candidate answers based on the Question Purpose (QP), which reveals which aspects of the concept are examined by visual questions. This helps to avoid the negative effect of semantic inconsistency between answers and questions. (2) It can incorporate the knowledge facts that accord with the QP into answer prediction, which helps to improve the probability of answering visual questions correctly. Empirical studies on benchmark datasets show that PGVQA achieves state-of-the-art performance. (c) 2022 Elsevier Ltd. All rights reserved.
Pages: 12
Related papers
50 in total
  • [41] Towards Knowledge-Based Tourism Chinese Question Answering System
    Li, Jiahui
    Luo, Zhiyi
    Huang, Hongyun
    Ding, Zuohua
    [J]. MATHEMATICS, 2022, 10 (04)
  • [42] Explainable Knowledge-Based Learning for Online Medical Question Answering
    Cui, Menglin
    Li, Xiang
    Qin, Peng
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT V, KSEM 2024, 2024, 14888 : 294 - 304
  • [43] Knowledge Acquisition for Visual Question Answering via Iterative Querying
    Zhu, Yuke
    Lim, Joseph J.
    Li Fei-Fei
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6146 - 6155
  • [44] Knowledge-based question answering using the semantic embedding space
    Yang, Min-Chul
    Lee, Do-Gil
    Park, So-Young
    Rim, Hae-Chang
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (23) : 9086 - 9104
  • [45] Knowledge-based question answering by tree-to-sequence learning
    Zhu, Shuguang
    Cheng, Xiang
    Su, Sen
    [J]. NEUROCOMPUTING, 2020, 372 : 64 - 72
  • [46] A Relateness-Based Ranking Method for Knowledge-Based Question Answering
    Ni, Han
    Lin, Liansheng
    Xu, Ge
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2018, PT II, 2018, 11109 : 393 - 400
  • [47] Video Question Answering via Knowledge-based Progressive Spatial-Temporal Attention Network
    Jin, Weike
    Zhao, Zhou
    Li, Yimeng
    Li, Jie
    Xiao, Jun
    Zhuang, Yueting
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 15 (02)
  • [48] Question answering with a conceptual framework for knowledge-based system development "Node of Knowledge"
    Pavlic, Mile
    Han, Zdravko Dovedan
    Jakupovic, Alen
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (12) : 5264 - 5286
  • [49] Localized Questions in Medical Visual Question Answering
    Tascon-Morales, Sergio
    Marquez-Neila, Pablo
    Sznitman, Raphael
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT II, 2023, 14221 : 361 - 370
  • [50] A knowledge-based question answering system to provide cognitive assistance to radiologists
    Pillai, Anup
    Katouzian, Amin
    Kanjaria, Karina
    Shivade, Chaitanya
    Jadhav, Ashutosh
    Bendersky, Marina
    Mukherjee, Vandana
    Syeda-Mahmood, Tanveer
    [J]. MEDICAL IMAGING 2019: IMAGING INFORMATICS FOR HEALTHCARE, RESEARCH, AND APPLICATIONS, 2019, 10954