Computational construction grammar for visual question answering

Cited by: 5
Authors
Nevens, Jens [1]
Van Eecke, Paul [1]
Beuls, Katrien [1]
Affiliations
[1] Vrije Universiteit Brussel, Artificial Intelligence Lab, Pleinlaan 2, B-1050 Brussels, Belgium
Source
LINGUISTICS VANGUARD | 2019 / Vol. 5 / Issue 1
Keywords
Computational Construction Grammar; Fluid Construction Grammar; Natural Language Understanding; Procedural Semantics; Visual Question Answering
DOI
10.1515/lingvan-2018-0070
Chinese Library Classification number
H0 [Linguistics]
Subject classification codes
030303; 0501; 050102
Abstract
In order to be able to answer a natural language question, a computational system needs three main capabilities. First, the system needs to be able to analyze the question into a structured query, revealing its component parts and how these are combined. Second, it needs to have access to relevant knowledge sources, such as databases, texts or images. Third, it needs to be able to execute the query on these knowledge sources. This paper focuses on the first capability, presenting a novel approach to semantically parsing questions expressed in natural language. The method makes use of a computational construction grammar model for mapping questions onto their executable semantic representations. We demonstrate and evaluate the methodology on the CLEVR visual question answering benchmark task. Our system achieves 100% accuracy, effectively solving the language understanding part of the benchmark task. Additionally, we demonstrate how this solution can be embedded in a full visual question answering system, in which a question is answered by executing its semantic representation on an image. The main advantages of the approach include (i) its transparent and interpretable properties, (ii) its extensibility, and (iii) the fact that the method does not rely on any annotated training data.
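The third capability named in the abstract, executing a structured query on a knowledge source, can be illustrated with a minimal sketch. This is not the authors' system: the scene, the primitive names (`filter`, `count`, `query`), and the linear program format are all hypothetical, chosen only to show what an executable semantic representation might look like for CLEVR-style questions. The programs are written by hand here, whereas in the paper they are produced by the construction grammar.

```python
# Hypothetical toy scene standing in for an image's symbolic annotation.
SCENE = [
    {"shape": "cube", "color": "red", "size": "large"},
    {"shape": "sphere", "color": "blue", "size": "small"},
    {"shape": "cube", "color": "blue", "size": "small"},
]


def filter_attr(objects, attribute, value):
    """Keep only the objects whose attribute has the given value."""
    return [obj for obj in objects if obj[attribute] == value]


def count(objects):
    """Return the number of objects in the current set."""
    return len(objects)


def query(objects, attribute):
    """Return an attribute of the single object in the current set."""
    if len(objects) != 1:
        raise ValueError("query expects exactly one object")
    return objects[0][attribute]


# Assumed primitive inventory; the names are illustrative only.
PRIMITIVES = {"filter": filter_attr, "count": count, "query": query}


def execute(program, scene):
    """Run each step on the result of the previous one, starting from the scene."""
    value = scene
    for name, *args in program:
        value = PRIMITIVES[name](value, *args)
    return value


# Hand-written stand-in for the parse of "How many blue things are there?"
HOW_MANY_BLUE = [("filter", "color", "blue"), ("count",)]

# Hand-written stand-in for the parse of "What shape is the large red thing?"
WHAT_SHAPE_LARGE_RED = [
    ("filter", "size", "large"),
    ("filter", "color", "red"),
    ("query", "shape"),
]

if __name__ == "__main__":
    print(execute(HOW_MANY_BLUE, SCENE))
    print(execute(WHAT_SHAPE_LARGE_RED, SCENE))
```

Keeping each primitive a plain function makes every intermediate result inspectable, which is one way to read the transparency claim in the abstract; real CLEVR programs also include branching steps (for example comparisons between two object sets) that this linear sketch does not cover.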
Pages: 16
Related papers
  • [1] Question Modifiers in Visual Question Answering
    Britton, William
    Sarkhel, Somdeb
    Venugopal, Deepak
    [J]. LREC 2022: THIRTEENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1472 - 1479
  • [2] Safety compliance checking of construction behaviors using visual question answering
    Ding, Yuexiong
    Liu, Muyang
    Luo, Xiaowei
    [J]. AUTOMATION IN CONSTRUCTION, 2022, 144
  • [3] A GRAMMAR BASE QUESTION-ANSWERING PROCEDURE
    ROSENBAUM, PS
    [J]. COMMUNICATIONS OF THE ACM, 1967, 10 (10) : 630 - +
  • [4] VQA: Visual Question Answering
    Antol, Stanislaw
    Agrawal, Aishwarya
    Lu, Jiasen
    Mitchell, Margaret
    Batra, Dhruv
    Zitnick, C. Lawrence
    Parikh, Devi
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2425 - 2433
  • [5] VQA: Visual Question Answering
    Agrawal, Aishwarya
    Lu, Jiasen
    Antol, Stanislaw
    Mitchell, Margaret
    Zitnick, C. Lawrence
    Parikh, Devi
    Batra, Dhruv
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2017, 123 (01) : 4 - 31
  • [6] Visual Question Answering: A tutorial
    Teney, Damien
    Wu, Qi
    van den Hengel, Anton
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) : 63 - 75
  • [7] Visual Question Generation as Dual Task of Visual Question Answering
    Li, Yikang
    Duan, Nan
    Zhou, Bolei
    Chu, Xiao
    Ouyang, Wanli
    Wang, Xiaogang
    Zhou, Ming
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6116 - 6124
  • [8] Sequential Visual Reasoning for Visual Question Answering
    Liu, Jinlai
    Wu, Chenfei
    Wang, Xiaojie
    Dong, Xuan
    [J]. PROCEEDINGS OF 2018 5TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2018, : 410 - 415
  • [9] Constrained Semantic Grammar Enabled Question Answering System
    Wang, Dongsheng
    Wang, Shi
    Wang, Weiming
    Fu, Jianhui
    Dai, Yun
    [J]. CHALLENGES AND OPPORTUNITY WITH BIG DATA, 2017, 10228 : 55 - 65
  • [10] A constraint grammar based question answering system for Portuguese
    Bick, E
    [J]. PROGRESS IN ARTIFICIAL INTELLIGENCE-B, 2003, 2902 : 414 - 418