Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction

被引:173
|
作者
Noh, Hyeonwoo [1 ]
Seo, Paul Hongsuck [1 ]
Han, Bohyung [1 ]
机构
[1] POSTECH, Dept Comp Sci & Engn, Pohang, South Korea
关键词
D O I
10.1109/CVPR.2016.11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We tackle image question answering (ImageQA) problem by learning a convolutional neural network (CNN) with a dynamic parameter layer whose weights are determined adaptively based on questions. For the adaptive parameter prediction, we employ a separate parameter prediction network, which consists of gated recurrent unit (GRU) taking a question as its input and a fully-connected layer generating a set of candidate weights as its output. However, it is challenging to construct a parameter prediction network for a large number of parameters in the fully-connected dynamic parameter layer of the CNN. We reduce the complexity of this problem by incorporating a hashing technique, where the candidate weights given by the parameter prediction network are selected using a predefined hash function to determine individual weights in the dynamic parameter layer. The proposed network-joint network with the CNN for ImageQA and the parameter prediction network-is trained end-to-end through back-propagation, where its weights are initialized using a pre-trained CNN and GRU. The proposed algorithm illustrates the state-of-the-art performance on all available public ImageQA benchmarks.
引用
收藏
页码:30 / 38
页数:9
相关论文
共 50 条
  • [1] A Novel Approach on Visual Question Answering by Parameter Prediction using Faster Region Based Convolutional Neural Network
    Jha, Sudan
    Dey, Anirban
    Kumar, Raghvendra
    Kumar-Solanki, Vijender
    [J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2019, 5 (05): : 30 - 37
  • [2] Applying a Convolutional Neural Network to Legal Question Answering
    Kim, Mi-Young
    Xu, Ying
    Goebel, Randy
    [J]. NEW FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2017, 10091 : 282 - 294
  • [3] Visual Question Answering with Dynamic Parameter Prediction using Functional Hashing
    Li, Chunye
    Zhou, Zhiping
    [J]. PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ROBOTICS, INTELLIGENT CONTROL AND ARTIFICIAL INTELLIGENCE (RICAI 2019), 2019, : 330 - 335
  • [4] Predicting closed questions on community question answering sites using convolutional neural network
    Roy, Pradeep Kumar
    Singh, Jyoti Prakash
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (14): : 10555 - 10572
  • [5] Predicting closed questions on community question answering sites using convolutional neural network
    Pradeep Kumar Roy
    Jyoti Prakash Singh
    [J]. Neural Computing and Applications, 2020, 32 : 10555 - 10572
  • [6] A Shallow Convolutional Neural Network Architecture for Open Domain Question Answering
    Rosso-Mateus, Andres
    Gonzalez, Fabio A.
    Montes-y-Gomez, Manuel
    [J]. ADVANCES IN COMPUTING, CCC 2017, 2017, 735 : 485 - 494
  • [7] Recurrent convolutional neural network for answer selection in community question answering
    Zhou, Xiaoqiang
    Hu, Baotian
    Chen, Qingcai
    Wang, Xiaolong
    [J]. NEUROCOMPUTING, 2018, 274 : 8 - 18
  • [8] A Novel Hierarchical Convolutional Neural Network for Question Answering over Paragraphs
    Zheng, Suncong
    Bao, Hongyun
    Zhao, Jun
    Zhang, Jie
    Qi, Zhenyu
    Hao, Hongwei
    [J]. 2015 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT), VOL 1, 2015, : 60 - 66
  • [9] Dynamic graph convolutional neural network for image sentiment distribution prediction
    Su, Yu-Ting
    Wang, Ji
    Zhao, Wei
    Jing, Pei-Guang
    [J]. Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2023, 53 (09): : 2601 - 2610
  • [10] Static Correlative Filter based Convolutional Neural Network for Visual Question Answering
    Chen, Lijun
    Li, Qinyu
    Wang, Hanli
    Long, Yu
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018, : 526 - 529