An Answer FeedBack Network for Visual Question Answering

Cited by: 0
Authors
Tian, Weidong [1]
Tian, Ruihua [1]
Zhao, Zhongqiu [1]
Ren, Quan [1]
Affiliations
[1] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei, Anhui, Peoples R China
Funding
National Natural Science Foundation of China
DOI
10.1109/IJCNN54540.2023.10191079
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recent advances have explored the power of the transformer architecture in Visual Question Answering (VQA). However, most of these models suffer from misalignment of multimodal features and focus on unimportant image regions when answering the given questions. To address this, we propose an Answer FeedBack Network (AFBN) that focuses on image region features that are more beneficial for answering questions. The answers generated by the backbone network are fed back into the network as feedback information. We then propose a FeedBack module (FB) to control the answer feedback. Additionally, we adopt a consistency loss function to reconstruct the image region features. With this loss, the model can ensure that the image region features related to the question and those related to the answer are the same. Extensive experiments on the VQA-v2 benchmark dataset show that our method achieves better performance than state-of-the-art methods.
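The abstract gives no implementation details, so the following is only a toy sketch of the general idea it describes: attend over image regions with the question, feed an answer embedding back through a gate, attend again, and penalize disagreement between the two attended features. Every name here (`attend`, `feedback_query`, `consistency_loss`, the scalar sigmoid gate, the mean-squared-error form of the loss) is an assumption made for illustration and is not taken from the AFBN paper.

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of floats.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def attend(query, regions):
    # Attention-weighted sum of region features for one query vector.
    weights = softmax([dot(query, r) for r in regions])
    dim = len(regions[0])
    return [sum(w * r[i] for w, r in zip(weights, regions)) for i in range(dim)]

def feedback_query(question, answer, gate_logit):
    # Hypothetical stand-in for a feedback module: a scalar sigmoid gate
    # controls how much of the answer embedding is mixed into the question.
    gate = 1.0 / (1.0 + math.exp(-gate_logit))
    return [q + gate * a for q, a in zip(question, answer)]

def consistency_loss(feat_a, feat_b):
    # Assumed form: mean squared error between two attended feature vectors.
    return sum((a - b) ** 2 for a, b in zip(feat_a, feat_b)) / len(feat_a)

# Toy data: three image regions with 2-d features.
regions = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
question = [1.0, 0.0]
answer = [0.0, 2.0]  # embedding of the first-pass answer (made up)

first_pass = attend(question, regions)
second_pass = attend(feedback_query(question, answer, gate_logit=0.0), regions)
loss = consistency_loss(first_pass, second_pass)
```

In this sketch a strongly negative `gate_logit` closes the gate, so the second pass reduces to the first and the loss approaches zero; a larger gate lets the answer shift attention toward other regions, which the loss then penalizes.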
Pages: 7