BENet: bi-directional enhanced network for image captioning

被引:0
|
作者
Peixin Yan
Zuoyong Li
Rong Hu
Xinrong Cao
机构
[1] Fujian University of Technology,Fujian Provincial Key Laboratory of Big Data Mining and Applications, School of Computer Science and Mathematics
[2] Minjiang University,Fujian Provincial Key Laboratory of Information Processing and Intelligent Control, College of Computer and Control Engineering
来源
Multimedia Systems | 2024年 / 30卷
关键词
Image captioning; Transformer; Bi-directional enhanced network; Memory bank; Reconstruct;
D O I
暂无
中图分类号
学科分类号
摘要
Transformer-based models have been used in image captioning to generate a natural language text for describing a given image accurately. In this paper, we propose a bi-directional enhanced network, which strengthens the correlation between image features and text features by the memory bank to improve the performance of the transformer-based encoder–decoder framework for image captioning. In addition, we fine-tune the connection method in the encoder to obtain rich image features. Specifically, during training, the memory bank is first used to store the correspondences between images and annotated texts in the dataset as additional information of image features. After processing through the encoder, we feed the visual features composed of image features and the additional information in the memory bank into the decoder to generate better caption. Subsequently, we utilize a decoder-like architecture to reconstruct visual features from the generated caption. Finally, we calculate the similarity loss between the reconstructed features and the visual features to optimize the encoder. Extensive experiments on the MSCOCO benchmark demonstrate that the proposed method has shown promising results on both the Karpathy test split and the online test server, providing evidence of its effectiveness.
引用
收藏
相关论文
共 50 条
  • [41] A novel WDM passive optical network with bi-directional protection
    Chan, TJ
    Chan, CK
    Chan, K
    Hung, W
    Chen, LK
    APOC 2002: ASIA-PACIFIC OPTICAL AND WIRELESS COMMUNICATIONS; NETWORK DESIGN AND MANAGEMENT, 2002, 4909 : 167 - 173
  • [42] Bi-directional complementary cascade lightweight network for edge detection
    Peng, Jiansheng
    Luo, Zhengqiao
    Lin, Chuan
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, : 8965 - 8974
  • [43] Bi-directional astrocytic regulation of neuronal activity within a network
    Gordleeva, S. Yu
    Stasenko, S. V.
    Semyanov, A. V.
    Dityatev, A. E.
    Kazantsev, V. B.
    FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2012, 6
  • [44] Bi-directional Features Reuse Network for Salient Object Detection
    Jia, Fengwei
    Wang, Xuan
    Guan, Jian
    Qi, Shuhan
    Liao, Qing
    Li, Huale
    PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2019, 11672 : 29 - 41
  • [45] Conflict-free AGV routing in bi-directional network
    Maza, S
    Castagna, P
    ETFA 2001: 8TH IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION, VOL 2, PROCEEDINGS, 2001, : 761 - 764
  • [46] A Deep Bi-directional Attention Network for Human Motion Recovery
    Cui, Qiongjie
    Sun, Huaijiang
    Li, Yupeng
    Kong, Yue
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 701 - 707
  • [47] BI-DIRECTIONAL IMMUNE-RESPONSES WITHIN AN IDIOTYPE NETWORK
    HORNG, WJ
    KAZDIN, DS
    ANNALS OF THE NEW YORK ACADEMY OF SCIENCES, 1983, 418 (DEC) : 317 - 323
  • [48] Deep Bi-Directional LSTM Network for Query Intent Detection
    Sreelakshmi, K.
    Rafeeque, P. C.
    Sreetha, S.
    Gayathri, E. S.
    8TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATIONS (ICACC-2018), 2018, 143 : 939 - 946
  • [49] Bi-directional information guidance network for UAV vehicle detection
    Yang, Jianxiu
    Xie, Xuemei
    Wang, Zhenyuan
    Zhang, Peng
    Zhong, Wei
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (04) : 5301 - 5316
  • [50] BiDFDC-Net: a dense connection network based on bi-directional feedback for skin image segmentation
    Jiang, Jinyun
    Sun, Zitong
    Zhang, Qile
    Lan, Kun
    Jiang, Xiaoliang
    Wu, Jun
    FRONTIERS IN PHYSIOLOGY, 2023, 14