Global-local feature attention network with reranking strategy for image caption generation

被引:1
|
作者
吴捷 [1 ]
谢斯雅 [1 ]
史新宝 [1 ]
陈耀文 [2 ]
机构
[1] College of Engineering, Shantou University
[2] Key Laboratory of Digital Signal and Image Processing of Guangdong, Shantou University
关键词
RS;
D O I
暂无
中图分类号
TP391.41 [];
学科分类号
080203 ;
摘要
In this paper, a novel framework, named as global-local feature attention network with reranking strategy(GLAN-RS), is presented for image captioning task. Rather than only adopting unitary visual information in the classical models, GLAN-RS explores the attention mechanism to capture local convolutional salient image maps. Furthermore, we adopt reranking strategy to adjust the priority of the candidate captions and select the best one. The proposed model is verified using the Microsoft Common Objects in Context(MSCOCO) benchmark dataset across seven standard evaluation metrics. Experimental results show that GLAN-RS significantly outperforms the state-of-the-art approaches, such as multimodal recurrent neural network(MRNN) and Google NIC, which gets an improvement of 20% in terms of BLEU4 score and 13 points in terms of CIDER score.
引用
收藏
页码:448 / 451
页数:4
相关论文
共 50 条
  • [21] Image Caption Generation Using Attention Model
    Ramalakshmi, Eliganti
    Jain, Moksh Sailesh
    Uddin, Mohammed Ameer
    [J]. INNOVATIVE DATA COMMUNICATION TECHNOLOGIES AND APPLICATION, ICIDCA 2021, 2022, 96 : 1009 - 1017
  • [22] Global-local attention for emotion recognition
    Le, Nhat
    Nguyen, Khanh
    Nguyen, Anh
    Le, Bac
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (24): : 21625 - 21639
  • [23] Hyperspectral Image Classification Based on Adaptive Global-Local Feature Fusion
    Yang, Chunlan
    Kong, Yi
    Wang, Xuesong
    Cheng, Yuhu
    [J]. REMOTE SENSING, 2024, 16 (11)
  • [24] SELF ADAPTIVE GLOBAL-LOCAL FEATURE ENHANCEMENT FOR RADIOLOGY REPORT GENERATION
    Wang, Yuhao
    Wang, Kai
    Liu, Xiaohong
    Gao, Tianrun
    Zhang, Jingyue
    Wang, Guangyu
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2275 - 2279
  • [25] Global-local attention for emotion recognition
    Nhat Le
    Khanh Nguyen
    Anh Nguyen
    Bac Le
    [J]. Neural Computing and Applications, 2022, 34 : 21625 - 21639
  • [26] Global-local graph attention: unifying global and local attention for node classification
    Lin, Keao
    Xie, Xiaozhu
    Weng, Wei
    Du, Xiaofeng
    [J]. COMPUTER JOURNAL, 2024, : 2959 - 2969
  • [27] GLOBAL-LOCAL AWARENESS NETWORK FOR IMAGE SUPER-RESOLUTION
    Pan, Pin-Chi
    Hsu, Tzu-Hao
    Wei, Wen-Li
    Lin, Jen-Chun
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1150 - 1154
  • [28] Dehazeformer: Nonhomogeneous Image Dehazing With Collaborative Global-local Network
    Luo, Xiao-Tong
    Yang, Wen-Jin
    Qu, Yan-Yun
    Xie, Yuan
    [J]. Zidonghua Xuebao/Acta Automatica Sinica, 2024, 50 (07): : 1333 - 1344
  • [29] Dual Attention-Based Global-Local Feature Extraction Network for Unsupervised Change Detection in PolSAR Images
    Xu, Dazhi
    Li, Ming
    Wu, Yan
    Zhang, Peng
    Xin, Xinyue
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 10842 - 10861
  • [30] Dynamic Global-Local Attention Network Based On Capsules for Text Classification
    Wang, Ji
    Chen, Qiaohong
    Pei, Haolei
    Sun, Qi
    Jia, Yubo
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,