Scene Attention Mechanism for Remote Sensing Image Caption Generation

被引:23
|
作者
Wu, Shiqi [1 ]
Zhang, Xiangrong [1 ]
Wang, Xin [1 ]
Li, Chen [2 ]
Jiao, Licheng [1 ]
机构
[1] Xidian Univ, Minist Educ, Key Lab Intelligent Percept & Image Understanding, Xian 710071, Peoples R China
[2] Xi An Jiao Tong Univ, Sch Comp Sci & Technol, Xian 710049, Peoples R China
基金
中国国家自然科学基金;
关键词
remote sensing image captioning; convolutional neural network; long short-term memory network; scene attention mechanism;
D O I
10.1109/ijcnn48605.2020.9207381
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Remote sensing images play an important role in various applications. To make it easier for humans to understand remote sensing images, the task of remote sensing image captioning attracts more and more researchers' attention. Inspired from the way human receives visual information, attention mechanism has been widely used in remote sensing image understanding. To catch more scene information and improve the stability of the generated sentences, a new attention mechanism called scene attention is proposed. Except for the current attention via the current hidden state of the long short-term memory network (LSTM), our proposed method simultaneously explores the global visual information from the mean feature of all convolutional features. The effectiveness of the proposed method is evaluated on UCM-captions, Sydney-captions and RSICD datasets. The results of our experiment show that comparing with some other captioning methods, our method is more stable and obtains a better performance.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Image caption generation with dual attention mechanism
    Liu, Maofu
    Li, Lingjun
    Hu, Huijun
    Guan, Weili
    Tian, Jing
    INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (02)
  • [2] Image caption generation using a dual attention mechanism
    Padate, Roshni
    Jain, Amit
    Kalla, Mukesh
    Sharma, Arvind
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [3] Improved Attention Mechanism and Residual Network for Remote Sensing Image Scene Classification
    Kong, Jiayuan
    Gao, Yurong
    Zhang, Yanjun
    Lei, Huimin
    Wang, Yao
    Zhang, Hesheng
    IEEE ACCESS, 2021, 9 : 134800 - 134808
  • [4] Exploring Models and Data for Remote Sensing Image Caption Generation
    Lu, Xiaoqiang
    Wang, Binqiang
    Zheng, Xiangtao
    Li, Xuelong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (04): : 2183 - 2195
  • [5] Remote Sensing Image Caption Method Based on Attention and Reinforcement Learning
    Nong Yuanjun
    Wang Junjie
    ACTA OPTICA SINICA, 2021, 41 (22)
  • [6] Remote Sensing Image Caption Method Based on Attention and Reinforcement Learning
    Nong Y.
    Wang J.
    Guangxue Xuebao/Acta Optica Sinica, 2021, 41 (22):
  • [7] Assamese news image caption generation using attention mechanism
    Das, Ringki
    Singh, Thoudam Doren
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (07) : 10051 - 10069
  • [8] Image caption generation method based on adaptive attention mechanism
    Jin, Huazhong
    Wu, Yu
    Wan, Fang
    Hu, Man
    Li, Qingqing
    MIPPR 2019: PATTERN RECOGNITION AND COMPUTER VISION, 2020, 11430
  • [9] Assamese news image caption generation using attention mechanism
    Ringki Das
    Thoudam Doren Singh
    Multimedia Tools and Applications, 2022, 81 : 10051 - 10069
  • [10] Remote sensing image caption generation via transformer and reinforcement learning
    Shen, Xiangqing
    Liu, Bing
    Zhou, Yong
    Zhao, Jiaqi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (35-36) : 26661 - 26682