Scene Attention Mechanism for Remote Sensing Image Caption Generation

Cited by: 30
Authors
Wu, Shiqi [1]
Zhang, Xiangrong [1]
Wang, Xin [1]
Li, Chen [2]
Jiao, Licheng [1]
Affiliations
[1] Xidian Univ, Minist Educ, Key Lab Intelligent Percept & Image Understanding, Xian 710071, Peoples R China
[2] Xi An Jiao Tong Univ, Sch Comp Sci & Technol, Xian 710049, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
remote sensing image captioning; convolutional neural network; long short-term memory network; scene attention mechanism;
DOI
10.1109/ijcnn48605.2020.9207381
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Remote sensing images play an important role in various applications. To make remote sensing images easier for humans to understand, the task of remote sensing image captioning is attracting growing attention from researchers. Inspired by the way humans receive visual information, attention mechanisms have been widely used in remote sensing image understanding. To capture more scene information and improve the stability of the generated sentences, a new attention mechanism called scene attention is proposed. In addition to the current attention computed from the current hidden state of the long short-term memory (LSTM) network, the proposed method simultaneously exploits global visual information from the mean of all convolutional features. The effectiveness of the proposed method is evaluated on the UCM-captions, Sydney-captions, and RSICD datasets. The experimental results show that, compared with some other captioning methods, the proposed method is more stable and achieves better performance.
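The abstract gives only the idea, not the equations, so the following is a minimal sketch under stated assumptions and not the authors' formulation. It assumes an additive scoring function in which each region score depends on the region feature, the current LSTM hidden state, and the mean of all convolutional features. The names `scene_attention_sketch`, `w_feat`, `w_hidden`, and `w_scene` are hypothetical, and the projection weights are plain vectors to keep the example small.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def mean_feature(features):
    """Average all region features into one global (scene) vector."""
    count = len(features)
    return [sum(column) / count for column in zip(*features)]


def scene_attention_sketch(features, hidden, w_feat, w_hidden, w_scene):
    """Hypothetical additive attention using hidden state and mean feature.

    features: list of region feature vectors (one per spatial location)
    hidden:   current LSTM hidden state vector
    w_*:      assumed projection vectors, not taken from the paper
    """
    scene = mean_feature(features)
    # Term shared by all regions: hidden-state cue plus global scene cue.
    shared = dot(w_hidden, hidden) + dot(w_scene, scene)
    scores = [math.tanh(dot(w_feat, f) + shared) for f in features]
    weights = softmax(scores)
    # Context vector: attention-weighted sum of the region features.
    dim = len(features[0])
    context = [sum(w * f[d] for w, f in zip(weights, features)) for d in range(dim)]
    return weights, context


# Toy inputs: three regions with 2-D features and a 2-D hidden state.
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
hidden = [0.5, -0.5]
weights, context = scene_attention_sketch(
    features, hidden, w_feat=[0.3, 0.7], w_hidden=[0.2, 0.1], w_scene=[0.4, 0.4]
)
```

In a captioning decoder, a context vector of this kind would typically be fed to the LSTM at each time step together with the previous word embedding; how the paper combines the two attention sources may differ from this sketch.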
Pages: 7
Related papers
50 in total
  • [31] A Multi-Branch Feature Fusion Strategy Based on an Attention Mechanism for Remote Sensing Image Scene Classification
    Shi, Cuiping
    Zhao, Xin
    Wang, Liguo
    REMOTE SENSING, 2021, 13 (10)
  • [32] Fine-grained attention for image caption generation
    Chang, Yan-Shuo
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (03) : 2959 - 2971
  • [34] Combining Multilevel Features for Remote Sensing Image Scene Classification With Attention Model
    Ji, Jinsheng
    Zhang, Tao
    Jiang, Linfeng
    Zhong, Weilin
    Xiong, Huilin
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (09) : 1647 - 1651
  • [35] Remote Sensing Image Scene Classification Based on Multidimensional Attention and Feature Enhancement
    Liu, Chengrui
    Dai, Hong
    Wang, Shuang
    Chen, Junhong
    IAENG International Journal of Computer Science, 2023, 50 (04)
  • [36] Remote Sensing Image Segmentation Model Based on Attention Mechanism
    Hang, Liu
    Wang Xili
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (04)
  • [37] Remote Sensing Image Retrieval Based on Regional Attention Mechanism
    Peng Yanfei
    Mei Jinye
    Wang Kaixin
    Zi Lingling
    Sang Yu
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (10)
  • [38] Research for image caption based on global attention mechanism
    Tong, Wu
    Tao, Ku
    Hao, Zhang
    SECOND TARGET RECOGNITION AND ARTIFICIAL INTELLIGENCE SUMMIT FORUM, 2020, 11427
  • [39] Improved method for image caption with global attention mechanism
    Ma S.
    Zhang G.
    Jiao Y.
    Shi G.
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2019, 46 (02) : 17 - 22
  • [40] Lie Group spatial attention mechanism model for remote sensing scene classification
    Xu, Chengjun
    Zhu, Guobin
    Shu, Jingqian
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2022, 43 (07) : 2461 - 2474