Scene Attention Mechanism for Remote Sensing Image Caption Generation

被引:30
|
作者
Wu, Shiqi [1 ]
Zhang, Xiangrong [1 ]
Wang, Xin [1 ]
Li, Chen [2 ]
Jiao, Licheng [1 ]
机构
[1] Xidian Univ, Minist Educ, Key Lab Intelligent Percept & Image Understanding, Xian 710071, Peoples R China
[2] Xi An Jiao Tong Univ, Sch Comp Sci & Technol, Xian 710049, Peoples R China
基金
中国国家自然科学基金;
关键词
remote sensing image captioning; convolutional neural network; long short-term memory network; scene attention mechanism;
D O I
10.1109/ijcnn48605.2020.9207381
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Remote sensing images play an important role in various applications. To make it easier for humans to understand remote sensing images, the task of remote sensing image captioning attracts more and more researchers' attention. Inspired from the way human receives visual information, attention mechanism has been widely used in remote sensing image understanding. To catch more scene information and improve the stability of the generated sentences, a new attention mechanism called scene attention is proposed. Except for the current attention via the current hidden state of the long short-term memory network (LSTM), our proposed method simultaneously explores the global visual information from the mean feature of all convolutional features. The effectiveness of the proposed method is evaluated on UCM-captions, Sydney-captions and RSICD datasets. The results of our experiment show that comparing with some other captioning methods, our method is more stable and obtains a better performance.
引用
收藏
页数:7
相关论文
共 50 条
  • [21] Topic Scene Graph Generation by Attention Distillation from Caption
    Wang, Wenbin
    Wang, Ruiping
    Chen, Xilin
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15880 - 15890
  • [22] Remote sensing scene image classification model based on multi-scale features and attention mechanism
    Wang, Guowei
    Xu, Haixia
    Wang, Xinyu
    Yuan, Liming
    Wen, Xianbin
    JOURNAL OF APPLIED REMOTE SENSING, 2022, 16 (04)
  • [23] A Discriminative Feature Representation Method Based on Dual Attention Mechanism for Remote Sensing Image Scene Classification
    Xu Congan
    Lu Yafei
    Zhang Xiaohan
    Liu Yu
    Cui Chenhao
    Gu Xiangqi
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (03) : 683 - 691
  • [24] Ensemble model with cascade attention mechanism for high-resolution remote sensing image scene classification
    Li, Fengpeng
    Feng, Ruyi
    Han, Wei
    Wang, Lizhe
    OPTICS EXPRESS, 2020, 28 (15) : 22358 - 22387
  • [25] A image caption method of construction scene based on attention mechanism and encoding-decoding architecture
    Nong Y.-J.
    Wang J.-J.
    Chen H.
    Sun W.-H.
    Geng H.
    Li S.-Y.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2022, 56 (02): : 236 - 244
  • [26] Few-Shot Scene Classification with Attention Mechanism in Remote Sensing
    Zhang, Duona
    Zhao, Hongjia
    Lu, Yuanyao
    Cui, Jian
    Zhang, Baochang
    Computer Engineering and Applications, 2024, 60 (04) : 173 - 182
  • [27] Rethinking Image Generation From Scene Graphs With Attention Mechanism
    Amuche, Chikwendu Ijeoma
    Zhang, Xiaoling
    Ukwuoma, Chiagoziem Chima
    Adjei-Mensah, Isaac
    Abdou, Assila Abdallah
    Onyedikachi, Chikwendu Chinyere
    2024 4TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATION SYSTEMS, CCCIS 2024, 2024, : 80 - 84
  • [28] CAPFORMER: PURE TRANSFORMER FOR REMOTE SENSING IMAGE CAPTION
    Wang, Junjue
    Chen, Zihang
    Ma, Ailong
    Zhong, Yanfei
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 7996 - 7999
  • [29] Remote Sensing Image Generation Based on Attention Mechanism and VAE-MSGAN for ROI Extraction
    Zhang, Libao
    Liu, Yanan
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [30] Bahdanau Attention Based Bengali Image Caption Generation
    Alam, Md Sahrial
    Rahman, Md Sayedur
    Hosen, Md Ikbal
    Mubin, Khairul Anam
    Hossen, Sharif
    Mridha, M. F.
    2022 INTERNATIONAL CONFERENCE ON DECISION AID SCIENCES AND APPLICATIONS (DASA), 2022, : 1073 - 1077