Scene Attention Mechanism for Remote Sensing Image Caption Generation

Cited by: 30
Authors
Wu, Shiqi [1]
Zhang, Xiangrong [1]
Wang, Xin [1]
Li, Chen [2]
Jiao, Licheng [1]
Affiliations
[1] Xidian Univ, Minist Educ, Key Lab Intelligent Percept & Image Understanding, Xian 710071, Peoples R China
[2] Xi An Jiao Tong Univ, Sch Comp Sci & Technol, Xian 710049, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
remote sensing image captioning; convolutional neural network; long short-term memory network; scene attention mechanism;
DOI
10.1109/ijcnn48605.2020.9207381
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Remote sensing images play an important role in various applications. To make remote sensing images easier for humans to understand, the task of remote sensing image captioning is attracting growing attention from researchers. Inspired by the way humans receive visual information, the attention mechanism has been widely used in remote sensing image understanding. To capture more scene information and improve the stability of the generated sentences, a new attention mechanism called scene attention is proposed. In addition to the current attention computed from the current hidden state of the long short-term memory network (LSTM), the proposed method simultaneously explores global visual information from the mean of all convolutional features. The effectiveness of the proposed method is evaluated on the UCM-captions, Sydney-captions, and RSICD datasets. The experimental results show that, compared with some other captioning methods, the proposed method is more stable and achieves better performance.
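The abstract describes attention weights that depend on both the current LSTM hidden state and the mean of all convolutional features. The following is a minimal NumPy sketch of that idea, not the authors' implementation: the additive (tanh) scoring form, the function name `scene_attention`, the projection matrices `W_v`, `W_h`, `W_s`, the vector `w`, and all dimensions are assumptions chosen for illustration.

```python
import numpy as np


def softmax(x):
    """Numerically stable softmax over a 1-D array."""
    e = np.exp(x - np.max(x))
    return e / e.sum()


def scene_attention(features, hidden, W_v, W_h, W_s, w):
    """Hypothetical additive attention with a global scene term.

    features: (N, D) convolutional features, one row per image region
    hidden:   (H,)   current LSTM hidden state
    W_v, W_h, W_s: projections into a shared attention space of size A
    w:        (A,)   scoring vector
    """
    # Global scene descriptor: mean of all convolutional features.
    scene = features.mean(axis=0)                       # (D,)
    # Each region score combines the region feature, the hidden state,
    # and the scene descriptor (assumed additive form).
    pre = features @ W_v + hidden @ W_h + scene @ W_s   # (N, A)
    scores = np.tanh(pre) @ w                           # (N,)
    alpha = softmax(scores)                             # weights sum to 1
    context = alpha @ features                          # (D,) weighted sum
    return alpha, context


# Illustrative sizes only: a 7x7 feature map with 512 channels.
rng = np.random.default_rng(0)
N, D, H, A = 49, 512, 256, 128
features = rng.standard_normal((N, D))
hidden = rng.standard_normal(H)
W_v = rng.standard_normal((D, A)) * 0.01
W_h = rng.standard_normal((H, A)) * 0.01
W_s = rng.standard_normal((D, A)) * 0.01
w = rng.standard_normal(A)

alpha, context = scene_attention(features, hidden, W_v, W_h, W_s, w)
```

In this sketch the scene term is the same for every region, so it shifts the pre-activation for all regions before the tanh nonlinearity; the paper may combine the two signals differently.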
Pages: 7