TSFE: Two-Stage Feature Enhancement for Remote Sensing Image Captioning

Cited by: 3
Authors
Guo, Jie [1]
Li, Ze [1]
Song, Bin [1]
Chi, Yuhao [1]
Affiliations
[1] Xidian Univ, State Key Lab Integrated Serv Networks, Xian 710071, Peoples R China
Keywords
attention mechanism; fine-grained feature; two-stage enhancement; remote sensing image captioning; feature interaction decoder; FUSION
DOI
10.3390/rs16111843
Chinese Library Classification (CLC)
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
In the field of remote sensing image captioning (RSIC), mainstream methods typically adopt an encoder-decoder framework. Methods based on this framework often use only simple feature fusion strategies and therefore fail to fully mine the fine-grained features of remote sensing images. Moreover, the decoder lacks context information, which makes the generated sentences less accurate. To address these problems, we propose a two-stage feature enhancement model (TSFE) for remote sensing image captioning. In the first stage, we adopt an adaptive feature fusion strategy to acquire multi-scale features. In the second stage, we further mine fine-grained features from the multi-scale features by establishing associations between different regions of the image. In addition, we introduce global features carrying scene information into the decoder to help generate descriptions. Experimental results on the RSICD, UCM-Captions, and Sydney-Captions datasets demonstrate that the proposed method outperforms existing state-of-the-art approaches.
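The two stages described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the fusion or association mechanisms, so the sketch assumes a softmax-gated weighted sum for the adaptive fusion, scaled dot-product self-attention for the region associations, and mean pooling for the global scene feature. All function names (`adaptive_fuse`, `region_attention`, `global_feature`) and the fixed gate logits are hypothetical; in a trained model the gates would be learned.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def adaptive_fuse(scales, gate_logits):
    """Stage 1 (assumed form): weighted sum of same-shape feature maps.

    scales: list of feature maps, each a list of region vectors.
    gate_logits: one score per scale; fixed here, learned in a real model.
    """
    weights = softmax(gate_logits)
    n_regions, dim = len(scales[0]), len(scales[0][0])
    return [
        [sum(w * fm[r][d] for w, fm in zip(weights, scales)) for d in range(dim)]
        for r in range(n_regions)
    ]


def region_attention(features):
    """Stage 2 (assumed form): self-attention relating every region to all others."""
    dim = len(features[0])
    enhanced = []
    for query in features:
        scores = [
            sum(a * b for a, b in zip(query, key)) / math.sqrt(dim)
            for key in features
        ]
        attn = softmax(scores)
        enhanced.append(
            [sum(a * v[d] for a, v in zip(attn, features)) for d in range(dim)]
        )
    return enhanced


def global_feature(features):
    """Mean-pool region vectors into one scene-level vector for the decoder."""
    n = len(features)
    return [sum(f[d] for f in features) / n for d in range(len(features[0]))]


# Toy input: two scales, three regions, two channels each.
coarse = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
fine = [[0.5, 0.5], [0.2, 0.8], [0.9, 0.1]]

fused = adaptive_fuse([coarse, fine], gate_logits=[0.0, 0.0])
enhanced = region_attention(fused)
scene = global_feature(enhanced)
```

With equal gate logits the fusion reduces to a plain average of the two scales, which is the "simple fusion" baseline the abstract argues against; unequal, input-dependent gates are what would make the fusion adaptive.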
Pages: 19