PROGRESSIVE SCALE-AWARE NETWORK FOR REMOTE SENSING IMAGE CHANGE CAPTIONING

被引:5
|
作者
Liu, Chenyang [1 ,3 ]
Yang, Jiajun [1 ,3 ]
Qi, Zipeng [1 ,3 ]
Zou, Zhengxia [2 ,3 ]
Shi, Zhenwei [1 ,3 ]
机构
[1] Beihang Univ, Image Proc Ctr, Sch Astronaut, Beijing 100191, Peoples R China
[2] Beihang Univ, Dept Guidance Nav & Control, Sch Astronaut, Beijing 100191, Peoples R China
[3] Shanghai Artificial Intelligence Lab, Shanghai 200232, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Remote sensing image; change captioning; Transformer; scale-aware reinforcement;
D O I
10.1109/IGARSS52108.2023.10283451
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
Remote sensing (RS) images contain numerous objects of different scales, which poses significant challenges for the RS image change captioning (RSICC) task to identify visual changes of interest in complex scenes and describe them via language. However, current methods still have some weaknesses in sufficiently extracting and utilizing multi-scale information. In this paper, we propose a progressive scale-aware network (PSNet) to address the problem. PSNet is a pure Transformer-based model. To sufficiently extract multi-scale visual features, multiple progressive difference perception (PDP) layers are stacked to progressively exploit the differencing features of bitemporal features. To sufficiently utilize the extracted multi-scale features for captioning, we propose a scale-aware reinforcement (SR) module and combine it with the Transformer decoding layer to progressively utilize the features from different PDP layers. Experiments show that the PDP layer and SR module are effective and our PSNet outperforms previous methods.
引用
收藏
页码:6668 / 6671
页数:4
相关论文
共 50 条
  • [21] PSTNet: Progressive Sampling Transformer Network for Remote Sensing Image Change Detection
    Song, Xinyang
    Hua, Zhen
    Li, Jinjiang
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 8442 - 8455
  • [22] Learning consensus-aware semantic knowledge for remote sensing image captioning
    Li, Yunpeng
    Zhang, Xiangrong
    Cheng, Xina
    Tang, Xu
    Jiao, Licheng
    [J]. PATTERN RECOGNITION, 2024, 145
  • [23] Scale-Aware Distillation Network for Lightweight Image Super-Resolution
    Lu, Haowei
    Lu, Yao
    Li, Gongping
    Sun, Yanbei
    Wang, Shunzhou
    Li, Yugang
    [J]. PATTERN RECOGNITION AND COMPUTER VISION,, PT III, 2021, 13021 : 128 - 139
  • [24] RSCaMa: Remote Sensing Image Change Captioning With State Space Model
    Liu, Chenyang
    Chen, Keyan
    Chen, Bowen
    Zhang, Haotian
    Zou, Zhengxia
    Shi, Zhenwei
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [25] A Decoupling Paradigm With Prompt Learning for Remote Sensing Image Change Captioning
    Liu, Chenyang
    Zhao, Rui
    Chen, Jianqi
    Qi, Zipeng
    Zou, Zhengxia
    Shi, Zhenwei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [26] MULTI-SCALE CROPPING MECHANISM FOR REMOTE SENSING IMAGE CAPTIONING
    Zhang, Xueting
    Wang, Qi
    Chen, Shangdong
    Li, Xuelong
    [J]. 2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 10039 - 10042
  • [27] A Lightweight Sparse Focus Transformer for Remote Sensing Image Change Captioning
    Sun, Dongwei
    Bao, Yajie
    Liu, Junmin
    Cao, Xiangyong
    [J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2024, 17 : 18727 - 18738
  • [28] Change Captioning: A New Paradigm for Multitemporal Remote Sensing Image Analysis
    Hoxha, Genc
    Chouaf, Seloua
    Melgani, Farid
    Smara, Youcef
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [29] Retrieval Topic Recurrent Memory Network for Remote Sensing Image Captioning
    Wang, Binqiang
    Zheng, Xiangtao
    Qu, Bo
    Lu, Xiaoqiang
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 : 256 - 270
  • [30] Attention to Scale: Scale-aware Semantic Image Segmentation
    Chen, Liang-Chieh
    Yang, Yi
    Wang, Jiang
    Xu, Wei
    Yuille, Alan L.
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3640 - 3649