PROGRESSIVE SCALE-AWARE NETWORK FOR REMOTE SENSING IMAGE CHANGE CAPTIONING

被引:5
|
作者
Liu, Chenyang [1 ,3 ]
Yang, Jiajun [1 ,3 ]
Qi, Zipeng [1 ,3 ]
Zou, Zhengxia [2 ,3 ]
Shi, Zhenwei [1 ,3 ]
机构
[1] Beihang Univ, Image Proc Ctr, Sch Astronaut, Beijing 100191, Peoples R China
[2] Beihang Univ, Dept Guidance Nav & Control, Sch Astronaut, Beijing 100191, Peoples R China
[3] Shanghai Artificial Intelligence Lab, Shanghai 200232, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Remote sensing image; change captioning; Transformer; scale-aware reinforcement;
D O I
10.1109/IGARSS52108.2023.10283451
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
Remote sensing (RS) images contain numerous objects of different scales, which poses significant challenges for the RS image change captioning (RSICC) task to identify visual changes of interest in complex scenes and describe them via language. However, current methods still have some weaknesses in sufficiently extracting and utilizing multi-scale information. In this paper, we propose a progressive scale-aware network (PSNet) to address the problem. PSNet is a pure Transformer-based model. To sufficiently extract multi-scale visual features, multiple progressive difference perception (PDP) layers are stacked to progressively exploit the differencing features of bitemporal features. To sufficiently utilize the extracted multi-scale features for captioning, we propose a scale-aware reinforcement (SR) module and combine it with the Transformer decoding layer to progressively utilize the features from different PDP layers. Experiments show that the PDP layer and SR module are effective and our PSNet outperforms previous methods.
引用
收藏
页码:6668 / 6671
页数:4
相关论文
共 50 条
  • [1] Interactive Change-Aware Transformer Network for Remote Sensing Image Change Captioning
    Cai, Chen
    Wang, Yi
    Yap, Kim-Hui
    [J]. REMOTE SENSING, 2023, 15 (23)
  • [2] Scale-aware Progressive Optimization Network
    Chen, Ying
    Huang, Lifeng
    Gao, Chengying
    Liu, Ning
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2211 - 2219
  • [3] Multi-scale Attentive Fusion Network for Remote Sensing Image Change Captioning
    Chen, Cai
    Wang, Yi
    Yap, Kim-Hui
    [J]. 2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [4] Global to Local: A Scale-Aware Network for Remote Sensing Object Detection
    Gao, Tao
    Niu, Qianqian
    Zhang, Jing
    Chen, Ting
    Mei, Shaohui
    Jubair, Ahmad
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [5] Object Counting in Remote Sensing via Triple Attention and Scale-Aware Network
    Guo, Xiangyu
    Anisetti, Marco
    Gao, Mingliang
    Jeon, Gwanggil
    [J]. REMOTE SENSING, 2022, 14 (24)
  • [6] Multi-scale feature progressive fusion network for remote sensing image change detection
    Di Lu
    Shuli Cheng
    Liejun Wang
    Shiji Song
    [J]. Scientific Reports, 12
  • [7] Multi-scale feature progressive fusion network for remote sensing image change detection
    Lu, Di
    Cheng, Shuli
    Wang, Liejun
    Song, Shiji
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [8] Scale-Aware Neural Network for Semantic Segmentation of Multi-Resolution Remote Sensing Images
    Wang, Libo
    Zhang, Ce
    Li, Rui
    Duan, Chenxi
    Meng, Xiaoliang
    Atkinson, Peter M.
    [J]. REMOTE SENSING, 2021, 13 (24)
  • [9] Intensive Positioning Network for Remote Sensing Image Captioning
    Wang, Shengsheng
    Chen, Jiawei
    Wang, Guangyao
    [J]. INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING, 2018, 11266 : 567 - 576
  • [10] Multiscale Multiinteraction Network for Remote Sensing Image Captioning
    Wang, Yong
    Zhang, Wenkai
    Zhang, Zhengyuan
    Gao, Xin
    Sun, Xian
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 2154 - 2165