A Self-Attentive Hybrid Coding Network for 3D Change Detection in High-Resolution Optical Stereo Images

Cited by: 7
Authors
Pan, Jianping [1]
Li, Xin [1]
Cai, Zhuoyan [1]
Sun, Bowen [1]
Cui, Wei [1]
Affiliations
[1] Chongqing Jiaotong Univ, Smart City Coll, Chongqing 400074, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
multimodal fusion; self-attention; multi-path hybrid coding; dense skip-connection decoding; 3D change detection; stereo mapping satellite; REMOTE-SENSING IMAGES; BUILDING CHANGE DETECTION; TIME-SERIES; SEGMENTATION; MULTISOURCE; FUSION;
DOI
10.3390/rs14092046
Chinese Library Classification (CLC) code
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Real-time monitoring of urban building development provides a basis for urban planning and management, and remote sensing change detection is a key technology for achieving it. Intelligent change detection based on deep learning of remote sensing images is a current focus of research. However, most methods use only unimodal remote sensing data and ignore vertical features, which leads to incomplete characterization, poor detection of small targets, and both false detections and omissions. To solve these problems, we propose a multi-path self-attentive hybrid coding network model (MAHNet) that fuses high-resolution remote sensing images and digital surface models (DSMs) for 3D change detection of urban buildings. We use stereo images from the Gaofen-7 (GF-7) stereo mapping satellite as the data source. In the encoding stage, we propose a multi-path hybrid encoder that efficiently mines multi-dimensional features from multimodal data. In the deep feature fusion stage, we design a dual self-attentive fusion structure that improves the deep fusion and characterization of multimodal features. In the decoding stage, we design a dense skip-connection decoder that fuses multi-scale features flexibly and reduces the loss of spatial information in small-change regions during down-sampling, while enhancing feature utilization and propagation efficiency. Experimental results show that MAHNet achieves accurate pixel-level change detection in complex urban scenes, with an overall accuracy of 97.44% and an F1-score of 92.59%, outperforming the other change detection methods compared.
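The abstract reports two pixel-level metrics, overall accuracy and F1-score. As a minimal sketch of how such scores are commonly computed from a predicted and a reference binary change mask, the following may help; the function name `change_detection_scores` and the toy masks are hypothetical and are not taken from the paper, whose evaluation code is not shown here.

```python
def change_detection_scores(pred, truth):
    """Return (overall_accuracy, f1) for two flat binary change masks.

    Both inputs are equal-length sequences of 0 (unchanged) and 1 (changed).
    Hypothetical helper for illustration only.
    """
    if len(pred) != len(truth):
        raise ValueError("masks must have the same number of pixels")
    if len(pred) == 0:
        raise ValueError("masks must not be empty")

    # Count the four confusion-matrix cells pixel by pixel.
    tp = sum(1 for p, t in zip(pred, truth) if p == 1 and t == 1)
    tn = sum(1 for p, t in zip(pred, truth) if p == 0 and t == 0)
    fp = sum(1 for p, t in zip(pred, truth) if p == 1 and t == 0)
    fn = sum(1 for p, t in zip(pred, truth) if p == 0 and t == 1)

    # Overall accuracy: share of pixels labelled correctly.
    overall_accuracy = (tp + tn) / (tp + tn + fp + fn)

    # F1: harmonic mean of precision and recall on the "changed" class.
    denom = 2 * tp + fp + fn
    f1 = 2 * tp / denom if denom else 0.0
    return overall_accuracy, f1


# Toy masks (assumed data, not from the paper).
pred = [1, 1, 0, 0, 1, 0, 0, 0]
truth = [1, 0, 0, 0, 1, 1, 0, 0]
oa, f1 = change_detection_scores(pred, truth)
print(f"OA = {oa:.4f}, F1 = {f1:.4f}")
```

Because changed pixels are usually a small fraction of an urban scene, overall accuracy can stay high even when many changes are missed, which is one reason F1 is typically reported alongside it.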
Pages: 22
Related papers
50 in total
  • [1] Self-Attentive Generative Adversarial Network for Cloud Detection in High Resolution Remote Sensing Images
    Wu, Zhaocong
    Li, Jun
    Wang, Yisong
    Hu, Zhongwen
    Molinier, Matthieu
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (10) : 1792 - 1796
  • [2] A Deeply Supervised Attentive High-Resolution Network for Change Detection in Remote Sensing Images
    Wu, Jinming
    Xie, Chunhui
    Zhang, Zuxi
    Zhu, Yongxin
    [J]. REMOTE SENSING, 2023, 15 (01)
  • [3] MixFormer: A Self-Attentive Convolutional Network for 3D Mesh Object Recognition
    Huang, Lingfeng
    Zhao, Jieyu
    Chen, Yu
    [J]. ALGORITHMS, 2023, 16 (03)
  • [4] 3D BUILDING CHANGE DETECTION USING HIGH RESOLUTION STEREO IMAGES AND A GIS DATABASE
    Dini, G. R.
    Jacobsen, K.
    Rottensteiner, F.
    Al Rajhi, M.
    Heipke, C.
    [J]. XXII ISPRS CONGRESS, TECHNICAL COMMISSION VII, 2012, 39 (B7) : 299 - 304
  • [5] ADHR-CDNet: Attentive Differential High-Resolution Change Detection Network for Remote Sensing Images
    Zhang, Xiuwei
    Tian, Mu
    Xing, Yinghui
    Yue, Yuanzeng
    Li, Yanping
    Yin, Hanlin
    Xia, Runliang
    Jin, Jin
    Zhang, Yanning
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [6] 3D mapping with high-resolution images
    Toutin, T
    Chénier, R
    Carbonneau, Y
    Alcaïde, N
    [J]. GEOINFORMATION FOR EUROPEAN-WIDE INTEGRATION, 2003 : 121 - 125
  • [7] 3D Reconstruction of Building Based on High-Resolution SAR and Optical Images
    Zhu Junjie
    Ding Chibiao
    You Hongjian
    Xie Minghong
    [J]. 2006 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOLS 1-8, 2006 : 3794 - 3797
  • [8] Self-attentive 3D human pose and shape estimation from videos
    Chen, Yun-Chun
    Piccirilli, Marco
    Piramuthu, Robinson
    Yang, Ming-Hsuan
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 213
  • [9] Windthrow damage detection in Nordic forests by 3D reconstruction of very high-resolution stereo optical satellite imagery
    Zubkov, Peter
    Solberg, Svein
    McInnes, Harold
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (16) : 4963 - 4988