HMSM-Net: Hierarchical multi-scale matching network for disparity estimation of high-resolution satellite stereo images

被引:26
|
作者
He, Sheng [1 ]
Li, Shenhong [1 ]
Jiang, San [2 ]
Jiang, Wanshou [1 ,3 ]
机构
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430079, Peoples R China
[2] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China
[3] Wuhan Univ, Collaborat Innovat Ctr Geospatial Technol, Wuhan 430079, Peoples R China
基金
中国国家自然科学基金;
关键词
Satellite stereo images; Disparity estimation; Convolutional neural network; Hierarchical multi-scale matching; GaoFen-7; dataset;
D O I
10.1016/j.isprsjprs.2022.04.020
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Disparity estimation of satellite stereo images is an essential and challenging task in photogrammetry and remote sensing. Recent researches have greatly promoted the development of disparity estimation algorithms by using CNN (Convolutional Neural Networks) based deep learning techniques. However, it is still difficult to handle intractable regions that are mainly caused by occlusions, disparity discontinuities, texture-less areas, and re-petitive patterns. Besides, the lack of training datasets for satellite stereo images remains another major issue that blocks the usage of CNN techniques due to the difficulty of obtaining ground-truth disparities. In this paper, we propose an end-to-end disparity learning model, termed hierarchical multi-scale matching network (HMSM-Net), for the disparity estimation of high-resolution satellite stereo images. First, multi-scale cost volumes are con-structed by using pyramidal features that capture spatial information of multiple levels, which learn corre-spondences at multiple scales and enable HMSM-Net to be more robust in intractable regions. Second, stereo matching is executed in a hierarchical coarse-to-fine manner by applying supervision to each scale, which allows a lower scale to act as prior knowledge and guides a higher scale to attain finer matching results. Third, a refinement module that incorporates the intensity and gradient information of the input left image is designed to regress a detailed full-resolution disparity map for local structure preservation. For network training and testing, a dense stereo matching dataset is created and published by using GaoFen-7 satellite stereo images. Finally, the proposed network is evaluated on the Urban Semantic 3D and GaoFen-7 datasets. Experimental results demonstrate that HMSM-Net achieves superior accuracy compared with state-of-the-art methods, and the improvement on intractable regions is noteworthy. Additionally, results and comparisons of different methods on the GaoFen-7 dataset show that it can severs as a challenging benchmark for performance assessment of methods applied to disparity estimation of satellite stereo images. The source codes and evaluation dataset are made publicly available at https://github.com/Sheng029/HMSM-Net.
引用
收藏
页码:314 / 330
页数:17
相关论文
共 50 条
  • [31] MFPANet: Multi-Scale Feature Perception and Aggregation Network for High-Resolution Snow Depth Estimation
    Zhao, Liling
    Chen, Junyu
    Shahzad, Muhammad
    Xia, Min
    Lin, Haifeng
    REMOTE SENSING, 2024, 16 (12)
  • [32] A light-weight stereo matching network based on multi-scale features fusion and robust disparity refinement
    Yang, Xiaowei
    Zhao, Yong
    Feng, Zhiguo
    Sang, Haiwei
    Zhang, Zhenbo
    Zhang, Guiying
    He, Lin
    IET IMAGE PROCESSING, 2023, 17 (06) : 1797 - 1811
  • [33] MTPose: Human Pose Estimation with High-Resolution Multi-scale Transformers
    Rui Wang
    Fudi Geng
    Xiangyang Wang
    Neural Processing Letters, 2022, 54 : 3941 - 3964
  • [34] MTPose: Human Pose Estimation with High-Resolution Multi-scale Transformers
    Wang, Rui
    Geng, Fudi
    Wang, Xiangyang
    NEURAL PROCESSING LETTERS, 2022, 54 (05) : 3941 - 3964
  • [35] MSDC-Net: Multi-Scale Dense and Contextual Networks for Stereo Matching
    Rao, Zhibo
    He, Mingyi
    Dai, Yuchao
    Zhu, Zhidong
    Li, Bo
    He, Renjie
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 578 - 583
  • [36] Double-Branch Multi-Scale Contextual Network: A Model for Multi-Scale Street Tree Segmentation in High-Resolution Remote Sensing Images
    Zhang, Hongyang
    Liu, Shuo
    SENSORS, 2024, 24 (04)
  • [37] Change Detection for High-Resolution Remote Sensing Images Based on a Multi-Scale Attention Siamese Network
    Li, Jiankang
    Zhu, Shanyou
    Gao, Yiyao
    Zhang, Guixin
    Xu, Yongming
    REMOTE SENSING, 2022, 14 (14)
  • [38] PAIRWISE STEREO IMAGE DISPARITY AND SEMANTICS ESTIMATION WITH THE COMBINATION OF U-NET AND PYRAMID STEREO MATCHING NETWORK
    Qin, Rongjun
    Huang, Xu
    Liu, Wei
    Xiao, Changlin
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 4971 - 4974
  • [39] MFINet: Multi-Scale Feature Interaction Network for Change Detection of High-Resolution Remote Sensing Images
    Ren, Wuxu
    Wang, Zhongchen
    Xia, Min
    Lin, Haifeng
    REMOTE SENSING, 2024, 16 (07)
  • [40] Learning a Dynamic High-Resolution Network for Multi-Scale Pedestrian Detection
    Ding, Mengyuan
    Zhang, Shanshan
    Yang, Jian
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9076 - 9082