Height estimation from single aerial imagery using contrastive learning based multi-scale refinement network

被引:2
|
作者
Zhao, Wufan [1 ]
Ding, Hu [2 ]
Na, Jiaming [3 ]
Li, Mengmeng [4 ]
Tiede, Dirk [5 ]
机构
[1] Katholieke Univ Leuven, Fac Engn Technol, Dept Civil Engn, Geomat Sect, Leuven, Belgium
[2] South China Normal Univ, Sch Geog, Guangzhou 510631, Peoples R China
[3] Nanjing Forestry Univ, Coll Civil Engn, Nanjing 210037, Peoples R China
[4] Fuzhou Univ, Acad Digital China Fujian, Fuzhou, Peoples R China
[5] Paris Lodron Univ Salzburg, Fac Digital & Analyt Sci, Dept Geoinformat Z GIS, Salzburg, Austria
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Height estimation; aerial imagery; digital surface models; contrastive learning; local implicit constrain; EXTRACTION; STEREO;
D O I
10.1080/17538947.2023.2225881
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Height map estimation from a single aerial image plays a crucial role in localization, mapping, and 3D object detection. Deep convolutional neural networks have been used to predict height information from single-view remote sensing images, but these methods rely on large volumes of training data and often overlook geometric features present in orthographic images. To address these issues, this study proposes a gradient-based self-supervised learning network with momentum contrastive loss to extract geometric information from non-labeled images in the pretraining stage. Additionally, novel local implicit constraint layers are used at multiple decoding stages in the proposed supervised network to refine high-resolution features in height estimation. The structural-aware loss is also applied to improve the robustness of the network to positional shift and minor structural changes along the boundary area. Experimental evaluation on the ISPRS benchmark datasets shows that the proposed method outperforms other baseline networks, with minimum MAE and RMSE of 0.116 and 0.289 for the Vaihingen dataset and 0.077 and 0.481 for the Potsdam dataset, respectively. The proposed method also shows around threefold data efficiency improvements on the Potsdam dataset and domain generalization on the Enschede datasets. These results demonstrate the effectiveness of the proposed method in height map estimation from single-view remote sensing images.
引用
收藏
页码:2346 / 2364
页数:19
相关论文
共 50 条
  • [11] Multi-scale network for single image deblurring based on ensemble learning module
    Wu W.
    Pan Y.
    Su N.
    Wang J.
    Wu S.
    Xu Z.
    Yu Y.
    Liu Y.
    Multimedia Tools and Applications, 2025, 84 (11) : 9045 - 9064
  • [12] Semantic Segmentation in Aerial Imagery Using Multi-level Contrastive Learning with Local Consistency
    Tang, Maofeng
    Georgiou, Konstantinos
    Qi, Hairong
    Champion, Cody
    Bosch, Marc
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 3787 - 3796
  • [13] Saliency Region Selection in Large Aerial Imagery Using Multi-scale SLIC Segmentation
    Sahli, Samir
    Lavigne, Daniel A.
    Sheng, Yunlong
    AIRBORNE INTELLIGENCE, SURVEILLANCE, RECONNAISSANCE (ISR) SYSTEMS AND APPLICATIONS IX, 2012, 8360
  • [14] Adaptive aerial object detection based on multi-scale deep learning
    Liu F.
    Han X.
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2022, 43 (05):
  • [15] Robust log anomaly detection based on contrastive learning and multi-scale MASS
    Wang, Xuejie
    Cao, Qilei
    Wang, Qiaozheng
    Cao, Zhiying
    Zhang, Xiuguo
    Wang, Peipeng
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (16): : 17491 - 17512
  • [16] Robust log anomaly detection based on contrastive learning and multi-scale MASS
    Xuejie Wang
    Qilei Cao
    Qiaozheng Wang
    Zhiying Cao
    Xiuguo Zhang
    Peipeng Wang
    The Journal of Supercomputing, 2022, 78 : 17491 - 17512
  • [17] Depth Estimation from a Single Defocused Image using Multi-scale Kernels
    Wang, Haoqian
    Tian, Yushi
    Wu, Wei
    Wang, Xingzheng
    2014 13TH INTERNATIONAL CONFERENCE ON CONTROL AUTOMATION ROBOTICS & VISION (ICARCV), 2014, : 1524 - 1527
  • [18] Aerial-Image Denoising Based on Convolutional Neural Network with Multi-Scale Residual Learning Approach
    Chen, Chong
    Xu, Zengbo
    INFORMATION, 2018, 9 (07)
  • [19] Normality Learning-based Graph Anomaly Detection via Multi-Scale Contrastive Learning
    Duan, Jingcan
    Zhang, Pei
    Wang, Siwei
    Hu, Jingtao
    Jin, Hu
    Zhang, Jiaxin
    Zhou, Haifang
    Liu, Xinwang
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 7502 - 7511
  • [20] Multi-Scale Contrastive Learning based Weakly Supervised Learning for Remote Sensing Scene Classification
    Peng, Rui
    Zhao, Wenzhi
    Zhang, Liqiang
    Chen, Xuehong
    Journal of Geo-Information Science, 2022, 24 (07) : 1375 - 1390