Height estimation from single aerial imagery using contrastive learning based multi-scale refinement network

被引:2
|
作者
Zhao, Wufan [1 ]
Ding, Hu [2 ]
Na, Jiaming [3 ]
Li, Mengmeng [4 ]
Tiede, Dirk [5 ]
机构
[1] Katholieke Univ Leuven, Fac Engn Technol, Dept Civil Engn, Geomat Sect, Leuven, Belgium
[2] South China Normal Univ, Sch Geog, Guangzhou 510631, Peoples R China
[3] Nanjing Forestry Univ, Coll Civil Engn, Nanjing 210037, Peoples R China
[4] Fuzhou Univ, Acad Digital China Fujian, Fuzhou, Peoples R China
[5] Paris Lodron Univ Salzburg, Fac Digital & Analyt Sci, Dept Geoinformat Z GIS, Salzburg, Austria
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Height estimation; aerial imagery; digital surface models; contrastive learning; local implicit constrain; EXTRACTION; STEREO;
D O I
10.1080/17538947.2023.2225881
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Height map estimation from a single aerial image plays a crucial role in localization, mapping, and 3D object detection. Deep convolutional neural networks have been used to predict height information from single-view remote sensing images, but these methods rely on large volumes of training data and often overlook geometric features present in orthographic images. To address these issues, this study proposes a gradient-based self-supervised learning network with momentum contrastive loss to extract geometric information from non-labeled images in the pretraining stage. Additionally, novel local implicit constraint layers are used at multiple decoding stages in the proposed supervised network to refine high-resolution features in height estimation. The structural-aware loss is also applied to improve the robustness of the network to positional shift and minor structural changes along the boundary area. Experimental evaluation on the ISPRS benchmark datasets shows that the proposed method outperforms other baseline networks, with minimum MAE and RMSE of 0.116 and 0.289 for the Vaihingen dataset and 0.077 and 0.481 for the Potsdam dataset, respectively. The proposed method also shows around threefold data efficiency improvements on the Potsdam dataset and domain generalization on the Enschede datasets. These results demonstrate the effectiveness of the proposed method in height map estimation from single-view remote sensing images.
引用
收藏
页码:2346 / 2364
页数:19
相关论文
共 50 条
  • [1] Multi-Scale Contrastive Learning for Human Pose Estimation
    Bao, Wenxia
    Lin, An
    Huang, Hua
    Yang, Xianjun
    Chen, Hemu
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2024, E107D (10) : 1332 - 1341
  • [2] Water extraction from optical high-resolution remote sensing imagery: a multi-scale feature extraction network with contrastive learning
    Liu, Bo
    Du, Shihong
    Bai, Lubin
    Ouyang, Song
    Wang, Haoyu
    Zhang, Xiuyuan
    GISCIENCE & REMOTE SENSING, 2023, 60 (01)
  • [3] Motor Imagery EEG Classification Based on Transfer Learning and Multi-Scale Convolution Network
    Chang, Zhanyuan
    Zhang, Congcong
    Li, Chuanjiang
    MICROMACHINES, 2022, 13 (06)
  • [4] Multi-scale and multi-modal contrastive learning network for biomedical time series
    Guo, Hongbo
    Xu, Xinzi
    Wu, Hao
    Liu, Bin
    Xia, Jiahui
    Cheng, Yibang
    Guo, Qianhui
    Chen, Yi
    Xu, Tingyan
    Wang, Jiguang
    Wang, Guoxing
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 106
  • [5] MULTI-SCALE REFINEMENT NETWORK BASED ACOUSTIC ECHO CANCELLATION
    Cui, Fan
    Guo, Liyong
    Li, Wenfeng
    Gao, Peng
    Wang, Yujun
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 9132 - 9136
  • [6] Height Estimation From Single Aerial Images Using a Deep Ordinal Regression Network
    Li, Xiang
    Wang, Mingyang
    Fang, Yi
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [7] SDMNet: Spatially dilated multi-scale network for object detection for drone aerial imagery
    Battish, Neeraj
    Kaur, Dapinder
    Chugh, Moksh
    Poddar, Shashi
    IMAGE AND VISION COMPUTING, 2024, 150
  • [8] Selective Learning of Human Pose Estimation Based on Multi-Scale Convergence Network
    Liu, Wenkai
    Qin, Cuizhu
    Wu, Menglong
    Bai, Wenle
    Dong, Hongxia
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2023, E106D (05) : 1081 - 1084
  • [9] Image manipulation detection and localization using multi-scale contrastive learning
    Bai, Ruyi
    APPLIED SOFT COMPUTING, 2024, 163
  • [10] Unsupervised dehazing of multi-scale residuals based on weighted contrastive learning
    Wang, Jianing
    Zhang, Yongsheng
    Liu, Zuoyang
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (06)