Height estimation from single aerial imagery using contrastive learning based multi-scale refinement network

被引:2
|
作者
Zhao, Wufan [1 ]
Ding, Hu [2 ]
Na, Jiaming [3 ]
Li, Mengmeng [4 ]
Tiede, Dirk [5 ]
机构
[1] Katholieke Univ Leuven, Fac Engn Technol, Dept Civil Engn, Geomat Sect, Leuven, Belgium
[2] South China Normal Univ, Sch Geog, Guangzhou 510631, Peoples R China
[3] Nanjing Forestry Univ, Coll Civil Engn, Nanjing 210037, Peoples R China
[4] Fuzhou Univ, Acad Digital China Fujian, Fuzhou, Peoples R China
[5] Paris Lodron Univ Salzburg, Fac Digital & Analyt Sci, Dept Geoinformat Z GIS, Salzburg, Austria
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Height estimation; aerial imagery; digital surface models; contrastive learning; local implicit constrain; EXTRACTION; STEREO;
D O I
10.1080/17538947.2023.2225881
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Height map estimation from a single aerial image plays a crucial role in localization, mapping, and 3D object detection. Deep convolutional neural networks have been used to predict height information from single-view remote sensing images, but these methods rely on large volumes of training data and often overlook geometric features present in orthographic images. To address these issues, this study proposes a gradient-based self-supervised learning network with momentum contrastive loss to extract geometric information from non-labeled images in the pretraining stage. Additionally, novel local implicit constraint layers are used at multiple decoding stages in the proposed supervised network to refine high-resolution features in height estimation. The structural-aware loss is also applied to improve the robustness of the network to positional shift and minor structural changes along the boundary area. Experimental evaluation on the ISPRS benchmark datasets shows that the proposed method outperforms other baseline networks, with minimum MAE and RMSE of 0.116 and 0.289 for the Vaihingen dataset and 0.077 and 0.481 for the Potsdam dataset, respectively. The proposed method also shows around threefold data efficiency improvements on the Potsdam dataset and domain generalization on the Enschede datasets. These results demonstrate the effectiveness of the proposed method in height map estimation from single-view remote sensing images.
引用
收藏
页码:2346 / 2364
页数:19
相关论文
共 50 条
  • [41] Large-Scale River Mapping Using Contrastive Learning and Multi-Source Satellite Imagery
    Wei, Zhihao
    Jia, Kebin
    Liu, Pengyu
    Jia, Xiaowei
    Xie, Yiqun
    Jiang, Zhe
    REMOTE SENSING, 2021, 13 (15)
  • [42] A flow-based multi-scale learning network for single image stochastic super-resolution
    Wu, Qianyu
    Hu, Zhongqian
    Zhu, Aichun
    Tang, Hui
    Zou, Jiaxin
    Xi, Yan
    Chen, Yang
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2024, 125
  • [43] GPS: graph contrastive learning via multi-scale augmented views from adversarial pooling
    Wei JU
    Yiyang GU
    Zhengyang MAO
    Ziyue QIAO
    Yifang QIN
    Xiao LUO
    Hui XIONG
    Ming ZHANG
    Science China(Information Sciences), 2025, 68 (01) : 145 - 158
  • [44] GPS: graph contrastive learning via multi-scale augmented views from adversarial pooling
    Ju, Wei
    Gu, Yiyang
    Mao, Zhengyang
    Qiao, Ziyue
    Qin, Yifang
    Luo, Xiao
    Xiong, Hui
    Zhang, Ming
    SCIENCE CHINA-INFORMATION SCIENCES, 2025, 68 (01)
  • [45] Multi-scale Learning based Malware Variant Detection using Spatial Pyramid Pooling Network
    Sriram, S.
    Vinayakumar, R.
    Sowmya, V
    Alazab, Mamoun
    Soman, K. P.
    IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2020, : 740 - 745
  • [46] Single image dehazing based on multi-scale segmentation and deep learning
    Yu, Tianhe
    Zhu, Ming
    Chen, Haiming
    MACHINE VISION AND APPLICATIONS, 2022, 33 (02)
  • [47] Single image dehazing based on multi-scale segmentation and deep learning
    Tianhe Yu
    Ming Zhu
    Haiming Chen
    Machine Vision and Applications, 2022, 33
  • [48] CT synthesis from MRI with an improved multi-scale learning network
    Li, Yan
    Xu, Sisi
    Lu, Yao
    Qi, Zhenyu
    FRONTIERS IN PHYSICS, 2023, 11
  • [49] Robust DOA Estimation Using Multi-Scale Fusion Network with Attention Mask
    Yan, Yuting
    Huang, Qinghua
    APPLIED SCIENCES-BASEL, 2024, 14 (11):
  • [50] Learning Multi-Scale Context Mask-RCNN Network for Slant Angled Aerial Imagery in Instance Segmentation in a Sim2Real setup
    Saadiyean, Qiranul
    Samprithi, S. P.
    Sundaram, Suresh
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2024), 2024, : 13573 - 13580