Height estimation from single aerial imagery using contrastive learning based multi-scale refinement network

被引:2
|
作者
Zhao, Wufan [1 ]
Ding, Hu [2 ]
Na, Jiaming [3 ]
Li, Mengmeng [4 ]
Tiede, Dirk [5 ]
机构
[1] Katholieke Univ Leuven, Fac Engn Technol, Dept Civil Engn, Geomat Sect, Leuven, Belgium
[2] South China Normal Univ, Sch Geog, Guangzhou 510631, Peoples R China
[3] Nanjing Forestry Univ, Coll Civil Engn, Nanjing 210037, Peoples R China
[4] Fuzhou Univ, Acad Digital China Fujian, Fuzhou, Peoples R China
[5] Paris Lodron Univ Salzburg, Fac Digital & Analyt Sci, Dept Geoinformat Z GIS, Salzburg, Austria
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Height estimation; aerial imagery; digital surface models; contrastive learning; local implicit constrain; EXTRACTION; STEREO;
D O I
10.1080/17538947.2023.2225881
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Height map estimation from a single aerial image plays a crucial role in localization, mapping, and 3D object detection. Deep convolutional neural networks have been used to predict height information from single-view remote sensing images, but these methods rely on large volumes of training data and often overlook geometric features present in orthographic images. To address these issues, this study proposes a gradient-based self-supervised learning network with momentum contrastive loss to extract geometric information from non-labeled images in the pretraining stage. Additionally, novel local implicit constraint layers are used at multiple decoding stages in the proposed supervised network to refine high-resolution features in height estimation. The structural-aware loss is also applied to improve the robustness of the network to positional shift and minor structural changes along the boundary area. Experimental evaluation on the ISPRS benchmark datasets shows that the proposed method outperforms other baseline networks, with minimum MAE and RMSE of 0.116 and 0.289 for the Vaihingen dataset and 0.077 and 0.481 for the Potsdam dataset, respectively. The proposed method also shows around threefold data efficiency improvements on the Potsdam dataset and domain generalization on the Enschede datasets. These results demonstrate the effectiveness of the proposed method in height map estimation from single-view remote sensing images.
引用
收藏
页码:2346 / 2364
页数:19
相关论文
共 50 条
  • [31] IM2ELEVATION: Building Height Estimation from Single-View Aerial Imagery
    Liu, Chao-Jung
    Krylov, Vladimir A.
    Kane, Paul
    Kavanagh, Geraldine
    Dahyot, Rozenn
    REMOTE SENSING, 2020, 12 (17)
  • [32] Head Pose Estimation Based on Multi-Scale Convolutional Neural Network
    Liang Lingyu
    Zhang Tiantian
    He Wei
    LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (13)
  • [33] Multi-task contrastive learning for semi-supervised medical image segmentation with multi-scale uncertainty estimation
    Xing, Chengcheng
    Dong, Haoji
    Xi, Heran
    Ma, Jiquan
    Zhu, Jinghua
    PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (18):
  • [34] Cotton Yield Estimation From Aerial Imagery Using Machine Learning Approaches
    Rodriguez-Sanchez, Javier
    Li, Changying
    Paterson, Andrew H.
    FRONTIERS IN PLANT SCIENCE, 2022, 13
  • [35] An aerial image segmentation approach based on enhanced multi-scale convolutional neural network
    Li, Xiang
    Jiang, Yuchen
    Peng, Hu
    Yin, Shen
    2019 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL CYBER PHYSICAL SYSTEMS (ICPS 2019), 2019, : 47 - 52
  • [36] Continuous Blood Pressure Estimation Based on Multi-Scale Feature Extraction by the Neural Network With Multi-Task Learning
    Jiang, Hengbing
    Zou, Lili
    Huang, Dequn
    Feng, Qianjin
    FRONTIERS IN NEUROSCIENCE, 2022, 16
  • [37] Multi-scale spatiotemporal attention network for neuron based motor imagery EEG classification
    Chunduri, Venkata
    Aoudni, Yassine
    Khan, Samiullah
    Aziz, Abdul
    Rizwan, Ali
    Deb, Nabamita
    Keshta, Ismail
    Soni, Mukesh
    JOURNAL OF NEUROSCIENCE METHODS, 2024, 406
  • [38] Contrastive learning-based general Deepfake detection with multi-scale RGB frequency clues
    Dong, Fengkai
    Zou, Xiaoqiang
    Wang, Jiahui
    Liu, Xiyao
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (04) : 90 - 99
  • [39] Orthorectification of Large Datasets of Multi-scale Archival Aerial Imagery: A Case Study from Türkiye
    Xin Hong
    Christopher H. Roosevelt
    Journal of Geovisualization and Spatial Analysis, 2023, 7
  • [40] Depth Estimation from Multi-scale SLIC Superpixels Using Non-parametric Learning
    Jiang, Yifeng
    Zhu, Yuesheng
    Qing, Yin
    Yang, Fan
    NINTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2017), 2017, 10420