Height estimation from single aerial imagery using contrastive learning based multi-scale refinement network

Cited by: 2
Authors
Zhao, Wufan [1]
Ding, Hu [2]
Na, Jiaming [3]
Li, Mengmeng [4]
Tiede, Dirk [5]
Affiliations
[1] Katholieke Univ Leuven, Fac Engn Technol, Dept Civil Engn, Geomat Sect, Leuven, Belgium
[2] South China Normal Univ, Sch Geog, Guangzhou 510631, Peoples R China
[3] Nanjing Forestry Univ, Coll Civil Engn, Nanjing 210037, Peoples R China
[4] Fuzhou Univ, Acad Digital China Fujian, Fuzhou, Peoples R China
[5] Paris Lodron Univ Salzburg, Fac Digital & Analyt Sci, Dept Geoinformat Z GIS, Salzburg, Austria
Funding
China Postdoctoral Science Foundation; National Natural Science Foundation of China
Keywords
Height estimation; aerial imagery; digital surface models; contrastive learning; local implicit constraint; EXTRACTION; STEREO
DOI
10.1080/17538947.2023.2225881
Chinese Library Classification number
P9 [Physical Geography]
Discipline classification code
0705; 070501
Abstract
Height map estimation from a single aerial image plays a crucial role in localization, mapping, and 3D object detection. Deep convolutional neural networks have been used to predict height information from single-view remote sensing images, but these methods rely on large volumes of training data and often overlook geometric features present in orthographic images. To address these issues, this study proposes a gradient-based self-supervised learning network with a momentum contrastive loss to extract geometric information from unlabeled images in the pretraining stage. Additionally, novel local implicit constraint layers are used at multiple decoding stages in the proposed supervised network to refine high-resolution features in height estimation. A structure-aware loss is also applied to improve the robustness of the network to positional shifts and minor structural changes along boundary areas. Experimental evaluation on the ISPRS benchmark datasets shows that the proposed method outperforms other baseline networks, with minimum MAE and RMSE of 0.116 and 0.289 for the Vaihingen dataset and 0.077 and 0.481 for the Potsdam dataset, respectively. The proposed method also shows a roughly threefold improvement in data efficiency on the Potsdam dataset and domain generalization on the Enschede datasets. These results demonstrate the effectiveness of the proposed method in height map estimation from single-view remote sensing images.
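The abstract names two generic building blocks that are easy to illustrate: a momentum contrastive loss for pretraining, and MAE/RMSE for evaluation. The following is a minimal sketch under stated assumptions, not the authors' implementation. The abstract does not give the exact loss form, temperature, momentum coefficient, or metric normalization; the sketch uses the standard InfoNCE loss with a momentum-updated key encoder, and the default values 0.07 and 0.999 are commonly used settings chosen here purely for illustration. All function names are hypothetical.

```python
import math


def mae(pred, target):
    """Mean absolute error between predicted and reference heights."""
    if len(pred) != len(target) or not pred:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)


def rmse(pred, target):
    """Root mean squared error between predicted and reference heights."""
    if len(pred) != len(target) or not pred:
        raise ValueError("inputs must be non-empty and of equal length")
    return math.sqrt(sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred))


def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def _normalize(v):
    norm = math.sqrt(_dot(v, v))
    return [x / norm for x in v]


def info_nce(query, positive_key, negative_keys, temperature=0.07):
    """InfoNCE loss for a single query embedding.

    Assumption: cosine similarity scaled by a temperature, with the
    positive key at index 0 of the logits (standard contrastive setup).
    """
    q = _normalize(query)
    logits = [_dot(q, _normalize(positive_key)) / temperature]
    logits += [_dot(q, _normalize(k)) / temperature for k in negative_keys]
    # Numerically stable log-sum-exp over all logits.
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(l - m) for l in logits))
    # Cross-entropy with the positive key as the correct class.
    return log_sum_exp - logits[0]


def momentum_update(key_params, query_params, m=0.999):
    """Exponential moving average update of the key encoder parameters."""
    return [m * k + (1.0 - m) * q for k, q in zip(key_params, query_params)]


if __name__ == "__main__":
    # Toy height values, not taken from the paper.
    print(mae([1.0, 2.0, 4.0], [1.0, 3.0, 2.0]))
    print(rmse([1.0, 2.0, 4.0], [1.0, 3.0, 2.0]))
    print(info_nce([1.0, 0.0], [1.0, 0.0], [[0.0, 1.0]]))
```

In this sketch the loss is small when the query is closer to its positive key than to the negatives, and the key encoder drifts slowly toward the query encoder, which is the general idea behind momentum contrast.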
Pages: 2346-2364
Page count: 19
Related papers
50 in total
  • [21] Height estimation from single aerial images using a deep convolutional encoder-decoder network
    Amirkolaee, Hamed Amini
    Arefi, Hossein
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2019, 149 : 50 - 66
  • [22] Robust Object Detection in Aerial Imagery Based on Multi-Scale Detector and Soft Densely Connected
    Zhang, Miaohui
    Zhang, Bo
    Liu, Mengya
    Xin, Ming
    IEEE ACCESS, 2020, 8 : 92791 - 92801
  • [23] Deep Multi-scale Convolutional Neural Network Method for Depth Estimation from a Single Image
    Ma, Zhaowei
    Niu, Yifeng
    Hu, Jia
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 3984 - 3988
  • [24] Multi illumination color constancy based on multi-scale supervision and single-scale estimation cascade convolution neural network
    Wang, Fei
    Wang, Wei
    Wu, Dan
    Gao, Guowang
    Wang, Zetian
    FRONTIERS IN NEUROINFORMATICS, 2022, 16
  • [25] Multi-scale Face Detection Based on Single Neural Network
    Liu Hongzhe
    Yang Shaopeng
    Yuan Jiazheng
    Wang Xueqiao
    Xue Jianming
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2018, 40 (11) : 2598 - 2605
  • [26] A multi-scale sentiment recognition network based on deep learning
    Zhang, Ning
    Zhang, Xiufeng
    2023 3RD ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS TECHNOLOGY AND COMPUTER SCIENCE, ACCTCS, 2023, : 526 - 530
  • [27] Using multi-scale product spectrum for single and multi-pitch estimation
    Messaoud, M. A. B.
    Bouzid, A.
    Ellouze, N.
    IET SIGNAL PROCESSING, 2011, 5 (03) : 344 - 355
  • [28] Orthorectification of Large Datasets of Multi-scale Archival Aerial Imagery: A Case Study from Turkiye
    Hong, Xin
    Roosevelt, Christopher H. H.
    JOURNAL OF GEOVISUALIZATION AND SPATIAL ANALYSIS, 2023, 7 (02)
  • [29] Multi-scale estimation of poverty rate using night-time light imagery
    Shao, Zixuan
    Li, Xi
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2023, 121
  • [30] Depth Map Prediction from a Single Image using a Multi-Scale Deep Network
    Eigen, David
    Puhrsch, Christian
    Fergus, Rob
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27