Height estimation from single aerial imagery using contrastive learning based multi-scale refinement network

被引：2

作者：

Zhao, Wufan ^{[1
]}

Ding, Hu ^{[2
]}

Na, Jiaming ^{[3
]}

Li, Mengmeng ^{[4
]}

Tiede, Dirk ^{[5
]}

机构：

[1] Katholieke Univ Leuven, Fac Engn Technol, Dept Civil Engn, Geomat Sect, Leuven, Belgium

[2] South China Normal Univ, Sch Geog, Guangzhou 510631, Peoples R China

[3] Nanjing Forestry Univ, Coll Civil Engn, Nanjing 210037, Peoples R China

[4] Fuzhou Univ, Acad Digital China Fujian, Fuzhou, Peoples R China

[5] Paris Lodron Univ Salzburg, Fac Digital & Analyt Sci, Dept Geoinformat Z GIS, Salzburg, Austria

来源：

INTERNATIONAL JOURNAL OF DIGITAL EARTH | 2023年 / 16卷 / 01期

基金：

中国博士后科学基金; 中国国家自然科学基金;

关键词：

Height estimation; aerial imagery; digital surface models; contrastive learning; local implicit constrain; EXTRACTION; STEREO;

D O I：

10.1080/17538947.2023.2225881

中图分类号：

P9 [自然地理学];

学科分类号：

0705 ; 070501 ;

摘要：

Height map estimation from a single aerial image plays a crucial role in localization, mapping, and 3D object detection. Deep convolutional neural networks have been used to predict height information from single-view remote sensing images, but these methods rely on large volumes of training data and often overlook geometric features present in orthographic images. To address these issues, this study proposes a gradient-based self-supervised learning network with momentum contrastive loss to extract geometric information from non-labeled images in the pretraining stage. Additionally, novel local implicit constraint layers are used at multiple decoding stages in the proposed supervised network to refine high-resolution features in height estimation. The structural-aware loss is also applied to improve the robustness of the network to positional shift and minor structural changes along the boundary area. Experimental evaluation on the ISPRS benchmark datasets shows that the proposed method outperforms other baseline networks, with minimum MAE and RMSE of 0.116 and 0.289 for the Vaihingen dataset and 0.077 and 0.481 for the Potsdam dataset, respectively. The proposed method also shows around threefold data efficiency improvements on the Potsdam dataset and domain generalization on the Enschede datasets. These results demonstrate the effectiveness of the proposed method in height map estimation from single-view remote sensing images.

引用

页码：2346 / 2364

页数：19

共 50 条

[1] Multi-Scale Contrastive Learning for Human Pose Estimation
Bao, Wenxia
Lin, An
Huang, Hua
Yang, Xianjun
Chen, Hemu
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2024, E107D (10) : 1332 - 1341
[2] Water extraction from optical high-resolution remote sensing imagery: a multi-scale feature extraction network with contrastive learning
Liu, Bo
Du, Shihong
Bai, Lubin
Ouyang, Song
Wang, Haoyu
Zhang, Xiuyuan
GISCIENCE & REMOTE SENSING, 2023, 60 (01)
[3] Motor Imagery EEG Classification Based on Transfer Learning and Multi-Scale Convolution Network
Chang, Zhanyuan
Zhang, Congcong
Li, Chuanjiang
MICROMACHINES, 2022, 13 (06)
[4] Multi-scale and multi-modal contrastive learning network for biomedical time series
Guo, Hongbo
Xu, Xinzi
Wu, Hao
Liu, Bin
Xia, Jiahui
Cheng, Yibang
Guo, Qianhui
Chen, Yi
Xu, Tingyan
Wang, Jiguang
Wang, Guoxing
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 106
[5] MULTI-SCALE REFINEMENT NETWORK BASED ACOUSTIC ECHO CANCELLATION
Cui, Fan
Guo, Liyong
Li, Wenfeng
Gao, Peng
Wang, Yujun
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 9132 - 9136
[6] Height Estimation From Single Aerial Images Using a Deep Ordinal Regression Network
Li, Xiang
Wang, Mingyang
Fang, Yi
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[7] SDMNet: Spatially dilated multi-scale network for object detection for drone aerial imagery
Battish, Neeraj
Kaur, Dapinder
Chugh, Moksh
Poddar, Shashi
IMAGE AND VISION COMPUTING, 2024, 150
[8] Selective Learning of Human Pose Estimation Based on Multi-Scale Convergence Network
Liu, Wenkai
Qin, Cuizhu
Wu, Menglong
Bai, Wenle
Dong, Hongxia
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2023, E106D (05) : 1081 - 1084
[9] Image manipulation detection and localization using multi-scale contrastive learning
Bai, Ruyi
APPLIED SOFT COMPUTING, 2024, 163
[10] Unsupervised dehazing of multi-scale residuals based on weighted contrastive learning
Wang, Jianing
Zhang, Yongsheng
Liu, Zuoyang
SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (06)

← 1 2 3 4 5 →