Height estimation from single aerial imagery using contrastive learning based multi-scale refinement network

被引：2

作者：

Zhao, Wufan ^{[1
]}

Ding, Hu ^{[2
]}

Na, Jiaming ^{[3
]}

Li, Mengmeng ^{[4
]}

Tiede, Dirk ^{[5
]}

机构：

[1] Katholieke Univ Leuven, Fac Engn Technol, Dept Civil Engn, Geomat Sect, Leuven, Belgium

[2] South China Normal Univ, Sch Geog, Guangzhou 510631, Peoples R China

[3] Nanjing Forestry Univ, Coll Civil Engn, Nanjing 210037, Peoples R China

[4] Fuzhou Univ, Acad Digital China Fujian, Fuzhou, Peoples R China

[5] Paris Lodron Univ Salzburg, Fac Digital & Analyt Sci, Dept Geoinformat Z GIS, Salzburg, Austria

来源：

INTERNATIONAL JOURNAL OF DIGITAL EARTH | 2023年 / 16卷 / 01期

基金：

中国博士后科学基金; 中国国家自然科学基金;

关键词：

Height estimation; aerial imagery; digital surface models; contrastive learning; local implicit constrain; EXTRACTION; STEREO;

D O I：

10.1080/17538947.2023.2225881

中图分类号：

P9 [自然地理学];

学科分类号：

0705 ; 070501 ;

摘要：

Height map estimation from a single aerial image plays a crucial role in localization, mapping, and 3D object detection. Deep convolutional neural networks have been used to predict height information from single-view remote sensing images, but these methods rely on large volumes of training data and often overlook geometric features present in orthographic images. To address these issues, this study proposes a gradient-based self-supervised learning network with momentum contrastive loss to extract geometric information from non-labeled images in the pretraining stage. Additionally, novel local implicit constraint layers are used at multiple decoding stages in the proposed supervised network to refine high-resolution features in height estimation. The structural-aware loss is also applied to improve the robustness of the network to positional shift and minor structural changes along the boundary area. Experimental evaluation on the ISPRS benchmark datasets shows that the proposed method outperforms other baseline networks, with minimum MAE and RMSE of 0.116 and 0.289 for the Vaihingen dataset and 0.077 and 0.481 for the Potsdam dataset, respectively. The proposed method also shows around threefold data efficiency improvements on the Potsdam dataset and domain generalization on the Enschede datasets. These results demonstrate the effectiveness of the proposed method in height map estimation from single-view remote sensing images.

引用

页码：2346 / 2364

页数：19

共 50 条

[41] Large-Scale River Mapping Using Contrastive Learning and Multi-Source Satellite Imagery
Wei, Zhihao
Jia, Kebin
Liu, Pengyu
Jia, Xiaowei
Xie, Yiqun
Jiang, Zhe
REMOTE SENSING, 2021, 13 (15)
[42] A flow-based multi-scale learning network for single image stochastic super-resolution
Wu, Qianyu
Hu, Zhongqian
Zhu, Aichun
Tang, Hui
Zou, Jiaxin
Xi, Yan
Chen, Yang
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2024, 125
[43] GPS: graph contrastive learning via multi-scale augmented views from adversarial pooling
Wei JU
Yiyang GU
Zhengyang MAO
Ziyue QIAO
Yifang QIN
Xiao LUO
Hui XIONG
Ming ZHANG
Science China(Information Sciences), 2025, 68 (01) : 145 - 158
[44] GPS: graph contrastive learning via multi-scale augmented views from adversarial pooling
Ju, Wei
Gu, Yiyang
Mao, Zhengyang
Qiao, Ziyue
Qin, Yifang
Luo, Xiao
Xiong, Hui
Zhang, Ming
SCIENCE CHINA-INFORMATION SCIENCES, 2025, 68 (01)
[45] Multi-scale Learning based Malware Variant Detection using Spatial Pyramid Pooling Network
Sriram, S.
Vinayakumar, R.
Sowmya, V
Alazab, Mamoun
Soman, K. P.
IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2020, : 740 - 745
[46] Single image dehazing based on multi-scale segmentation and deep learning
Yu, Tianhe
Zhu, Ming
Chen, Haiming
MACHINE VISION AND APPLICATIONS, 2022, 33 (02)
[47] Single image dehazing based on multi-scale segmentation and deep learning
Tianhe Yu
Ming Zhu
Haiming Chen
Machine Vision and Applications, 2022, 33
[48] CT synthesis from MRI with an improved multi-scale learning network
Li, Yan
Xu, Sisi
Lu, Yao
Qi, Zhenyu
FRONTIERS IN PHYSICS, 2023, 11
[49] Robust DOA Estimation Using Multi-Scale Fusion Network with Attention Mask
Yan, Yuting
Huang, Qinghua
APPLIED SCIENCES-BASEL, 2024, 14 (11):
[50] Learning Multi-Scale Context Mask-RCNN Network for Slant Angled Aerial Imagery in Instance Segmentation in a Sim2Real setup
Saadiyean, Qiranul
Samprithi, S. P.
Sundaram, Suresh
2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2024), 2024, : 13573 - 13580

← 1 2 3 4 5 →