Pyramid frequency network with spatial attention residual refinement module for monocular depth estimation

Cited: 9
Authors
Lu, Zhengyang [1]
Chen, Ying [1]
Affiliations
[1] Jiangnan Univ, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
monocular depth estimation; three-dimensional reconstruction; frequency domain; convolutional neural network;
DOI
10.1117/1.JEI.31.2.023005
CLC Classification
TM [Electrical Technology]; TN [Electronic and Communication Technology]
Discipline Codes
0808; 0809
Abstract
Deep-learning-based approaches to depth estimation are advancing rapidly, offering performance superior to existing methods. To estimate depth in real-world scenarios, depth estimation models must be robust to a variety of noise environments. We propose a pyramid frequency network (PFN) with a spatial attention residual refinement module (SARRM) to address the weak robustness of existing deep-learning methods. To reconstruct depth maps with accurate details, the SARRM applies a residual fusion method with an attention mechanism to refine blurred depth. A frequency-division strategy is designed, and a frequency pyramid network is developed to extract features from multiple frequency bands. With this frequency strategy, PFN achieves better visual accuracy than state-of-the-art methods in both indoor and outdoor scenes on the Make3D, KITTI depth, and NYUv2 datasets. Additional experiments on a noisy NYUv2 dataset demonstrate that PFN is more reliable than existing deep-learning methods in high-noise scenes. (C) 2022 SPIE and IS&T
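A minimal sketch may help illustrate the two ideas the abstract describes: attention-gated residual refinement of a coarse depth map (the SARRM idea) and splitting an input into frequency bands before feature extraction (the frequency-division idea). The PyTorch code below is an illustrative assumption, not the paper's actual PFN or SARRM; the module names, channel sizes, and the Gaussian-blur band split are all hypothetical.

import torch
import torch.nn as nn
import torch.nn.functional as F

class SpatialAttentionRefine(nn.Module):
    # Hypothetical stand-in for a SARRM-style block: predicts a depth
    # residual and fuses it into the coarse depth through a spatial gate.
    def __init__(self, feat_ch=32):
        super().__init__()
        self.residual = nn.Sequential(
            nn.Conv2d(feat_ch + 1, feat_ch, 3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(feat_ch, 1, 3, padding=1),
        )
        # Spatial attention computed from channel-wise max/mean statistics.
        self.attn = nn.Conv2d(2, 1, 7, padding=3)

    def forward(self, feat, coarse_depth):
        res = self.residual(torch.cat([feat, coarse_depth], dim=1))
        stats = torch.cat(
            [feat.max(dim=1, keepdim=True).values,
             feat.mean(dim=1, keepdim=True)], dim=1)
        gate = torch.sigmoid(self.attn(stats))
        return coarse_depth + gate * res  # attention-gated residual fusion

def split_frequency_bands(img, kernel=9, sigma=3.0):
    # Assumed two-band split: a Gaussian blur gives the low band and the
    # remainder is the high band. A real pyramid would repeat this per level.
    coords = torch.arange(kernel, dtype=torch.float32) - kernel // 2
    g = torch.exp(-coords ** 2 / (2 * sigma ** 2))
    g = g / g.sum()
    k2d = (g[:, None] * g[None, :]).expand(img.shape[1], 1, kernel, kernel)
    low = F.conv2d(img, k2d, padding=kernel // 2, groups=img.shape[1])
    return low, img - low

# Example usage on random tensors:
# low, high = split_frequency_bands(torch.randn(2, 3, 64, 64))
# refined = SpatialAttentionRefine(32)(torch.randn(2, 32, 64, 64),
#                                      torch.randn(2, 1, 64, 64))

Gating the residual spatially lets the refinement concentrate on blurred regions while leaving confident areas of the coarse prediction unchanged, which matches the intuition the abstract attributes to the SARRM.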
Pages: 18
Related Papers
50 records in total
  • [31] Dermoscopic image segmentation based on Pyramid Residual Attention Module
    Jiang, Yun
    Cheng, Tongtong
    Dong, Jinkun
    Liang, Jing
    Zhang, Yuan
    Lin, Xin
    Yao, Huixia
[J]. PLOS ONE, 2022, 17 (09)
  • [32] LAM-Depth: Laplace-Attention Module-Based Self-Supervised Monocular Depth Estimation
    Wei, Jiansheng
    Pan, Shuguo
    Gao, Wang
    Guo, Peng
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024
  • [34] Efficient unsupervised monocular depth estimation using attention guided generative adversarial network
    Bhattacharyya, Sumanta
    Shen, Ju
    Welch, Stephen
    Chen, Chen
    [J]. JOURNAL OF REAL-TIME IMAGE PROCESSING, 2021, 18 (04) : 1357 - 1368
  • [36] Triaxial Squeeze Attention Module and Mutual-Exclusion Loss Based Unsupervised Monocular Depth Estimation
    Wei, Jiansheng
    Pan, Shuguo
    Gao, Wang
    Zhao, Tao
    [J]. NEURAL PROCESSING LETTERS, 2022, 54 (05) : 4375 - 4390
  • [37] Trap Attention: Monocular Depth Estimation with Manual Traps
    Ning, Chao
    Gan, Hongping
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 5033 - 5043
  • [38] Self-supervised coarse-to-fine monocular depth estimation using a lightweight attention module
    Li, Yuanzhen
    Luo, Fei
    Xiao, Chunxia
    [J]. COMPUTATIONAL VISUAL MEDIA, 2022, 8 (04) : 631 - 647
  • [40] CATNet: Convolutional attention and transformer for monocular depth estimation
    Tang, Shuai
    Lu, Tongwei
    Liu, Xuanxuan
    Zhou, Huabing
    Zhang, Yanduo
    [J]. PATTERN RECOGNITION, 2024, 145