Monocular Depth Estimation Using Laplacian Pyramid-Based Depth Residuals

被引:139
|
作者
Song, Minsoo [1 ]
Lim, Seokjae [1 ]
Kim, Wonjun [1 ]
机构
[1] Konkuk Univ, Dept Elect & Elect Engn, Seoul 05029, South Korea
关键词
Estimation; Laplace equations; Decoding; Feature extraction; Convolution; Color; Image reconstruction; Monocular depth estimation; depth residuals; depth boundary; Laplacian pyramid; weight standardization; IMAGE; MODEL;
D O I
10.1109/TCSVT.2021.3049869
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With a great success of the generative model via deep neural networks, monocular depth estimation has been actively studied by exploiting various encoder-decoder architectures. However, the decoding process in most previous methods, which repeats simple up-sampling operations, probably fails to fully utilize underlying properties of well-encoded features for monocular depth estimation. To resolve this problem, we propose a simple but effective scheme by incorporating the Laplacian pyramid into the decoder architecture. Specifically, encoded features are fed into different streams for decoding depth residuals, which are defined by decomposition of the Laplacian pyramid, and corresponding outputs are progressively combined to reconstruct the final depth map from coarse to fine scales. This is fairly desirable to precisely estimate the depth boundary as well as the global layout. We also propose to apply weight standardization to pre-activation convolution blocks of the decoder architecture, which gives a great help to improve the flow of gradients and thus makes optimization easier. Experimental results on benchmark datasets constructed under various indoor and outdoor environments demonstrate that the proposed method is effective for monocular depth estimation compared to state-of-the-art models. The code and model are publicly available at: |https://github.com/tjqansthd/LapDepth-release|.
引用
收藏
页码:4381 / 4393
页数:13
相关论文
共 50 条
  • [1] Monocular Depth Estimation Using a Laplacian Image Pyramid with Local Planar Guidance Layers
    Choi, Youn-Ho
    Kee, Seok-Cheol
    SENSORS, 2023, 23 (02)
  • [2] Promoting Monocular Depth Estimation by Multi-Scale Residual Laplacian Pyramid Fusion
    Zhang, Anmei
    Ma, Yunchao
    Liu, Jiangyu
    Sun, Jian
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 205 - 209
  • [3] Depth Estimation of Monocular Road Images Based on Pyramid Scene Analysis Network
    Zhou Wujie
    Pan Ting
    Gu Pengli
    Zhai Zhinian
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2019, 41 (10) : 2509 - 2515
  • [4] Monocular Depth Estimation Using Relative Depth Maps
    Lee, Jae-Han
    Kim, Chang-Su
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9721 - 9730
  • [5] DCPNet: A Densely Connected Pyramid Network for Monocular Depth Estimation
    Lai, Zhitong
    Tian, Rui
    Wu, Zhiguo
    Ding, Nannan
    Sun, Linjian
    Wang, Yanjie
    SENSORS, 2021, 21 (20)
  • [6] Tl-depth: monocular depth estimation based on tower connections and Laplacian-filtering residual completion
    Zhang, Qi
    Song, Yuqin
    Lou, Hui
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (17): : 25099 - 25117
  • [7] Monocular Image Depth Estimation using Dilated Convolution and Spatial Pyramid Polling Structure
    Ding, Yinzhang
    Lin, Lu
    Wang, Lianghao
    Zhang, Ming
    Li, Dongxiao
    Ma, Haojie
    TENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2018), 2019, 11069
  • [8] Mixed-Scale Unet Based on Dense Atrous Pyramid for Monocular Depth Estimation
    Yang, Yifan
    Wang, Yuqing
    Zhu, Chenhao
    Zhu, Ming
    Sun, Haijiang
    Yan, Tianze
    IEEE ACCESS, 2021, 9 : 114070 - 114084
  • [9] Multilevel Pyramid Network for Monocular Depth Estimation Based on Feature Refinement and Adaptive Fusion
    Xu, Huihui
    Li, Fei
    ELECTRONICS, 2022, 11 (16)
  • [10] Structure-Aware Residual Pyramid Network for Monocular Depth Estimation
    Chen, Xiaotian
    Chen, Xuejin
    Zha, Zheng-Jun
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 694 - 700