Learning Occlusion-aware Coarse-to-Fine Depth Map for Self-supervised Monocular Depth Estimation

被引:7
|
作者
Zhou, Zhengming
Dong, Qiulei [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Monocular depth estimation; self-supervised learning; neural network;
D O I
10.1145/3503161.3548381
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Self-supervised monocular depth estimation, aiming to learn scene depths from single images in a self-supervised manner, has received much attention recently. In spite of recent efforts in this field, how to learn accurate scene depths and alleviate the negative influence of occlusions for self-supervised depth estimation, still remains an open problem. Addressing this problem, we firstly empirically analyze the effects of both the continuous and discrete depth constraints which are widely used in the training process of many existing works. Then inspired by the above empirical analysis, we propose a novel network to learn an Occlusion-aware Coarse-to-Fine Depth map for self-supervised monocular depth estimation, called OCFD-Net. Given an arbitrary training set of stereo image pairs, the proposed OCFD-Net does not only employ a discrete depth constraint for learning a coarse-level depth map, but also employ a continuous depth constraint for learning a scene depth residual, resulting in a fine-level depth map. In addition, an occlusion-aware module is designed under the proposed OCFD-Net, which is able to improve the capability of the learnt fine-level depth map for handling occlusions. Experimental results on KITTI demonstrate that the proposed method outperforms the comparative state-of-the-art methods under seven commonly used metrics in most cases. In addition, experimental results on Make3D demonstrate the effectiveness of the proposed method in terms of the cross-dataset generalization ability under four commonly used metrics. The code is available at https://github.com/ZM-Zhou/OCFD-Net_pytorch.
引用
收藏
页码:6386 / 6395
页数:10
相关论文
共 50 条
  • [1] Self-supervised coarse-to-fine monocular depth estimation using a lightweight attention module
    Yuanzhen Li
    Fei Luo
    Chunxia Xiao
    [J]. Computational Visual Media, 2022, 8 : 631 - 647
  • [2] Self-supervised coarse-to-fine monocular depth estimation using a lightweight attention module
    Li, Yuanzhen
    Luo, Fei
    Xiao, Chunxia
    [J]. COMPUTATIONAL VISUAL MEDIA, 2022, 8 (04) : 631 - 647
  • [3] REAL-TIME COARSE-TO-FINE DEPTH ESTIMATION ON STEREO ENDOSCOPIC IMAGES WITH SELF-SUPERVISED LEARNING
    Yang, Haotian
    Kahrs, Lueder A.
    [J]. 2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2021, : 733 - 737
  • [4] Frequency-Aware Self-Supervised Monocular Depth Estimation
    Chen, Xingyu
    Li, Thomas H.
    Zhang, Ruonan
    Li, Ge
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5797 - 5806
  • [5] DPDFormer: A Coarse-to-Fine Model for Monocular Depth Estimation
    Liu, Chunpu
    Yang, Guanglei
    Zuo, Wangmeng
    Zang, Tianyi
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (05)
  • [6] Self-supervised monocular depth estimation with occlusion mask and edge awareness
    Zhou, Shi
    Zhu, Miaomiao
    Li, Zhen
    Li, He
    Mizumachi, Mitsunori
    Zhang, Lifeng
    [J]. ARTIFICIAL LIFE AND ROBOTICS, 2021, 26 (03) : 354 - 359
  • [7] Self-supervised monocular depth estimation with occlusion mask and edge awareness
    Shi Zhou
    Miaomiao Zhu
    Zhen Li
    He Li
    Mitsunori Mizumachi
    Lifeng Zhang
    [J]. Artificial Life and Robotics, 2021, 26 : 354 - 359
  • [8] Self-supervised monocular image depth learning and confidence estimation
    Chen, Long
    Tang, Wen
    Wan, Tao Ruan
    John, Nigel W.
    [J]. NEUROCOMPUTING, 2020, 381 : 272 - 281
  • [9] Self-supervised Learning for Dense Depth Estimation in Monocular Endoscopy
    Liu, Xingtong
    Sinha, Ayushi
    Unberath, Mathias
    Ishii, Masaru
    Hager, Gregory D.
    Taylor, Russell H.
    Reiter, Austin
    [J]. OR 2.0 CONTEXT-AWARE OPERATING THEATERS, COMPUTER ASSISTED ROBOTIC ENDOSCOPY, CLINICAL IMAGE-BASED PROCEDURES, AND SKIN IMAGE ANALYSIS, OR 2.0 2018, 2018, 11041 : 128 - 138
  • [10] Digging Into Self-Supervised Monocular Depth Estimation
    Godard, Clement
    Mac Aodha, Oisin
    Firman, Michael
    Brostow, Gabriel
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3827 - 3837