Self-Supervised Monocular Depth Estimation with Extensive Pretraining

被引:0
|
作者
Choi, Hyukdoo [1 ,2 ]
机构
[1] Department of Electronics and Information Engineering, Soonchunhyang University, Asan, Korea, Republic of
[2] Department of Electronic Materials and Devices Engineering, Soonchunhyang University, Asan,31538, Korea, Republic of
基金
新加坡国家研究基金会;
关键词
Unsupervised learning - Stereo image processing - Supervised learning - Convolutional neural networks - Optical radar;
D O I
暂无
中图分类号
学科分类号
摘要
Although depth estimation is a key technology for three-dimensional sensing applications involving motion, active sensors such as LiDAR and depth cameras tend to be expensive and bulky. Here, we explore the potential of monocular depth estimation (MDE) based on a self-supervised approach. MDE is a promising technology, but supervised learning suffers from a need for accurate ground-truth depth data. Recent studies have enabled self-supervised training on an MDE model with only monocular image sequences and image-reconstruction errors. We pretrained networks using multiple datasets, including monocular and stereo image sequences. The main challenges posed by the self-supervised MDE model were occlusions and dynamic objects. We proposed novel loss functions to handle these problems in the form of min-over-all and min-with-flow losses, both based on the per-pixel minimum reprojection error of Monodepth2 and extended to stereo images and optical flow. With extensive pretraining and novel losses, our model outperformed existing unsupervised approaches in quantitative depth estimation and the ability to distinguish small objects against a background, as evaluated by KITTI 2015. © 2013 IEEE.
引用
收藏
页码:157236 / 157246
相关论文
共 50 条
  • [21] Self-Supervised Monocular Depth Estimation by Digging into Uncertainty Quantification
    Li, Yuan-Zhen
    Zheng, Sheng-Jie
    Tan, Zi-Xin
    Cao, Tuo
    Luo, Fei
    Xiao, Chun-Xia
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, 38 (03) : 510 - 525
  • [22] Self-supervised monocular image depth learning and confidence estimation
    Chen, Long
    Tang, Wen
    Wan, Tao Ruan
    John, Nigel W.
    NEUROCOMPUTING, 2020, 381 : 272 - 281
  • [23] Self-supervised Learning for Dense Depth Estimation in Monocular Endoscopy
    Liu, Xingtong
    Sinha, Ayushi
    Unberath, Mathias
    Ishii, Masaru
    Hager, Gregory D.
    Taylor, Russell H.
    Reiter, Austin
    OR 2.0 CONTEXT-AWARE OPERATING THEATERS, COMPUTER ASSISTED ROBOTIC ENDOSCOPY, CLINICAL IMAGE-BASED PROCEDURES, AND SKIN IMAGE ANALYSIS, OR 2.0 2018, 2018, 11041 : 128 - 138
  • [24] Graph semantic information for self-supervised monocular depth estimation
    Zhang, Dongdong
    Wang, Chunping
    Wang, Huiying
    Fu, Qiang
    PATTERN RECOGNITION, 2024, 156
  • [25] Image Masking for Robust Self-Supervised Monocular Depth Estimation
    Chawla, Hemang
    Jeeveswaran, Kishaan
    Arani, Elahe
    Zonooz, Bahram
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 10054 - 10060
  • [26] Self-Supervised Monocular Depth Estimation by Digging into Uncertainty Quantification
    Yuan-Zhen Li
    Sheng-Jie Zheng
    Zi-Xin Tan
    Tuo Cao
    Fei Luo
    Chun-Xia Xiao
    Journal of Computer Science and Technology, 2023, 38 : 510 - 525
  • [27] Exploring the vulnerability of self-supervised monocular depth estimation models
    Hou, Ruitao
    Mo, Kanghua
    Long, Yucheng
    Li, Ning
    Rao, Yuan
    INFORMATION SCIENCES, 2024, 677
  • [28] Self-Supervised Scale Recovery for Monocular Depth and Egomotion Estimation
    Wagstaff, Brandon
    Kelly, Jonathan
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 2620 - 2627
  • [29] Self-supervised Monocular Depth Estimation on Unseen Synthetic Cameras
    Diana-Albelda, Cecilia
    Bravo Perez-Villar, Juan Ignacio
    Montalvo, Javier
    Garcia-Martin, Alvaro
    Bescos Cano, Jesus
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2023, PT I, 2024, 14469 : 449 - 463
  • [30] MonoViT: Self-Supervised Monocular Depth Estimation with a Vision Transformer
    Zhao, Chaoqiang
    Zhang, Youmin
    Poggi, Matteo
    Tosi, Fabio
    Guo, Xianda
    Zhu, Zheng
    Huang, Guan
    Tang, Yang
    Mattoccia, Stefano
    2022 INTERNATIONAL CONFERENCE ON 3D VISION, 3DV, 2022, : 668 - 678