Unsupervised Monocular Depth Estimation with Left-Right Consistency

被引:1765
|
作者
Godard, Clement [1 ]
Mac Aodha, Oisin [1 ]
Brostow, Gabriel J. [1 ]
机构
[1] UCL, London, England
基金
英国工程与自然科学研究理事会;
关键词
D O I
10.1109/CVPR.2017.699
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning based methods have shown very promising results for the task of depth estimation in single images. However, most existing approaches treat depth prediction as a supervised regression problem and as a result, require vast quantities of corresponding ground truth depth data for training. Just recording quality depth data in a range of environments is a challenging problem. In this paper, we innovate beyond existing approaches, replacing the use of explicit depth data during training with easier-to-obtain binocular stereo footage. We propose a novel training objective that enables our convolutional neural network to learn to perform single image depth estimation, despite the absence of ground truth depth data. Exploiting epipolar geometry constraints, we generate disparity images by training our network with an image reconstruction loss. We show that solving for image reconstruction alone results in poor quality depth images. To overcome this problem, we propose a novel training loss that enforces consistency between the disparities produced relative to both the left and right images, leading to improved performance and robustness compared to existing approaches. Our method produces state of the art results for monocular depth estimation on the KITTI driving dataset, even outperforming supervised methods that have been trained with ground truth depth.
引用
收藏
页码:6602 / 6611
页数:10
相关论文
共 50 条
  • [1] Unsupervised Monocular Depth and Camera Pose Estimation with Multiple Masks and Geometric Consistency Constraints
    Zhang, Xudong
    Zhao, Baigan
    Yao, Jiannan
    Wu, Guoqing
    [J]. SENSORS, 2023, 23 (11)
  • [2] Unsupervised Monocular Estimation of Depth and Visual Odometry Using Attention and Depth-Pose Consistency Loss
    Song, Xiaogang
    Hu, Haoyue
    Liang, Li
    Shi, Weiwei
    Xie, Guo
    Lu, Xiaofeng
    Hei, Xinhong
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3517 - 3529
  • [3] LEFT-RIGHT CONSISTENCY IN RINGS III
    Djordjevic, Dragan
    Harte, Robin
    Stack, Cora
    [J]. QUAESTIONES MATHEMATICAE, 2011, 34 (03) : 335 - 339
  • [4] Unsupervised Monocular Depth Estimation for Monocular Visual SLAM Systems
    Liu, Feng
    Huang, Ming
    Ge, Hongyu
    Tao, Dan
    Gao, Ruipeng
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 13
  • [5] Monocular Depth Estimation Based on Unsupervised Learning
    Liu, Wan
    Sun, Yan
    Wang, XuCheng
    Yang, Lin
    Zheng, Zhenrong
    [J]. OPTOELECTRONIC IMAGING AND MULTIMEDIA TECHNOLOGY VI, 2019, 11187
  • [6] Unsupervised Monocular Depth and Pose Estimation Using Multiple Masks Based on Photometric and Geometric Consistency
    Kong, Huifang
    Liu, Tiankuo
    Hu, Jie
    Fang, Yao
    Sun, Jixing
    [J]. 2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 3558 - 3563
  • [7] AsiANet: Autoencoders in Autoencoder for Unsupervised Monocular Depth Estimation
    Yusiong, John Paul T.
    Naval, Prospero C., Jr.
    [J]. 2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 443 - 451
  • [8] Unsupervised Monocular Depth Estimation With Channel and Spatial Attention
    Wang, Zhuping
    Dai, Xinke
    Guo, Zhanyu
    Huang, Chao
    Zhang, Hao
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (06) : 7860 - 7870
  • [9] Dual CNN Models for Unsupervised Monocular Depth Estimation
    Repala, Vamshi Krishna
    Dubey, Shiv Ram
    [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT I, 2019, 11941 : 209 - 217
  • [10] Structured Adversarial Training for Unsupervised Monocular Depth Estimation
    Mehta, Ishit
    Sakurikar, Parikshit
    Narayanan, P. J.
    [J]. 2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, : 314 - 323