Progressive Fusion for Unsupervised Binocular Depth Estimation Using Cycled Networks

被引:20
|
作者
Pilzer, Andrea [1 ]
Lathuiliere, Stephane [1 ]
Xu, Dan [1 ,2 ]
Puscas, Mihai Marian [1 ]
Ricci, Elisa [1 ,3 ]
Sebe, Nicu [1 ,4 ]
机构
[1] Univ Trento, Dept Informat Engn & Comp Sci, I-38122 Trento, Italy
[2] Univ Oxford, Dept Engn Sci, Oxford OX1 2JD, England
[3] Fdn Bruno Kessler, I-38122 Trento, Italy
[4] Huawei Technol Ireland, Dublin D02 R156, Ireland
关键词
Estimation; Training; Deep learning; Cameras; Solid modeling; Predictive models; Network architecture; Stereo depth estimation; convolutional neural networks (ConvNet); deep multi-scale fusion; cycle network;
D O I
10.1109/TPAMI.2019.2942928
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent deep monocular depth estimation approaches based on supervised regression have achieved remarkable performance. However, they require costly ground truth annotations during training. To cope with this issue, in this paper we present a novel unsupervised deep learning approach for predicting depth maps. We introduce a new network architecture, named Progressive Fusion Network (PFN), that is specifically designed for binocular stereo depth estimation. This network is based on a multi-scale refinement strategy that combines the information provided by both stereo views. In addition, we propose to stack twice this network in order to form a cycle. This cycle approach can be interpreted as a form of data-augmentation since, at training time, the network learns both from the training set images (in the forward half-cycle) but also from the synthesized images (in the backward half-cycle). The architecture is jointly trained with adversarial learning. Extensive experiments on the publicly available datasets KITTI, Cityscapes and ApolloScape demonstrate the effectiveness of the proposed model which is competitive with other unsupervised deep learning methods for depth prediction.
引用
收藏
页码:2380 / 2395
页数:16
相关论文
共 50 条
  • [41] Stereo Where-What Networks: Unsupervised Binocular Feature Learning
    Solgi, Mojtaba
    Weng, Juyang
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [42] Unsupervised Monocular Depth Estimation for Autonomous Flight of Drones
    Zhao Shuanfeng
    Huang Tao
    Xu Qian
    Geng Longlong
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (02)
  • [43] Unsupervised monocular depth estimation based on edge enhancement
    Qu Y.
    Chen Y.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2024, 46 (01): : 71 - 79
  • [44] Efficient unsupervised monocular depth estimation using attention guided generative adversarial network
    Sumanta Bhattacharyya
    Ju Shen
    Stephen Welch
    Chen Chen
    Journal of Real-Time Image Processing, 2021, 18 : 1357 - 1368
  • [45] Unsupervised Monocular Depth Estimation With Channel and Spatial Attention
    Wang, Zhuping
    Dai, Xinke
    Guo, Zhanyu
    Huang, Chao
    Zhang, Hao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (06) : 7860 - 7870
  • [46] Efficient unsupervised monocular depth estimation using attention guided generative adversarial network
    Bhattacharyya, Sumanta
    Shen, Ju
    Welch, Stephen
    Chen, Chen
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2021, 18 (04) : 1357 - 1368
  • [47] AsiANet: Autoencoders in Autoencoder for Unsupervised Monocular Depth Estimation
    Yusiong, John Paul T.
    Naval, Prospero C., Jr.
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 443 - 451
  • [48] Unsupervised deep learning for depth estimation with offset pixels
    Imran, Saad
    Bin Mukarram, Sikander
    Khan, Muhammad Umar Karim
    Kyung, Chong-Min
    OPTICS EXPRESS, 2020, 28 (06) : 8619 - 8639
  • [49] Structured Adversarial Training for Unsupervised Monocular Depth Estimation
    Mehta, Ishit
    Sakurikar, Parikshit
    Narayanan, P. J.
    2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, : 314 - 323
  • [50] Dual CNN Models for Unsupervised Monocular Depth Estimation
    Repala, Vamshi Krishna
    Dubey, Shiv Ram
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT I, 2019, 11941 : 209 - 217