Progressive Fusion for Unsupervised Binocular Depth Estimation Using Cycled Networks

被引:20
|
作者
Pilzer, Andrea [1 ]
Lathuiliere, Stephane [1 ]
Xu, Dan [1 ,2 ]
Puscas, Mihai Marian [1 ]
Ricci, Elisa [1 ,3 ]
Sebe, Nicu [1 ,4 ]
机构
[1] Univ Trento, Dept Informat Engn & Comp Sci, I-38122 Trento, Italy
[2] Univ Oxford, Dept Engn Sci, Oxford OX1 2JD, England
[3] Fdn Bruno Kessler, I-38122 Trento, Italy
[4] Huawei Technol Ireland, Dublin D02 R156, Ireland
关键词
Estimation; Training; Deep learning; Cameras; Solid modeling; Predictive models; Network architecture; Stereo depth estimation; convolutional neural networks (ConvNet); deep multi-scale fusion; cycle network;
D O I
10.1109/TPAMI.2019.2942928
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent deep monocular depth estimation approaches based on supervised regression have achieved remarkable performance. However, they require costly ground truth annotations during training. To cope with this issue, in this paper we present a novel unsupervised deep learning approach for predicting depth maps. We introduce a new network architecture, named Progressive Fusion Network (PFN), that is specifically designed for binocular stereo depth estimation. This network is based on a multi-scale refinement strategy that combines the information provided by both stereo views. In addition, we propose to stack twice this network in order to form a cycle. This cycle approach can be interpreted as a form of data-augmentation since, at training time, the network learns both from the training set images (in the forward half-cycle) but also from the synthesized images (in the backward half-cycle). The architecture is jointly trained with adversarial learning. Extensive experiments on the publicly available datasets KITTI, Cityscapes and ApolloScape demonstrate the effectiveness of the proposed model which is competitive with other unsupervised deep learning methods for depth prediction.
引用
收藏
页码:2380 / 2395
页数:16
相关论文
共 50 条
  • [21] Unsupervised Learning for Stereo Depth Estimation using Efficient Correspondence Matching
    Wenbin, Hui
    Sei-Ichiro, Kamata
    2021 5TH INTERNATIONAL CONFERENCE ON ADVANCES IN IMAGE PROCESSING, ICAIP 2021, 2021, : 30 - 34
  • [22] Attention based multilayer feature fusion convolutional neural network for unsupervised monocular depth estimation
    Lei, Zeyu
    Wang, Yan
    Li, Zijian
    Yang, Junyao
    NEUROCOMPUTING, 2021, 423 : 343 - 352
  • [24] SGANVO: Unsupervised Deep Visual Odometry and Depth Estimation With Stacked Generative Adversarial Networks
    Feng, Tuo
    Gu, Dongbing
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (04): : 4431 - 4437
  • [25] GANVO: Unsupervised Deep Monocular Visual Odometry and Depth Estimation with Generative Adversarial Networks
    Almalioglu, Yasin
    Saputra, Muhamad Risqi U.
    de Gusmao, Pedro P. B.
    Markham, Andrew
    Trigoni, Niki
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 5474 - 5480
  • [26] Binocular depth discrimination and estimation beyond interaction space
    Allison, Robert S.
    Gillam, Barbara J.
    Vecellio, Elia
    JOURNAL OF VISION, 2009, 9 (01):
  • [27] Monocular Depth Estimation Based on Unsupervised Learning
    Liu, Wan
    Sun, Yan
    Wang, XuCheng
    Yang, Lin
    Zheng, Zhenrong
    OPTOELECTRONIC IMAGING AND MULTIMEDIA TECHNOLOGY VI, 2019, 11187
  • [28] Unsupervised Deep Event Stereo for Depth Estimation
    Uddin, S. M. Nadim
    Ahmed, Soikat Hasan
    Jung, Yong Ju
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7489 - 7504
  • [29] Unsupervised Human Pose Estimation on Depth Images
    Blanc-Beyne, Thibault
    Carlier, Axel
    Mouysset, Sandrine
    Charvillat, Vincent
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE TRACK, ECML PKDD 2020, PT IV, 2021, 12460 : 358 - 373
  • [30] UNSUPERVISED SINGLE IMAGE UNDERWATER DEPTH ESTIMATION
    Gupta, Honey
    Mitra, Kaushik
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 624 - 628