Progressive Fusion for Unsupervised Binocular Depth Estimation Using Cycled Networks

被引:20
|
作者
Pilzer, Andrea [1 ]
Lathuiliere, Stephane [1 ]
Xu, Dan [1 ,2 ]
Puscas, Mihai Marian [1 ]
Ricci, Elisa [1 ,3 ]
Sebe, Nicu [1 ,4 ]
机构
[1] Univ Trento, Dept Informat Engn & Comp Sci, I-38122 Trento, Italy
[2] Univ Oxford, Dept Engn Sci, Oxford OX1 2JD, England
[3] Fdn Bruno Kessler, I-38122 Trento, Italy
[4] Huawei Technol Ireland, Dublin D02 R156, Ireland
关键词
Estimation; Training; Deep learning; Cameras; Solid modeling; Predictive models; Network architecture; Stereo depth estimation; convolutional neural networks (ConvNet); deep multi-scale fusion; cycle network;
D O I
10.1109/TPAMI.2019.2942928
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent deep monocular depth estimation approaches based on supervised regression have achieved remarkable performance. However, they require costly ground truth annotations during training. To cope with this issue, in this paper we present a novel unsupervised deep learning approach for predicting depth maps. We introduce a new network architecture, named Progressive Fusion Network (PFN), that is specifically designed for binocular stereo depth estimation. This network is based on a multi-scale refinement strategy that combines the information provided by both stereo views. In addition, we propose to stack twice this network in order to form a cycle. This cycle approach can be interpreted as a form of data-augmentation since, at training time, the network learns both from the training set images (in the forward half-cycle) but also from the synthesized images (in the backward half-cycle). The architecture is jointly trained with adversarial learning. Extensive experiments on the publicly available datasets KITTI, Cityscapes and ApolloScape demonstrate the effectiveness of the proposed model which is competitive with other unsupervised deep learning methods for depth prediction.
引用
收藏
页码:2380 / 2395
页数:16
相关论文
共 50 条
  • [1] Unsupervised Adversarial Depth Estimation using Cycled Generative Networks
    Pilzer, Andrea
    Xu, Dan
    Puscas, Mihai Marian
    Ricci, Elisa
    Sebe, Nicu
    2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, : 587 - 595
  • [2] Unsupervised underwater imaging based on polarization and binocular depth estimation
    Guo, Enlai
    Jiang, Jian
    Shi, Yingjie
    Bai, Lianfa
    Han, Jing
    OPTICS EXPRESS, 2024, 32 (06): : 9904 - 9919
  • [3] Unsupervised Monocular Depth Estimation Based on Dense Feature Fusion
    Chen Ying
    Wang Yiliang
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (10) : 2976 - 2984
  • [4] UNSUPERVISED MONOCULAR DEPTH ESTIMATION OF DRIVING SCENES USING SIAMESE CONVOLUTIONAL LSTM NETWORKS
    Yusiong, John Paul Tan
    Naval, Prospero Clara, Jr.
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2020, 16 (01): : 91 - 106
  • [5] Depth Estimation Based on Binocular Disparity and Color-Coded Aperture Fusion
    Zhou, Dianle
    Wang, Xiaoshen
    Zhong, Zhiwei
    Pan, Xiaotian
    Shun, Xilu
    THREE-DIMENSIONAL IMAGE ACQUISITION AND DISPLAY TECHNOLOGY AND APPLICATIONS, 2018, 10845
  • [6] Unsupervised Depth Estimation Using Feature Matching Method
    Wang, Jiangzhuo
    Chen, Honghang
    Li, Jianxun
    PROCEEDINGS OF THE 30TH CHINESE CONTROL AND DECISION CONFERENCE (2018 CCDC), 2018, : 3525 - 3530
  • [7] Unsupervised binocular depth prediction network for laparoscopic surgery
    Xu, Ke
    Chen, Zhiyong
    Jia, Fucang
    COMPUTER ASSISTED SURGERY, 2019, 24 : 30 - 35
  • [8] LiDAR-ToF-Binocular depth fusion using gradient priors
    Zhao, Xiaoming
    Chen, Weihai
    Liu, Ziyang
    Ma, Xinzhi
    Kong, Lingkun
    Wu, Xingming
    Yue, Haosong
    Yan, Xing
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 2024 - 2029
  • [9] Binocular Depth Estimation Algorithm Based on Multi-Scale Attention Feature Fusion
    Yang Huitong
    Lei Lang
    Lin Yongchun
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (18)
  • [10] Structured Coupled Generative Adversarial Networks for Unsupervised Monocular Depth Estimation
    Puscas, Mihai Marian
    Xu, Dan
    Pilzer, Andrea
    Sebe, Niculae
    2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, : 18 - 26