Estimating Depth From Monocular Images as Classification Using Deep Fully Convolutional Residual Networks

被引:312
|
作者
Cao, Yuanzhouhan [1 ]
Wu, Zifeng [1 ]
Shen, Chunhua [1 ]
机构
[1] Univ Adelaide, Sch Comp Sci, Adelaide, SA 5005, Australia
关键词
Classification; deep residual networks; depth estimation;
D O I
10.1109/TCSVT.2017.2740321
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Depth estimation from single monocular images is a key component in scene understanding. Most existing algorithms formulate depth estimation as a regression problem due to the continuous property of depths. However, the depth value of input data can hardly be regressed exactly to the ground-truth value. In this paper, we propose to formulate depth estimation as a pixelwise classification task. Specifically, we first discretize the continuous ground-truth depths into several bins and label the bins according to their depth ranges. Then, we solve the depth estimation problem as classification by training a fully convolutional deep residual network. Compared with estimating the exact depth of a single point, it is easier to estimate its depth range. More importantly, by performing depth classification instead of regression, we can easily obtain the confidence of a depth prediction in the form of probability distribution. With this confidence, we can apply an information gain loss to make use of the predictions that are close to ground-truth during training, as well as fully-connected conditional random fields for post-processing to further improve the performance. We test our proposed method on both indoor and outdoor benchmark RGB-Depth datasets and achieve state-of-the-art performance.
引用
收藏
页码:3174 / 3182
页数:9
相关论文
共 50 条
  • [1] Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields
    Liu, Fayao
    Shen, Chunhua
    Lin, Guosheng
    Reid, Ian
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (10) : 2024 - 2039
  • [2] Monocular Depth Estimation as Regression of Classification using Piled Residual Networks
    Su, Wen
    Zhang, Haifeng
    Li, Jia
    Yang, Wenzhen
    Wang, Zengfu
    [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2161 - 2169
  • [3] Feature Enhanced Fully Convolutional Networks for Monocular Depth Estimation
    Shi, Chunxiu
    Chen, Jie
    Chen, Juan
    Zhang, Zheng
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2019), 2019, : 270 - 276
  • [4] MONOCULAR DEPTH ESTIMATION OF GOOGLE EARTH IMAGES USING CONVOLUTIONAL NEURAL NETWORKS
    Najaf, M.
    Arefi, H.
    Amirkolaee, H. Amini
    Farajelahi, B.
    [J]. ISPRS GEOSPATIAL CONFERENCE 2022, JOINT 6TH SENSORS AND MODELS IN PHOTOGRAMMETRY AND REMOTE SENSING, SMPR/4TH GEOSPATIAL INFORMATION RESEARCH, GIRESEARCH CONFERENCES, VOL. 10-4, 2023, : 589 - 594
  • [5] Deeper Depth Prediction with Fully Convolutional Residual Networks
    Laina, Iro
    Rupprecht, Christian
    Belagiannis, Vasileios
    Tombari, Federico
    Navab, Nassir
    [J]. PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, : 239 - 248
  • [6] Discrete convolutional CRF networks for depth estimation from monocular infrared images
    Qianqian Wang
    Haitao Zhao
    Zhengwei Hu
    Yuru Chen
    Yuqi Li
    [J]. International Journal of Machine Learning and Cybernetics, 2021, 12 : 187 - 200
  • [7] Discrete convolutional CRF networks for depth estimation from monocular infrared images
    Wang, Qianqian
    Zhao, Haitao
    Hu, Zhengwei
    Chen, Yuru
    Li, Yuqi
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (01) : 187 - 200
  • [8] Target Classification Using the Deep Convolutional Networks for SAR Images
    Chen, Sizhe
    Wang, Haipeng
    Xu, Feng
    Jin, Ya-Qiu
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (08): : 4806 - 4817
  • [9] Fast Robust Monocular Depth Estimation for Obstacle Detection with Fully Convolutional Networks
    Mancini, Michele
    Costante, Gabriele
    Valigi, Paolo
    Ciarfuglia, Thomas A.
    [J]. 2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 4296 - 4303
  • [10] Fully convolutional multi-scale dense networks for monocular depth estimation
    Liu, Jiwei
    Zhang, Yunzhou
    Cui, Jiahua
    Feng, Yonghui
    Pang, Linzhuo
    [J]. IET COMPUTER VISION, 2019, 13 (05) : 515 - 522