Estimating Depth From Monocular Images as Classification Using Deep Fully Convolutional Residual Networks

被引：312

作者：

Cao, Yuanzhouhan ^{[1
]}

Wu, Zifeng ^{[1
]}

Shen, Chunhua ^{[1
]}

机构：

[1] Univ Adelaide, Sch Comp Sci, Adelaide, SA 5005, Australia

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2018年 / 28卷 / 11期

关键词：

Classification; deep residual networks; depth estimation;

D O I：

10.1109/TCSVT.2017.2740321

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Depth estimation from single monocular images is a key component in scene understanding. Most existing algorithms formulate depth estimation as a regression problem due to the continuous property of depths. However, the depth value of input data can hardly be regressed exactly to the ground-truth value. In this paper, we propose to formulate depth estimation as a pixelwise classification task. Specifically, we first discretize the continuous ground-truth depths into several bins and label the bins according to their depth ranges. Then, we solve the depth estimation problem as classification by training a fully convolutional deep residual network. Compared with estimating the exact depth of a single point, it is easier to estimate its depth range. More importantly, by performing depth classification instead of regression, we can easily obtain the confidence of a depth prediction in the form of probability distribution. With this confidence, we can apply an information gain loss to make use of the predictions that are close to ground-truth during training, as well as fully-connected conditional random fields for post-processing to further improve the performance. We test our proposed method on both indoor and outdoor benchmark RGB-Depth datasets and achieve state-of-the-art performance.

引用

页码：3174 / 3182

页数：9

共 50 条

[1] Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields
Liu, Fayao
Shen, Chunhua
Lin, Guosheng
Reid, Ian
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (10) : 2024 - 2039
[2] Monocular Depth Estimation as Regression of Classification using Piled Residual Networks
Su, Wen
Zhang, Haifeng
Li, Jia
Yang, Wenzhen
Wang, Zengfu
[J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2161 - 2169
[3] Feature Enhanced Fully Convolutional Networks for Monocular Depth Estimation
Shi, Chunxiu
Chen, Jie
Chen, Juan
Zhang, Zheng
[J]. 2019 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2019), 2019, : 270 - 276
[4] MONOCULAR DEPTH ESTIMATION OF GOOGLE EARTH IMAGES USING CONVOLUTIONAL NEURAL NETWORKS
Najaf, M.
Arefi, H.
Amirkolaee, H. Amini
Farajelahi, B.
[J]. ISPRS GEOSPATIAL CONFERENCE 2022, JOINT 6TH SENSORS AND MODELS IN PHOTOGRAMMETRY AND REMOTE SENSING, SMPR/4TH GEOSPATIAL INFORMATION RESEARCH, GIRESEARCH CONFERENCES, VOL. 10-4, 2023, : 589 - 594
[5] Deeper Depth Prediction with Fully Convolutional Residual Networks
Laina, Iro
Rupprecht, Christian
Belagiannis, Vasileios
Tombari, Federico
Navab, Nassir
[J]. PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, : 239 - 248
[6] Discrete convolutional CRF networks for depth estimation from monocular infrared images
Qianqian Wang
Haitao Zhao
Zhengwei Hu
Yuru Chen
Yuqi Li
[J]. International Journal of Machine Learning and Cybernetics, 2021, 12 : 187 - 200
[7] Discrete convolutional CRF networks for depth estimation from monocular infrared images
Wang, Qianqian
Zhao, Haitao
Hu, Zhengwei
Chen, Yuru
Li, Yuqi
[J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (01) : 187 - 200
[8] Target Classification Using the Deep Convolutional Networks for SAR Images
Chen, Sizhe
Wang, Haipeng
Xu, Feng
Jin, Ya-Qiu
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (08): : 4806 - 4817
[9] Fast Robust Monocular Depth Estimation for Obstacle Detection with Fully Convolutional Networks
Mancini, Michele
Costante, Gabriele
Valigi, Paolo
Ciarfuglia, Thomas A.
[J]. 2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 4296 - 4303
[10] Fully convolutional multi-scale dense networks for monocular depth estimation
Liu, Jiwei
Zhang, Yunzhou
Cui, Jiahua
Feng, Yonghui
Pang, Linzhuo
[J]. IET COMPUTER VISION, 2019, 13 (05) : 515 - 522

← 1 2 3 4 5 →