3-d depth reconstruction from a single still image

被引:394
|
作者
Saxena, Ashutosh [1 ]
Chung, Sung H. [1 ]
Ng, Andrew Y. [1 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
关键词
monocular vision; learning depth; 3D reconstruction; dense reconstruction; Markov Random Field; depth estimation; monocular depth; stereo vision; hand-held camera; visual modeling;
D O I
10.1007/s11263-007-0071-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider the task of 3-d depth estimation from a single still image. We take a supervised learning approach to this problem, in which we begin by collecting a training set of monocular images (of unstructured indoor and outdoor environments which include forests, sidewalks, trees, buildings, etc.) and their corresponding ground-truth depthmaps. Then, we apply supervised learning to predict the value of the depthmap as a function of the image. Depth estimation is a challenging problem, since local features alone are insufficient to estimate depth at a point, and one needs to consider the global context of the image. Our model uses a hierarchical, multiscale Markov Random Field (MRF) that incorporates multiscale local- and global-image features, and models the depths and the relation between depths at different points in the image. We show that, even on unstructured scenes, our algorithm is frequently able to recover fairly accurate depthmaps. We further propose a model that incorporates both monocular cues and stereo (triangulation) cues, to obtain significantly more accurate depth estimates than is possible using either monocular or stereo cues alone.
引用
收藏
页码:53 / 69
页数:17
相关论文
共 50 条
  • [1] 3-D Depth Reconstruction from a Single Still Image
    Ashutosh Saxena
    Sung H. Chung
    Andrew Y. Ng
    [J]. International Journal of Computer Vision, 2008, 76 : 53 - 69
  • [2] Learning 3-d scene structure from a single still image
    Saxena, Ashutosh
    Sun, Min
    Ng, Andrew Y.
    [J]. 2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, : 1 - 8
  • [3] Face Denoising and 3D Reconstruction from A Single Depth Image
    Zhong, Yicheng
    Pei, Yuru
    Li, Peixin
    Guo, Yuke
    Ma, Gengyu
    Liu, Meng
    Bai, Wei
    Wu, WenHai
    Zha, Hongbin
    [J]. 2020 15TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2020), 2020, : 117 - 124
  • [4] 3-D Reconstruction of Human Body Shape From a Single Commodity Depth Camera
    Zhao, Tianhao
    Li, Songnan
    Ngan, King Ngi
    Wu, Fanzi
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (01) : 114 - 123
  • [5] AUTOMATIC 3-D DEPTH RECOVERY FROM A SINGLE URBAN-SCENE IMAGE
    Tseng, Chen-yu
    Wang, Sheng-Jyh
    [J]. 2012 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2012,
  • [6] An Unsupervised Approach for 3D Face Reconstruction from a Single Depth Image
    Li, Peixin
    Pei, Yuru
    Zhong, Yicheng
    Guo, Yuke
    Ma, Gengyu
    Liu, Meng
    Bai, Wei
    Wu, Wenhai
    Zha, Hongbin
    [J]. ADVANCES IN COMPUTER GRAPHICS, CGI 2020, 2020, 12221 : 206 - 219
  • [7] Depth Resolution in 3-D image
    Son, Jung-Young
    Park, Min-Chul
    Lee, Chun-Hea
    Chernyshov, Oleksii O.
    Son, Wook-Ho
    [J]. IDW/AD '12: PROCEEDINGS OF THE INTERNATIONAL DISPLAY WORKSHOPS, PT 1, 2012, 19 : 195 - 198
  • [8] Reconstruction of 3-D image from stereo pictures
    Breznan, M
    Hronec, R
    [J]. Proceedings ELMAR-2005, 2005, : 19 - 23
  • [9] SINGLE-IMAGE 3-D DEPTH ESTIMATION FOR URBAN SCENES
    Cheng, Hsin-Min
    Tseng, Chen-Yu
    Hsin, Cheng-Ho
    Wang, Sheng-Jyh
    [J]. 2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 2121 - 2125
  • [10] 3-D reconstruction of building from single high-resolution SAR image
    Fu Xing-Yu
    You Hong-Jian
    Fu Kun
    [J]. JOURNAL OF INFRARED AND MILLIMETER WAVES, 2012, 31 (06) : 569 - 576