Toward 3D object reconstruction from stereo images

被引:13
|
作者
Xie, Haozhe [1 ,2 ]
Yao, Hongxun [1 ]
Zhou, Shangchen [3 ]
Zhang, Shengping [1 ]
Tong, Xiaojun [1 ]
Sun, Wenxiu [2 ]
机构
[1] Harbin Inst Technol, Fac Comp, Harbin, Peoples R China
[2] SenseTime Res & TetrasAI, Beijing, Peoples R China
[3] Nanyang Technol Univ, S Lab, Singapore, Singapore
基金
中国国家自然科学基金;
关键词
3D object reconstruction; Stereo vision; Voxel; Point cloud; Neural network; SHAPE;
D O I
10.1016/j.neucom.2021.07.089
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Inferring the complete 3D shape of an object from an RGB image has shown impressive results, however, existing methods rely primarily on recognizing the most similar 3D model from the training set to solve the problem. These methods suffer from poor generalization and may lead to low-quality reconstructions for unseen objects. Nowadays, stereo cameras are pervasive in emerging devices such as dual-lens smart-phones and robots, which enables the use of the two-view nature of stereo images to explore the 3D structure and thus improve the reconstruction performance. In this paper, we propose a new deep learn-ing framework for reconstructing the 3D shape of an object from a pair of stereo images, which reasons about the 3D structure of the object by taking bidirectional disparities and feature correspondences between the two views into account. Besides, we present a large-scale synthetic benchmarking dataset, namely StereoShapeNet, containing 1,052,976 pairs of stereo images rendered from ShapeNet along with the corresponding bidirectional depth and disparity maps. Experimental results on the StereoShapeNet benchmark demonstrate that the proposed framework outperforms the state-of-the-art methods. The project page is available at https://haozhexie.com/project/stereo-3d-reconstruction. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:444 / 453
页数:10
相关论文
共 50 条
  • [31] Object recognition and 3D reconstruction of occluded objects using binocular stereo
    L. Priya
    Sheila Anand
    [J]. Cluster Computing, 2018, 21 : 29 - 38
  • [32] Object recognition and 3D reconstruction of occluded objects using binocular stereo
    Priya, L.
    Anand, Sheila
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2018, 21 (01): : 29 - 38
  • [33] 3D reconstruction of skin surface from photometric stereo images with specular reflection and interreflection
    Matsumoto, A
    Saito, H
    Ozawa, S
    [J]. ELECTRICAL ENGINEERING IN JAPAN, 1999, 129 (03) : 51 - 58
  • [34] Stereo reconstruction of 3D curves
    Sbert, C
    Solé, AF
    [J]. 15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS: COMPUTER VISION AND IMAGE ANALYSIS, 2000, : 912 - 915
  • [35] A METHOD OF 3D OBJECT RECONSTRUCTION FROM A SERIES OF CROSS-SECTIONAL IMAGES
    LEE, ET
    CHOI, YK
    PARK, KH
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1994, E77D (09) : 996 - 1004
  • [36] Object-shape recognition and 3D reconstruction from tactile sensor images
    Anwesha Khasnobish
    Garima Singh
    Arindam Jati
    Amit Konar
    D. N. Tibarewala
    [J]. Medical & Biological Engineering & Computing, 2014, 52 : 353 - 362
  • [37] Object-shape recognition and 3D reconstruction from tactile sensor images
    Khasnobish, Anwesha
    Singh, Garima
    Jati, Arindam
    Konar, Amit
    Tibarewala, D. N.
    [J]. MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2014, 52 (04) : 353 - 362
  • [38] Reconstruction of topology valid boundary of discrete object from 3D range images
    Gueorguieva, S
    Desbarats, P
    [J]. PROCEEDINGS OF THE FIFTH IASTED INTERNATIONAL CONFERENCE ON VISUALIZATION, IMAGING, AND IMAGE PROCESSING, 2005, : 388 - 393
  • [39] OVERVIEW ON 3D RECONSTRUCTION FROM IMAGES
    Aharchi, Moncef
    Kbir, M'hamed Ait
    [J]. 4TH INTERNATIONAL CONFERENCE ON SMART CITY APPLICATIONS (SCA' 19), 2019,
  • [40] 3D Reconstruction from Hyperspectral Images
    Zia, Ali
    Liang, Jie
    Zhou, Jun
    Gao, Yongsheng
    [J]. 2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, : 318 - 325