Toward 3D object reconstruction from stereo images

被引:13
|
作者
Xie, Haozhe [1 ,2 ]
Yao, Hongxun [1 ]
Zhou, Shangchen [3 ]
Zhang, Shengping [1 ]
Tong, Xiaojun [1 ]
Sun, Wenxiu [2 ]
机构
[1] Harbin Inst Technol, Fac Comp, Harbin, Peoples R China
[2] SenseTime Res & TetrasAI, Beijing, Peoples R China
[3] Nanyang Technol Univ, S Lab, Singapore, Singapore
基金
中国国家自然科学基金;
关键词
3D object reconstruction; Stereo vision; Voxel; Point cloud; Neural network; SHAPE;
D O I
10.1016/j.neucom.2021.07.089
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Inferring the complete 3D shape of an object from an RGB image has shown impressive results, however, existing methods rely primarily on recognizing the most similar 3D model from the training set to solve the problem. These methods suffer from poor generalization and may lead to low-quality reconstructions for unseen objects. Nowadays, stereo cameras are pervasive in emerging devices such as dual-lens smart-phones and robots, which enables the use of the two-view nature of stereo images to explore the 3D structure and thus improve the reconstruction performance. In this paper, we propose a new deep learn-ing framework for reconstructing the 3D shape of an object from a pair of stereo images, which reasons about the 3D structure of the object by taking bidirectional disparities and feature correspondences between the two views into account. Besides, we present a large-scale synthetic benchmarking dataset, namely StereoShapeNet, containing 1,052,976 pairs of stereo images rendered from ShapeNet along with the corresponding bidirectional depth and disparity maps. Experimental results on the StereoShapeNet benchmark demonstrate that the proposed framework outperforms the state-of-the-art methods. The project page is available at https://haozhexie.com/project/stereo-3d-reconstruction. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:444 / 453
页数:10
相关论文
共 50 条
  • [1] Toward 3D object reconstruction from stereo images
    Xie, Haozhe
    Yao, Hongxun
    Zhou, Shangchen
    Zhang, Shengping
    Tong, Xiaojun
    Sun, Wenxiu
    [J]. Neurocomputing, 2021, 463 : 444 - 453
  • [2] 3D object reconstruction from aerial stereo images
    Zlatanova, S
    Paintsil, J
    Tempfli, K
    [J]. WSCG '98, VOL 3: SIXTH INTERNATIONAL CONFERENCE IN CENTRAL EUROPE ON COMPUTER GRAPHICS AND VISUALIZATION 98, 1998, : 472 - 478
  • [3] Improved 3D face reconstruction from stereo images
    Yu, Jian
    Da, Feipeng
    [J]. SIXTH INTERNATIONAL CONFERENCE ON OPTICAL AND PHOTONIC ENGINEERING (ICOPEN 2018), 2018, 10827
  • [4] PLUMENet: Efficient 3D Object Detection from Stereo Images
    Wang, Yan
    Yang, Bin
    Hu, Rui
    Liang, Ming
    Urtasun, Raquel
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 3383 - 3390
  • [5] Multi-stereo 3D object reconstruction
    Esteban, CH
    Schmitt, F
    [J]. FIRST INTERNATIONAL SYMPOSIUM ON 3D DATA PROCESSING VISUALIZATION AND TRANSMISSION, 2002, : 159 - 166
  • [6] ACCURATE AND EFFICIENT RECONSTRUCTION OF 3D FACES FROM STEREO IMAGES
    Le, Vuong
    Tang, Hao
    Cao, Liangliang
    Huang, Thomas S.
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 4265 - 4268
  • [7] Reconstruction of 3D configuration of object from infrared images
    Fan, Y
    Wang, DZ
    [J]. VISUAL INFORMATION PROCESSING V, 1996, 2753 : 134 - 138
  • [8] Reconstruction of 3D Object Meshes from Silhouette Images
    Anselmo Antunes Montenegro
    Luiz Velho
    Paulo C. P. Carvalho
    Jonas Sossai
    [J]. Journal of Mathematical Imaging and Vision, 2007, 29 : 119 - 130
  • [9] Reconstruction of 3D object meshes from silhouette images
    Montenegro, Anselmo Antunes
    Velho, Luiz
    Carvalho, Paulo C. P.
    Sossai, Jonas, Jr.
    [J]. JOURNAL OF MATHEMATICAL IMAGING AND VISION, 2007, 29 (2-3) : 119 - 130
  • [10] 3D box method for 3D reconstruction of an object from multi-images
    Alam, J
    Hama, H
    [J]. VISION GEOMETRY IX, 2000, 4117 : 81 - 90