Toward 3D object reconstruction from stereo images

被引:0
|
作者
Xie, Haozhe [1 ,2 ]
Yao, Hongxun [1 ]
Zhou, Shangchen [3 ]
Zhang, Shengping [1 ]
Tong, Xiaojun [1 ]
Sun, Wenxiu [2 ]
机构
[1] Harbin Inst Technol, Fac Comp, Harbin, Peoples R China
[2] SenseTime Res & TetrasAI, Beijing, Peoples R China
[3] Nanyang Technol Univ, S Lab, Singapore, Singapore
基金
中国国家自然科学基金;
关键词
3D object reconstruction; Stereo vision; Voxel; Point cloud; Neural network; SHAPE;
D O I
10.1016/j.neucom.2021.07.089
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Inferring the complete 3D shape of an object from an RGB image has shown impressive results, however, existing methods rely primarily on recognizing the most similar 3D model from the training set to solve the problem. These methods suffer from poor generalization and may lead to low-quality reconstructions for unseen objects. Nowadays, stereo cameras are pervasive in emerging devices such as dual-lens smart-phones and robots, which enables the use of the two-view nature of stereo images to explore the 3D structure and thus improve the reconstruction performance. In this paper, we propose a new deep learn-ing framework for reconstructing the 3D shape of an object from a pair of stereo images, which reasons about the 3D structure of the object by taking bidirectional disparities and feature correspondences between the two views into account. Besides, we present a large-scale synthetic benchmarking dataset, namely StereoShapeNet, containing 1,052,976 pairs of stereo images rendered from ShapeNet along with the corresponding bidirectional depth and disparity maps. Experimental results on the StereoShapeNet benchmark demonstrate that the proposed framework outperforms the state-of-the-art methods. The project page is available at https://haozhexie.com/project/stereo-3d-reconstruction. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:444 / 453
页数:10
相关论文
共 50 条
  • [41] OVERVIEW ON 3D RECONSTRUCTION FROM IMAGES
    Aharchi, Moncef
    Kbir, M'hamed Ait
    [J]. 4TH INTERNATIONAL CONFERENCE ON SMART CITY APPLICATIONS (SCA' 19), 2019,
  • [42] 3D RECONSTRUCTION FROM MRI IMAGES
    Anderla, Andras
    Brkljac, Branko
    Stefanovic, Darko
    Krsmanovic, Cvijan
    Sladojevic, Srdan
    Culibrk, Dubravko
    [J]. METALURGIA INTERNATIONAL, 2013, 18 : 17 - 21
  • [43] 3D Reconstruction from Hyperspectral Images
    Zia, Ali
    Liang, Jie
    Zhou, Jun
    Gao, Yongsheng
    [J]. 2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, : 318 - 325
  • [44] 3D reconstruction of stereo images for interaction between real and virtual worlds
    Kim, H
    Yang, SJ
    Sohn, K
    [J]. SECOND IEEE AND ACM INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY, PROCEEDINGS, 2003, : 169 - 177
  • [45] 3D surface reconstruction of stereo endoscopic images for minimally invasive surgery
    Huang X.
    Abdalbari A.
    Ren J.
    [J]. Biomedical Engineering Letters, 2013, 3 (03) : 149 - 157
  • [46] 3D reconstruction of surface and subsurface structures of solids by SEM stereo images
    Sokolov, VN
    Yurkovets, DI
    Mel'nik, VN
    Boyde, A
    Howell, PGT
    [J]. ELECTRON MICROSCOPY AND ANALYSIS 2001, 2001, (168): : 119 - 122
  • [47] Simplified video transmission of stereo images for 3D image reconstruction in TV
    Balasubramanian, K.
    Cellatoglu, A.
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2008, 54 (02) : 307 - 315
  • [48] Transformer-Based Stereo-Aware 3D Object Detection From Binocular Images
    Sun, Hanqing
    Pang, Yanwei
    Cao, Jiale
    Xie, Jin
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, : 19675 - 19687
  • [49] Toward Cooperative 3D Object Reconstruction with Multi-agent
    Li, Xiong
    Wen, Zhenyu
    Zhou, Leiqiang
    Li, Chenwei
    Zhou, Yejian
    Li, Taotao
    Hong, Zhen
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 4975 - 4982
  • [50] Fast 3D model acquisition from stereo images
    Morency, LP
    Rahimi, A
    Darrell, T
    [J]. FIRST INTERNATIONAL SYMPOSIUM ON 3D DATA PROCESSING VISUALIZATION AND TRANSMISSION, 2002, : 172 - 176