Colorful 3D Reconstruction from Single Image Based on Deep Learning

被引:3
|
作者
Zhu Yuzheng [1 ]
Zhang Yaping [1 ]
Feng Qiaosheng [1 ]
机构
[1] Yunnan Normal Univ, Sch Informat Sci & Technol, Kunming 650500, Yunnan, Peoples R China
关键词
deep learning; colorful three-dimensional reconstruction; single image; differentiable renderer; attention mechanism;
D O I
10.3788/L0P202158.1410010
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The task of recovering the 3D shape and its surface color from a single image at the same time is extremely challenging. For this reason, an end-to-end network model is proposed to solve this problem, which uses an encoder and decoder structure. Taking a single image as input, first extract the features through the encoder, and then send them to the shape generator and the color generator at the same time to get the shape estimation and the corresponding surface color, and finally through the differentiable rendering framework to generate the fianl color three-dimensional model. In order to ensure the details of the reconstructed 3D model, an attention mechanism is introduced into the network encoder to further improve the reconstruction effect. The experimental results show that compared with the 3D reconstruction network models, the designed model has a 10% and 3% A increase in the real 3D model intersection ratio; compared with the open source project, the structural similarity of the designed model is improved by 3 A, and the mean square error is reduced by 1.2%.
引用
收藏
页数:9
相关论文
共 20 条
  • [1] [陈加 Chen Jia], 2019, [自动化学报, Acta Automatica Sinica], V45, P657
  • [2] 3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction
    Choy, Christopher B.
    Xu, Danfei
    Gwak, Jun Young
    Chen, Kevin
    Savarese, Silvio
    [J]. COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 : 628 - 644
  • [3] A Point Set Generation Network for 3D Object Reconstruction from a Single Image
    Fan, Haoqiang
    Su, Hao
    Guibas, Leonidas
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2463 - 2471
  • [4] Reducing the dimensionality of data with neural networks
    Hinton, G. E.
    Salakhutdinov, R. R.
    [J]. SCIENCE, 2006, 313 (5786) : 504 - 507
  • [5] Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/TPAMI.2019.2913372, 10.1109/CVPR.2018.00745]
  • [6] Learning Category-Specific Mesh Reconstruction from Image Collections
    Kanazawa, Angjoo
    Tulsiani, Shubham
    Efros, Alexei A.
    Malik, Jitendra
    [J]. COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 : 386 - 402
  • [7] Kato H., 2017, NEURAL 3D MESH RENDE
  • [8] Learning View Priors for Single-view 3D Reconstruction
    Kato, Hiroharu
    Harada, Tatsuya
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9770 - 9779
  • [9] Soft Rasterizer: A Differentiable Renderer for Image-based 3D Reasoning
    Liu, Shichen
    Li, Tianye
    Chen, Weikai
    Li, Hao
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7707 - 7716
  • [10] SMPL: A Skinned Multi-Person Linear Model
    Loper, Matthew
    Mahmood, Naureen
    Romero, Javier
    Pons-Moll, Gerard
    Black, Michael J.
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2015, 34 (06):