Colorful 3D Reconstruction from Single Image Based on Deep Learning

被引：3

作者：

Zhu Yuzheng ^{[1
]}

Zhang Yaping ^{[1
]}

Feng Qiaosheng ^{[1
]}

机构：

[1] Yunnan Normal Univ, Sch Informat Sci & Technol, Kunming 650500, Yunnan, Peoples R China

来源：

LASER & OPTOELECTRONICS PROGRESS | 2021年 / 58卷 / 14期

关键词：

deep learning; colorful three-dimensional reconstruction; single image; differentiable renderer; attention mechanism;

D O I：

10.3788/L0P202158.1410010

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The task of recovering the 3D shape and its surface color from a single image at the same time is extremely challenging. For this reason, an end-to-end network model is proposed to solve this problem, which uses an encoder and decoder structure. Taking a single image as input, first extract the features through the encoder, and then send them to the shape generator and the color generator at the same time to get the shape estimation and the corresponding surface color, and finally through the differentiable rendering framework to generate the fianl color three-dimensional model. In order to ensure the details of the reconstructed 3D model, an attention mechanism is introduced into the network encoder to further improve the reconstruction effect. The experimental results show that compared with the 3D reconstruction network models, the designed model has a 10% and 3% A increase in the real 3D model intersection ratio; compared with the open source project, the structural similarity of the designed model is improved by 3 A, and the mean square error is reduced by 1.2%.

引用

页数：9

共 20 条

[1] [陈加 Chen Jia], 2019, [自动化学报, Acta Automatica Sinica], V45, P657
[2] 3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction
Choy, Christopher B.
Xu, Danfei
Gwak, Jun Young
Chen, Kevin
Savarese, Silvio
[J]. COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 : 628 - 644
[3] A Point Set Generation Network for 3D Object Reconstruction from a Single Image
Fan, Haoqiang
Su, Hao
Guibas, Leonidas
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2463 - 2471
[4] Reducing the dimensionality of data with neural networks
Hinton, G. E.
Salakhutdinov, R. R.
[J]. SCIENCE, 2006, 313 (5786) : 504 - 507
[5] Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/TPAMI.2019.2913372, 10.1109/CVPR.2018.00745]
[6] Learning Category-Specific Mesh Reconstruction from Image Collections
Kanazawa, Angjoo
Tulsiani, Shubham
Efros, Alexei A.
Malik, Jitendra
[J]. COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 : 386 - 402
[7] Kato H., 2017, NEURAL 3D MESH RENDE
[8] Learning View Priors for Single-view 3D Reconstruction
Kato, Hiroharu
Harada, Tatsuya
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9770 - 9779
[9] Soft Rasterizer: A Differentiable Renderer for Image-based 3D Reasoning
Liu, Shichen
Li, Tianye
Chen, Weikai
Li, Hao
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7707 - 7716
[10] SMPL: A Skinned Multi-Person Linear Model
Loper, Matthew
Mahmood, Naureen
Romero, Javier
Pons-Moll, Gerard
Black, Michael J.
[J]. ACM TRANSACTIONS ON GRAPHICS, 2015, 34 (06):

← 1 2 →