Single-view 3D Reconstruction Algorithm Based on View-aware

被引:0
|
作者
Wang Nian [1 ]
Hu Xuyang [1 ]
Zhu Fan [2 ]
Tang Jun [1 ]
机构
[1] Anhui Univ, Sch Elect Informat Engn, Hefei 230031, Peoples R China
[2] Incept Inst Artificial Intelligence, Abu Dhabi 51133, U Arab Emirates
关键词
View-aware; 3D reconstruction; Viewpoint translation; End-to-end neural network; Adaptive fusional;
D O I
10.11999/JEIT190986
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
While projecting 3D shapes to 2D images is irreversible due to the abandoned dimension amid the projection process, there are rapidly growing interests across various vertical industries for 3D reconstruction techniques, from visualization purposes to computer aided geometric design. The traditional 3D reconstruction approaches based on depth map or RGB image can synthesize visually satisfactory 3D objects, while they generally suffer from several problems: (1)The 2D to 3D learning strategy is brutal-force; (2)Unable to solve the effects of differences in appearance from different viewpoints of objects; (3)Multiple images from distinctly different viewpoints are required. In this paper, an end-to-end View-Aware 3D (VA3D) reconstruction network is proposed to address the above problems. In particular, the VA3D includes a multi-neighbor-view synthesis sub-network and a 3D reconstruction sub-network. The multi-neighbor-view synthesis sub-network generates multiple neighboring viewpoint images based on the object source view, while the adaptive fusional module is added to resolve the blurry and distortion issues in viewpoint translation. The 3D reconstruction sub-network introduces a recurrent neural network to recover the object 3D shape from multi-view sequence. Extensive qualitative and quantitative experiments on the ShapeNet dataset show that the VA3D effectively improves the 3D reconstruction results based on single-view.
引用
收藏
页码:3053 / 3060
页数:8
相关论文
共 21 条
  • [1] [Anonymous], 2016, INSTANCE NORMALIZATI
  • [2] CHANG A X, 2015, SHAPENET INFORM RICH
  • [3] Speaker Recognition Based on Multimodal Generative Adversarial Nets with Triplet-loss
    Chen Ying
    Chen Huangkang
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2020, 42 (02) : 379 - 385
  • [4] 3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction
    Choy, Christopher B.
    Xu, Danfei
    Gwak, Jun Young
    Chen, Kevin
    Savarese, Silvio
    [J]. COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 : 628 - 644
  • [5] Eigen D, 2014, ADV NEUR IN, V27
  • [6] Improving Shape Deformation in Unsupervised Image-to-Image Translation
    Gokaslan, Aaron
    Ramanujan, Vivek
    Ritchie, Daniel
    Kim, Kwang In
    Tompkin, James
    [J]. COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 : 662 - 678
  • [7] Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
  • [8] 3D Pose Estimation and 3D Model Retrieval for Objects in the Wild
    Grabner, Alexander
    Roth, Peter M.
    Lepetit, Vincent
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3022 - 3031
  • [9] Gulrajani I., 2017, ADV NEURAL INFPROCES, V30, P5767
  • [10] Triplet-Center Loss for Multi-View 3D Object Retrieval
    He, Xinwei
    Zhou, Yang
    Zhou, Zhichao
    Bai, Song
    Bai, Xiang
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1945 - 1954