LEARNING POSE-AWARE 3D RECONSTRUCTION VIA 2D-3D SELF-CONSISTENCY

被引:0
|
作者
Liao, Yi-Lun [1 ]
Yang, Yao-Cheng [1 ]
Lin, Yuan-Fang [1 ]
Chen, Pin-Jung [1 ]
Kuo, Chia-Wen [2 ]
Chiu, Wei-Chen [3 ]
Wang, Yu-Chiang Frank [1 ]
机构
[1] Natl Taiwan Univ, Dept Elect Engn, Taipei, Taiwan
[2] Georgia Inst Technol, Dept Comp Sci, Atlanta, GA 30332 USA
[3] Natl Chiao Tung Univ, Dept Comp Sci, Hsinchu, Taiwan
关键词
deep learning; 3D shape reconstruction; camera pose estimation; perspective projection;
D O I
10.1109/icassp.2019.8682813
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
3D reconstruction, inferring 3D shape information from a single 2D image, has drawn attention from learning and vision communities. In this paper, we propose a framework for learning pose-aware 3D shape reconstruction. Our proposed model learns deep representation for recovering the 3D object, with the ability to extract camera pose information but without any direct supervision of ground truth camera pose. This is realized by exploitation of 2D-3D self-consistency between 2D masks and 3D voxels. Experiments qualitatively and quantitatively demonstrate the effectiveness and robustness of our model, which performs favorably against state-of-the-art methods.
引用
收藏
页码:3857 / 3861
页数:5
相关论文
共 50 条
  • [21] Towards CLIP-driven Language-free 3D Visual Grounding via 2D-3D Relational Enhancement and Consistency
    Zhang, Yuqi
    Luo, Han
    Lei, Yinjie
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 13063 - 13072
  • [22] ACCURATE 3D RECONSTRUCTION VIA SURFACE-CONSISTENCY
    Wu, Chenglei
    Cao, Xun
    Dai, Qionghai
    2009 3DTV-CONFERENCE: THE TRUE VISION - CAPTURE, TRANSMISSION AND DISPLAY OF 3D VIDEO, 2009, : 49 - +
  • [23] Face 2D to 3D Reconstruction Network Based on Head Pose and 3D Facial Landmarks
    Xu, Yuanquan
    Jung, Cheolkon
    2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
  • [24] Using Specular Highlights as Pose Invariant Features for 2D-3D Pose Estimation
    Netz, Aaron
    Osadchy, Margarita
    2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, : 721 - 728
  • [25] 3D Band Diagram and Photoexcitation of 2D-3D Semiconductor Heterojunctions
    Li, Bo
    Shi, Gang
    Lei, Sidong
    He, Yongmin
    Gao, Weilu
    Gong, Yongji
    Ye, Gonglan
    Zhou, Wu
    Keyshar, Kunttal
    Hao, Ji
    Dong, Pei
    Ge, Liehui
    Lou, Jun
    Kono, Junichiro
    Vajtai, Robert
    Ajayan, Pulickel M.
    NANO LETTERS, 2015, 15 (09) : 5919 - 5925
  • [26] Mixed 2D-3D information for pose estimation and face recognition
    Rama, Antonio
    Tarres, Francesc
    Onofrio, Davide
    Tubaro, Stefano
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 1609 - 1612
  • [27] 3D hand pose reconstruction with ISOSOM
    Guan, HY
    Turk, M
    ADVANCES IN VISUAL COMPUTING, PROCEEDINGS, 2005, 3804 : 630 - 635
  • [28] 3D Human Pose Estimation via Deep Learning from 2D annotations
    Brau, Ernesto
    Jiang, Hao
    PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, : 582 - 591
  • [29] Learning Typical 3D Representation from a Single 2D Correspondence Using 2D-3D Transformation Network
    Ul Islam, Naeem
    Lee, Sukhan
    PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM) 2019, 2019, 935 : 440 - 455
  • [30] A Pose-Aware Auto-Augmentation Framework for 3D Human Pose and Shape Estimation from Partial Point Clouds
    Wang, Kangkan
    Yin, Sliihao
    Fang, Chenghao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VI, 2025, 15036 : 64 - 79