Temporal Consistency Loss for High Resolution Textured and Clothed 3D Human Reconstruction from Monocular Video

被引:2
|
作者
Caliskan, Akin [1 ]
Mustafa, Armin [1 ]
Hilton, Adrian [1 ]
机构
[1] Univ Surrey, CVSSP, Guildford, Surrey, England
基金
英国工程与自然科学研究理事会;
关键词
D O I
10.1109/CVPRW53098.2021.00197
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel method to learn temporally consistent 3D reconstruction of clothed people from a monocular video. Recent methods for 3D human reconstruction from monocular video using volumetric, implicit or parametric human shape models, produce per frame reconstructions giving temporally inconsistent output and limited performance when applied to video. In this paper we introduce an approach to learn temporally consistent features for textured reconstruction of clothed 3D human sequences from monocular video by proposing two advances: a novel temporal consistency loss function; and hybrid representation learning for implicit 3D reconstruction from 2D images and coarse 3D geometry. The proposed advances improve the temporal consistency and accuracy of both the 3D reconstruction and texture prediction from a monocular video. Comprehensive comparative performance evaluation on images of people demonstrates that the proposed method significantly outperforms the state-of-the-art learning-based single image 3D human shape estimation approaches achieving significant improvement of reconstruction accuracy, completeness, quality and temporal consistency.
引用
收藏
页码:1780 / 1790
页数:11
相关论文
共 50 条
  • [1] MotioNet: 3D Human motion reconstruction from monocular video with skeleton consistency
    Shi, Mingyi
    Aberman, Kfir
    Aristidou, Andreas
    Komura, Taku
    Lischinski, Dani
    Cohen-Or, Daniel
    Chen, Baoquan
    [J]. ACM Transactions on Graphics, 2020, 40 (01):
  • [2] MotioNet: 3D Human Motion Reconstruction from Monocular Video with Skeleton Consistency
    Shi, Mingyi
    Aberman, Kfir
    Aristidou, Andreas
    Komura, Taku
    Lischinski, Dani
    Cohen-Or, Daniel
    Chen, Baoquan
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2021, 40 (01):
  • [3] 3D Clothed Human Reconstruction in the Wild
    Moon, Gyeongsik
    Nam, Hyeongjin
    Shiratori, Takaaki
    Lee, Kyoung Mu
    [J]. COMPUTER VISION - ECCV 2022, PT II, 2022, 13662 : 184 - 200
  • [4] 3D Reconstruction of Human Motion and Skeleton from Uncalibrated Monocular Video
    Chen, Yen-Lin
    Chai, Jinxiang
    [J]. COMPUTER VISION - ACCV 2009, PT I, 2010, 5994 : 71 - 82
  • [5] Towards robust 3D reconstruction of human motion from monocular video
    Chen, Cheng
    Zhuang, Yueting
    Xiao, Jun
    [J]. Advances in Artificial Reality and Tele-Existence, Proceedings, 2006, 4282 : 594 - 603
  • [6] Monocular 3D Reconstruction of Locally Textured Surfaces
    Varol, Aydin
    Shaji, Appu
    Salzmann, Mathieu
    Fua, Pascal
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (06) : 1118 - 1130
  • [7] Innovative AI techniques for photorealistic 3D clothed human reconstruction from monocular images or videos: a survey
    Yang, Shuo
    Gu, Xiaoling
    Kuang, Zhenzhong
    Qin, Feiwei
    Wu, Zizhao
    [J]. VISUAL COMPUTER, 2024,
  • [8] LayerNet: High-Resolution Semantic 3D Reconstruction of Clothed People
    Corona, Enric
    Alenya, Guillem
    Pons-Moll, Gerard
    Moreno-Noguer, Francesc
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (02) : 1257 - 1272
  • [9] Corrective 3D Reconstruction of Lips from Monocular Video
    Garrido, Pablo
    Zollhoefer, Michael
    Wu, Chenglei
    Bradley, Derek
    Perez, Patrick
    Beeler, Thabo
    Theobalt, Christian
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (06):
  • [10] 3D reconstruction of human skeleton from single images or monocular video sequences
    Remondino, F
    Roditakis, A
    [J]. PATTERN RECOGNITION, PROCEEDINGS, 2003, 2781 : 100 - 107