Temporal Consistency Loss for High Resolution Textured and Clothed 3D Human Reconstruction from Monocular Video

Cited by: 2
Authors
Caliskan, Akin [1]
Mustafa, Armin [1]
Hilton, Adrian [1]
Affiliations
[1] Univ Surrey, CVSSP, Guildford, Surrey, England
Funding
UK Engineering and Physical Sciences Research Council (EPSRC)
Keywords
DOI
10.1109/CVPRW53098.2021.00197
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present a novel method to learn temporally consistent 3D reconstruction of clothed people from a monocular video. Recent methods for 3D human reconstruction from monocular video, using volumetric, implicit or parametric human shape models, produce per-frame reconstructions, giving temporally inconsistent output and limited performance when applied to video. In this paper we introduce an approach to learn temporally consistent features for textured reconstruction of clothed 3D human sequences from monocular video by proposing two advances: a novel temporal consistency loss function, and hybrid representation learning for implicit 3D reconstruction from 2D images and coarse 3D geometry. The proposed advances improve the temporal consistency and accuracy of both the 3D reconstruction and texture prediction from a monocular video. Comprehensive comparative performance evaluation on images of people demonstrates that the proposed method significantly outperforms state-of-the-art learning-based single-image 3D human shape estimation approaches, improving reconstruction accuracy, completeness, quality and temporal consistency.
Pages: 1780 - 1790
Number of pages: 11
Related papers
50 in total
  • [41] 3D Human Motion Reconstruction in Unity with Monocular Camera
    Chen, Tai-Wei
    Lin, Wei-Liang
    2020 17TH INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC 2020), 2020, : 191 - 192
  • [42] NeuralRecon: Real-Time Coherent 3D Scene Reconstruction from Monocular Video
    Chen X.
    Sun J.
    Xie Y.
    Bao H.
    Zhou X.
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 46 (12) : 1 - 14
  • [43] Digital 3D reconstruction of human orbitae from high resolution serial sections
    van Zwieten, J
    Botha, CP
    Willekens, B
    Schutte, S
    Post, FH
    Simonsz, HJ
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2005, 46
  • [44] DENSE OPTICAL FLOW VARIATION BASED 3D FACE RECONSTRUCTION FROM MONOCULAR VIDEO
    Wang, Shan
    Shen, Xukun
    Liu, Jiaqing
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2665 - 2669
  • [45] Model-based human gait tracking, 3D reconstruction and recognition in uncalibrated monocular video
    Adeli-Mosabbeb, E.
    Fathy, M.
    Zargari, F.
    IMAGING SCIENCE JOURNAL, 2012, 60 (01) : 9 - 28
  • [46] View-Invariant 3D Human Body Pose Reconstruction using a Monocular Video Camera
    Ke, Shian-Ru
    Hwang, Jenq-Neng
    Lan, Kung-Ming
    Wang, Shen-Zheng
    2011 FIFTH ACM/IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED SMART CAMERAS (ICDSC), 2011,
  • [47] Statistical bias in 3-D reconstruction from a monocular video
    Roy-Chowdhury, AK
    Chellappa, R
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2005, 14 (08) : 1057 - 1062
  • [48] Automatic reconstruction of 3D human motion pose from uncalibrated monocular video sequences based on markerless human motion tracking
    Zou, Beiji
    Chen, Shu
    Shi, Cao
    Providence, Umugwaneza Marie
    PATTERN RECOGNITION, 2009, 42 (07) : 1559 - 1571
  • [49] Video Pop-up: Monocular 3D Reconstruction of Dynamic Scenes
    Russell, Chris
    Yu, Rui
    Agapito, Lourdes
    COMPUTER VISION - ECCV 2014, PT VII, 2014, 8695 : 583 - 598
  • [50] Realtime Dynamic 3D Facial Reconstruction for Monocular Video In-the-Wild
    Liu, Shuang
    Wang, Zhao
    Yang, Xiaosong
    Zhang, Jianjun
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 777 - 785