Temporal Consistency Loss for High Resolution Textured and Clothed 3D Human Reconstruction from Monocular Video

Cited by: 2

Authors:

Caliskan, Akin [1]

Mustafa, Armin [1]

Hilton, Adrian [1]

Affiliations:

[1] Univ Surrey, CVSSP, Guildford, Surrey, England

Source:

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021 | 2021

Funding:

Engineering and Physical Sciences Research Council (EPSRC), UK

Keywords:

DOI:

10.1109/CVPRW53098.2021.00197

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

We present a novel method to learn temporally consistent 3D reconstruction of clothed people from a monocular video. Recent methods for 3D human reconstruction from monocular video that use volumetric, implicit or parametric human shape models produce per-frame reconstructions, giving temporally inconsistent output and limited performance when applied to video. In this paper we introduce an approach to learn temporally consistent features for textured reconstruction of clothed 3D human sequences from monocular video by proposing two advances: a novel temporal consistency loss function, and hybrid representation learning for implicit 3D reconstruction from 2D images and coarse 3D geometry. The proposed advances improve the temporal consistency and accuracy of both the 3D reconstruction and texture prediction from a monocular video. Comprehensive comparative performance evaluation on images of people demonstrates that the proposed method significantly outperforms state-of-the-art learning-based single-image 3D human shape estimation approaches, improving reconstruction accuracy, completeness, quality and temporal consistency.

Pages: 1780-1790

Page count: 11
