Corrective 3D Reconstruction of Lips from Monocular Video

Cited by: 21

Authors:

Garrido, Pablo [1]

Zollhoefer, Michael [1]

Wu, Chenglei [2]

Bradley, Derek [3]

Perez, Patrick [4]

Beeler, Thabo [3]

Theobalt, Christian [1]

Affiliations:

[1] Max Planck Inst Informat, Saarbrucken, Germany

[2] Swiss Fed Inst Technol, Zurich, Switzerland

[3] Disney Res, Zurich, Switzerland

[4] Technicolor, Cesson Sevigne, France

Source:

ACM Transactions on Graphics | 2016 / Vol. 35 / Issue 6

Funding:

European Research Council;

Keywords:

Lip Shape Reconstruction; Radial Basis Function Networks; Face Modeling; Facial Performance Capture; Motion Capture; Accurate; Geometry; Model;

DOI:

10.1145/2980179.2982419

Chinese Library Classification (CLC) number:

TP31 [Computer Software];

Discipline classification code:

081202; 0835;

Abstract:

In facial animation, the accurate shape and motion of the lips of virtual humans is of paramount importance, since subtle nuances in mouth expression strongly influence the interpretation of speech and the conveyed emotion. Unfortunately, passive photometric reconstruction of expressive lip motions, such as a kiss or rolling lips, is fundamentally hard even with multi-view methods in controlled studios. To alleviate this problem, we present a novel approach for fully automatic reconstruction of detailed and expressive lip shapes along with the dense geometry of the entire face, from just monocular RGB video. To this end, we learn the difference between inaccurate lip shapes found by a state-of-the-art monocular facial performance capture approach, and the true 3D lip shapes reconstructed using a high-quality multi-view system in combination with applied lip tattoos that are easy to track. A robust gradient domain regressor is trained to infer accurate lip shapes from coarse monocular reconstructions, with the additional help of automatically extracted inner and outer 2D lip contours. We quantitatively and qualitatively show that our monocular approach reconstructs higher quality lip shapes, even for complex shapes like a kiss or lip rolling, than previous monocular approaches. Furthermore, we compare the performance of person-specific and multi-person generic regression strategies and show that our approach generalizes to new individuals and general scenes, enabling high-fidelity reconstruction even from commodity video footage.


Pages: 11
