Corrective 3D Reconstruction of Lips from Monocular Video

被引:21
|
作者
Garrido, Pablo [1 ]
Zollhoefer, Michael [1 ]
Wu, Chenglei [2 ]
Bradley, Derek [3 ]
Perez, Patrick [4 ]
Beeler, Thabo [3 ]
Theobalt, Christian [1 ]
机构
[1] Max Planck Inst Informat, Saarbrucken, Germany
[2] Swiss Fed Inst Technol, Zurich, Switzerland
[3] Disney Res, Zurich, Switzerland
[4] Technicolor, Cesson Sevigne, France
来源
ACM TRANSACTIONS ON GRAPHICS | 2016年 / 35卷 / 06期
基金
欧洲研究理事会;
关键词
Lip Shape Reconstruction; Radial Basis Function Networks; Face Modeling; Facial Performance Capture; MOTION CAPTURE; ACCURATE; GEOMETRY; MODEL;
D O I
10.1145/2980179.2982419
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In facial animation, the accurate shape and motion of the lips of virtual humans is of paramount importance, since subtle nuances in mouth expression strongly influence the interpretation of speech and the conveyed emotion. Unfortunately, passive photometric reconstruction of expressive lip motions, such as a kiss or rolling lips, is fundamentally hard even with multi-view methods in controlled studios. To alleviate this problem, we present a novel approach for fully automatic reconstruction of detailed and expressive lip shapes along with the dense geometry of the entire face, from just monocular RGB video. To this end, we learn the difference between inaccurate lip shapes found by a state-of-the-art monocular facial performance capture approach, and the true 3D lip shapes reconstructed using a high-quality multi-view system in combination with applied lip tattoos that are easy to track. A robust gradient domain regressor is trained to infer accurate lip shapes from coarse monocular reconstructions, with the additional help of automatically extracted inner and outer 2D lip contours. We quantitatively and qualitatively show that our monocular approach reconstructs higher quality lip shapes, even for complex shapes like a kiss or lip rolling, than previous monocular approaches. Furthermore, we compare the performance of person-specific and multi-person generic regression strategies and show that our approach generalizes to new individuals and general scenes, enabling high-fidelity reconstruction even from commodity video footage.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Reconstruction of Personalized 3D Face Rigs from Monocular Video
    Garrido, Pablo
    Zollhoefer, Michael
    Casas, Dan
    Valgaerts, Levi
    Varanasi, Kiran
    Perez, Patrick
    Theobalt, Christian
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (03):
  • [2] Joint Semantic Segmentation and 3D Reconstruction from Monocular Video
    Kundu, Abhijit
    Li, Yin
    Dellaert, Frank
    Li, Fuxin
    Rehg, James M.
    [J]. COMPUTER VISION - ECCV 2014, PT VI, 2014, 8694 : 703 - 718
  • [3] 3D Reconstruction of Human Motion and Skeleton from Uncalibrated Monocular Video
    Chen, Yen-Lin
    Chai, Jinxiang
    [J]. COMPUTER VISION - ACCV 2009, PT I, 2010, 5994 : 71 - 82
  • [4] Towards robust 3D reconstruction of human motion from monocular video
    Chen, Cheng
    Zhuang, Yueting
    Xiao, Jun
    [J]. Advances in Artificial Reality and Tele-Existence, Proceedings, 2006, 4282 : 594 - 603
  • [5] 3D scene reconstruction from monocular spherical video with motion parallax
    Tanaka, Kenji
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY ADJUNCT (ISMAR-ADJUNCT 2022), 2022, : 191 - 197
  • [6] NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video
    Sun, Jiaming
    Xie, Yiming
    Chen, Linghao
    Zhou, Xiaowei
    Bao, Hujun
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15593 - 15602
  • [7] LivePose: Online 3D Reconstruction from Monocular Video with Dynamic Camera Poses
    Stier, Noah
    Angles, Baptiste
    Yang, Liang
    Yan, Yajie
    Colburn, Alex
    Chuang, Ming
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 7887 - 7896
  • [8] MotioNet: 3D Human motion reconstruction from monocular video with skeleton consistency
    Shi, Mingyi
    Aberman, Kfir
    Aristidou, Andreas
    Komura, Taku
    Lischinski, Dani
    Cohen-Or, Daniel
    Chen, Baoquan
    [J]. ACM Transactions on Graphics, 2020, 40 (01):
  • [9] 3D Reconstruction of Non-Rigid Surfaces from Realistic Monocular Video
    Sepehrinour, Maryam
    Kasaei, Shohreh
    [J]. 2015 9TH IRANIAN CONFERENCE ON MACHINE VISION AND IMAGE PROCESSING (MVIP), 2015, : 199 - 202
  • [10] 3D reconstruction of human skeleton from single images or monocular video sequences
    Remondino, F
    Roditakis, A
    [J]. PATTERN RECOGNITION, PROCEEDINGS, 2003, 2781 : 100 - 107