Talking Head Generation Based on 3D Morphable Facial Model

被引:1
|
作者
Shen, Hsin-Yu [1 ]
Tsai, Wen-Jiin [1 ]
机构
[1] Natl Yang Ming Chiao Tung Univ, Dept Comp Sci, Hsinchu, Taiwan
关键词
talking-head generation; 3DMM; image-to-image translation; self-attention; deep learning;
D O I
10.1109/PCS60826.2024.10566437
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a framework for one-shot talking-head video generation which takes a single person image and audio clips as input and synthesizes photo-realistic videos with natural head-poses and lip motion synced to the driving audio. The main idea behind this framework is to use 3D Morphable Model (3DMM) parameters as intermediate representation in generating the videos. We design an Expression Predictor and a Head Pose Predictor to predict facial expression and head-pose parameters from audio, respectively, and adopt a 3DMM model to extract identity and texture parameters from the reference image. With these parameters, facial images are rendered as an auxiliary to guide video generation. Compared to widely used facial landmarks, 3DMM parameters are more powerful in representing facial details. Experimental results show that our method can generate realistic talking-head videos and outperform many state-of-the-art methods.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Pose variant face recognition based on 3D morphable model
    Beijing Municipal Key Laboratory of Multimedia and Intelligent Software Technology, Beijing University of Technology, Beijing 100022, China
    Beijing Gongye Daxue Xuebao J. Beijing Univ. Technol., 2007, 3 (320-325):
  • [32] NLDF: Neural Light Dynamic Fields for 3D Talking Head Generation
    Niu, Guanchen
    Cheng, Songsong
    Li, Teng
    PRICAI 2024: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2025, 15281 : 396 - 402
  • [33] Improved 3D face modeling method based on morphable model
    Wang, Cheng-Zhang
    Yin, Bao-Cai
    Sun, Yan-Feng
    Hu, Yong-Li
    Zidonghua Xuebao/Acta Automatica Sinica, 2007, 33 (03): : 232 - 239
  • [34] Pore-Scale Facial Features Matching Under 3D Morphable Model Constraint
    Zeng, Xianxian
    Li, Dong
    Zhang, Yun
    Lam, Kin-Man
    COMPUTER VISION, PT II, 2017, 772 : 29 - 39
  • [35] A Dictionary Learning-Based 3D Morphable Shape Model
    Ferrari, Claudio
    Lisanti, Giuseppe
    Berretti, Stefano
    Del Bimbo, Alberto
    IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (12) : 2666 - 2679
  • [36] Robust face recognition by an albedo based 3D morphable model
    Hu, Guosheng
    Chan, Chi Ho
    Yan, Fei
    Christmas, William
    Kittler, Josef
    2014 IEEE/IAPR INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB 2014), 2014,
  • [37] Structure-Aware Editable Morphable Model for 3D Facial Detail Animation and Manipulation
    Ling, Jingwang
    Wang, Zhibo
    Lu, Ming
    Wang, Quan
    Qian, Chen
    Xu, Feng
    COMPUTER VISION - ECCV 2022, PT III, 2022, 13663 : 249 - 267
  • [38] Method for Generating Panoramic Textures for 3D Face Reconstruction Based on the 3D Morphable Model
    Hao, Shujia
    Wen, Mingyun
    Cho, Kyungeun
    APPLIED SCIENCES-BASEL, 2022, 12 (19):
  • [39] Pose-Invariant Facial Expression Recognition Based on 3D Face Morphable Model and Domain Adversarial Learning
    Ma, Xiao
    Zhang, Kaige
    Yang, Xuan
    IMAGE AND GRAPHICS, ICIG 2019, PT III, 2019, 11903 : 491 - 502
  • [40] 3D Morphable Face Model for Face Animation
    Ye, Dan
    Fuh, Chiou-Shann
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2020, 20 (01)