Talking Head Generation Based on 3D Morphable Facial Model

Cited by: 1

Authors:

Shen, Hsin-Yu [1]

Tsai, Wen-Jiin [1]

Affiliations:

[1] Natl Yang Ming Chiao Tung Univ, Dept Comp Sci, Hsinchu, Taiwan

Source:

2024 PICTURE CODING SYMPOSIUM, PCS 2024 | 2024

Keywords:

talking-head generation; 3DMM; image-to-image translation; self-attention; deep learning

DOI:

10.1109/PCS60826.2024.10566437

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

This paper presents a framework for one-shot talking-head video generation that takes a single image of a person and audio clips as input and synthesizes photo-realistic videos with natural head poses and lip motion synchronized to the driving audio. The main idea is to use 3D Morphable Model (3DMM) parameters as an intermediate representation for video generation. We design an Expression Predictor and a Head Pose Predictor to predict facial expression and head-pose parameters from audio, respectively, and adopt a 3DMM to extract identity and texture parameters from the reference image. Facial images rendered from these parameters serve as auxiliary guidance for video generation. Compared with widely used facial landmarks, 3DMM parameters represent facial details more effectively. Experimental results show that our method generates realistic talking-head videos and outperforms many state-of-the-art methods.
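
The role of 3DMM parameters as an intermediate representation can be illustrated with a minimal sketch of the standard linear 3DMM formulation, in which a face shape is the mean shape plus identity and expression offsets, followed by a rigid head pose. This is not the authors' implementation: the bases below are random toy data, the function names are hypothetical, and the per-frame expression and yaw values merely stand in for the outputs of the Expression Predictor and Head Pose Predictor. Texture parameters and image rendering are omitted.

```python
import numpy as np

def reconstruct_shape(mean_shape, id_basis, exp_basis, id_coeffs, exp_coeffs):
    """Linear 3DMM: mean + identity offset + expression offset, as (N, 3) vertices."""
    flat = mean_shape + id_basis @ id_coeffs + exp_basis @ exp_coeffs
    return flat.reshape(-1, 3)

def yaw_rotation(angle_rad):
    """Rotation about the vertical (y) axis; a one-angle stand-in for a full head pose."""
    c, s = np.cos(angle_rad), np.sin(angle_rad)
    return np.array([[c, 0.0, s], [0.0, 1.0, 0.0], [-s, 0.0, c]])

def apply_pose(vertices, rotation, translation):
    """Apply a rigid transform to every vertex (row vectors, hence rotation.T)."""
    return vertices @ rotation.T + translation

# Toy model (assumption): real 3DMMs have tens of thousands of vertices
# and learned bases; these random arrays only demonstrate the shapes.
rng = np.random.default_rng(0)
n_vertices, n_id, n_exp, n_frames = 4, 3, 2, 5
mean_shape = rng.normal(size=3 * n_vertices)
id_basis = rng.normal(size=(3 * n_vertices, n_id))
exp_basis = rng.normal(size=(3 * n_vertices, n_exp))

# Identity is fixed for the whole video (extracted once from the reference image).
id_coeffs = rng.normal(size=n_id)
# Expression and pose vary per frame (in the paper, predicted from audio).
exp_per_frame = rng.normal(size=(n_frames, n_exp))
yaw_per_frame = np.linspace(-0.2, 0.2, n_frames)

frames = [
    apply_pose(
        reconstruct_shape(mean_shape, id_basis, exp_basis, id_coeffs, exp),
        yaw_rotation(yaw),
        np.zeros(3),
    )
    for exp, yaw in zip(exp_per_frame, yaw_per_frame)
]
print(len(frames), frames[0].shape)
```

The sketch separates what stays constant across frames (identity) from what changes (expression and pose), which is the property that lets audio drive only the time-varying parameters.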

Pages: 5
