Talking Head Generation Based on 3D Morphable Facial Model

被引：1

作者：

Shen, Hsin-Yu ^{[1
]}

Tsai, Wen-Jiin ^{[1
]}

机构：

[1] Natl Yang Ming Chiao Tung Univ, Dept Comp Sci, Hsinchu, Taiwan

来源：

2024 PICTURE CODING SYMPOSIUM, PCS 2024 | 2024年

关键词：

talking-head generation; 3DMM; image-to-image translation; self-attention; deep learning;

D O I：

10.1109/PCS60826.2024.10566437

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents a framework for one-shot talking-head video generation which takes a single person image and audio clips as input and synthesizes photo-realistic videos with natural head-poses and lip motion synced to the driving audio. The main idea behind this framework is to use 3D Morphable Model (3DMM) parameters as intermediate representation in generating the videos. We design an Expression Predictor and a Head Pose Predictor to predict facial expression and head-pose parameters from audio, respectively, and adopt a 3DMM model to extract identity and texture parameters from the reference image. With these parameters, facial images are rendered as an auxiliary to guide video generation. Compared to widely used facial landmarks, 3DMM parameters are more powerful in representing facial details. Experimental results show that our method can generate realistic talking-head videos and outperform many state-of-the-art methods.

引用

页数：5

共 50 条

[41] A Biophysical 3D Morphable Model of Face Appearance
Alotaibi, Sarah
Smith, William A. P.
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 824 - 832
[42] Gaussian mixture 3D morphable face model
Koppen, Paul
Feng, Zhen-Hua
Kittler, Josef
Awais, Muhammad
Christmas, William
Wu, Xiao-Jun
Yin, He-Feng
PATTERN RECOGNITION, 2018, 74 : 617 - 628
[43] Fitting a morphable model to 3D scans of faces
Blanz, Volker
Scherbaum, Kristina
Seidel, Hans-Peter
2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, : 1562 - 1569
[44] Inverse Rendering of Faces with a 3D Morphable Model
Aldrian, Oswald
Smith, William A. P.
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (05) : 1080 - 1093
[45] An improved morphable model for 3D face synthesis
Hu, YL
Yin, BC
Cheng, SQ
Gu, CL
PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 4362 - 4367
[46] Unsupervised Training for 3D Morphable Model Regression
Genova, Kyle
Cole, Forrester
Maschinot, Aaron
Sarna, Aaron
Vlasic, Daniel
Freeman, William T.
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8377 - 8386
[47] Resolution-Aware 3D Morphable Model
Hu, Guosheng
Chan, Chi Ho
Kittler, Josef
Christmas, William
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
[48] Efficient 3D morphable face model fitting
Hu, Guosheng
Yan, Fei
Kittler, Josef
Christmas, William
Chan, Chi Ho
Feng, Zhenhua
Huber, Patrik
PATTERN RECOGNITION, 2017, 67 : 366 - 379
[49] 3D face reconstruction based on canonical correlation analysis and morphable model
Hu, Yongli
Ge, Yun
Sun, Yanfeng
Yin, Baocai
Journal of Computational Information Systems, 2014, 10 (06): : 2405 - 2415
[50] Efficient Emotional Talking Head Generation via Dynamic 3D Gaussian Rendering
Liu, Tiantian
Li, Jiahe
Bai, Xiao
Zheng, Jin
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VI, 2025, 15036 : 80 - 94

← 1 2 3 4 5 →