Real-time speech-driven 3D face animation

Cited by: 4
Authors
Hong, PY [1]
Wen, Z [1]
Huang, TS [1]
Shum, HY [1]
Affiliations
[1] Univ Illinois, Coordinated Sci Lab, Urbana, IL 61801 USA
Keywords
DOI
10.1109/TDPVT.2002.1024147
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we present an approach for real-time speech-driven 3D face animation using neural networks. We first analyze a 3D facial movement sequence of a talking subject and learn a quantitative representation of the facial deformations, called the 3D Motion Units (MUs). A 3D facial deformation can be approximated by a linear combination of the MUs weighted by the MU parameters (MUPs), which serve as the visual features of the facial deformation. The facial movement sequence is synchronized with an audio track. The audio track is digitized and the audio features of each frame are calculated. A real-time audio-to-MUP mapping is constructed by training a set of neural networks on the calculated audio-visual features. The audio-visual features are divided into several groups based on the audio features, and one neural network is trained per group to map the audio features to the corresponding MUPs. Given a new audio feature vector, we first classify it into one of the groups and then use the corresponding neural network to map it to MUPs, which are used for face animation. A quantitative evaluation shows the effectiveness of the proposed approach.
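The runtime path described in the abstract (classify the audio feature vector into a group, apply that group's network to obtain MUPs, then combine the MUs linearly) can be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation: the array sizes, the nearest-centroid classifier, the one-hidden-layer network shape, the random (untrained) weights, and the addition of a neutral face shape are all assumptions introduced here, and every name in the code is hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes, chosen for illustration only (not taken from the paper).
N_VERTS, N_MUS, AUDIO_DIM, N_GROUPS, HIDDEN = 20, 4, 12, 3, 8

# Random stand-ins for quantities that would be learned from training data.
neutral = rng.normal(size=3 * N_VERTS)              # assumed neutral face shape
mus = rng.normal(size=(N_MUS, 3 * N_VERTS))         # one Motion Unit per row
centroids = rng.normal(size=(N_GROUPS, AUDIO_DIM))  # one audio centroid per group

# One small network per group; weights are random and untrained.
nets = [
    {
        "W1": rng.normal(size=(AUDIO_DIM, HIDDEN)),
        "b1": np.zeros(HIDDEN),
        "W2": rng.normal(size=(HIDDEN, N_MUS)),
        "b2": np.zeros(N_MUS),
    }
    for _ in range(N_GROUPS)
]


def classify(audio_feat):
    """Return the index of the nearest group centroid (assumed classifier)."""
    dists = np.linalg.norm(centroids - audio_feat, axis=1)
    return int(np.argmin(dists))


def audio_to_mups(audio_feat):
    """Select the group's network and map the audio features to MUPs."""
    group = classify(audio_feat)
    net = nets[group]
    hidden = np.tanh(audio_feat @ net["W1"] + net["b1"])
    return group, hidden @ net["W2"] + net["b2"]


def deform(mups):
    """Neutral shape plus the MUP-weighted linear combination of the MUs."""
    return neutral + mups @ mus


# A feature vector close to the centroid of group 1.
audio_feat = centroids[1] + 0.01 * rng.normal(size=AUDIO_DIM)
group, mups = audio_to_mups(audio_feat)
face = deform(mups)
print(group, mups.shape, face.shape)
```

Splitting the mapping across per-group networks lets each network cover a narrower region of the audio feature space; the per-frame cost is one classification step and one small forward pass.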
Pages: 713-716
Number of pages: 4
Related papers
50 in total
  • [1] Real-time speech-driven face animation with expressions using neural networks
    Hong, PY
    Wen, Z
    Huang, TS
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (04): 916 - 927
  • [2] SYNTHESIZING REAL-TIME SPEECH-DRIVEN FACIAL ANIMATION
    Luo, Changwei
    Yu, Jun
    Wang, Zengfu
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [3] EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation
    Peng, Ziqiao
    Wu, Haoyu
    Song, Zhenbo
    Xu, Hao
    Zhu, Xiangyu
    He, Jun
    Liu, Hongyan
    Fan, Zhaoxin
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 20630 - 20640
  • [4] Real-time speech-driven animation of expressive talking faces
    Liu, Jia
    You, Mingyu
    Chen, Chun
    Song, Mingli
    [J]. INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2011, 40 (04) : 439 - 455
  • [5] Speech-Driven 3D Face Animation with Composite and Regional Facial Movements
    Wu, Haozhe
    Zhou, Songtao
    Jia, Jia
    Xing, Junliang
    Wen, Qi
    Wen, Xiang
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 6822 - 6830
  • [6] Speech-Driven 3D Facial Animation with Mesh Convolution
    Ji, Xuejie
    Su, Zewei
    Dong, Lanfang
    Li, Guoming
    [J]. 2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 14 - 18
  • [7] Speech-driven 3D Facial Animation for Mobile Entertainment
    Yan, Juan
    Xie, Xiang
    Hu, Hao
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2334 - 2337
  • [8] Imitator: Personalized Speech-driven 3D Facial Animation
    Thambiraja, Balamurugan
    Habibie, Ikhsanul
    Aliakbarian, Sadegh
    Cosker, Darren
    Theobalt, Christian
    Thies, Justus
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 20564 - 20574
  • [9] HMM BASED SPEECH-DRIVEN 3D TONGUE ANIMATION
    Luo, Changwei
    Yu, Jun
    Li, Xian
    Zhang, Leilei
    [J]. 2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 4377 - 4381
  • [10] FaceFormer: Speech-Driven 3D Facial Animation with Transformers
    Fan, Yingruo
    Lin, Zhaojiang
    Saito, Jun
    Wang, Wenping
    Komura, Taku
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18749 - 18758