Audio-driven emotional speech animation for interactive virtual characters

Cited by: 13
Authors
Charalambous, Constantinos [1]
Yumak, Zerrin [1]
van der Stappen, A. Frank [1]
Affiliations
[1] Univ Utrecht, Dept Informat & Comp Sci, NL-3512 JE Utrecht, Netherlands
Funding
EU Horizon 2020
Keywords
audio-driven speech animation; emotional speech; procedural animation
DOI
10.1002/cav.1892
Chinese Library Classification (CLC) number
TP31 [Computer software]
Subject classification code
081202; 0835
Abstract
We present a procedural audio-driven speech animation method for interactive virtual characters. Given any audio with its respective speech transcript, we automatically generate lip-synchronized speech animation that can drive any three-dimensional virtual character. The realism of the animation is enhanced by studying the emotional features of the audio signal and their effect on mouth movements. We also propose a coarticulation model that takes into account various linguistic rules. The generated animation is configurable: the user can modify control parameters such as viseme types, intensities, and coarticulation curves. We compare our approach against two lip-synchronized speech animation generators. Our results show that our method surpasses them in terms of user preference.
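
The control parameters named in the abstract (viseme types, intensities, coarticulation curves) can be pictured with a minimal sketch. This is not the authors' method, which this record does not describe in detail. The phoneme-to-viseme table, the trapezoidal blend curve, the single scalar `intensity` standing in for an emotion-derived value, and all function names are assumptions made for illustration only.

```python
from dataclasses import dataclass

# Hypothetical phoneme-to-viseme table (assumption; the paper's viseme set is not listed here).
PHONEME_TO_VISEME = {
    "p": "PP", "b": "PP", "m": "PP",
    "f": "FF", "v": "FF",
    "aa": "AA", "iy": "IY",
    "sil": "REST",
}


@dataclass
class Phoneme:
    symbol: str   # phoneme label, e.g. from a forced aligner
    start: float  # start time in seconds
    end: float    # end time in seconds


def viseme_weight(t, ph, blend=0.05, intensity=1.0):
    """Trapezoidal curve: ramp up for `blend` s before start, hold, ramp down after end."""
    if t < ph.start - blend or t > ph.end + blend:
        return 0.0
    if t < ph.start:
        return intensity * (t - (ph.start - blend)) / blend
    if t > ph.end:
        return intensity * ((ph.end + blend) - t) / blend
    return intensity


def sample_frame(t, phonemes, blend=0.05, intensity=1.0):
    """Return viseme weights at time t; overlapping ramps give a crude coarticulation blend."""
    weights = {}
    for ph in phonemes:
        viseme = PHONEME_TO_VISEME.get(ph.symbol, "REST")
        w = viseme_weight(t, ph, blend, intensity)
        if w > 0.0:
            weights[viseme] = max(weights.get(viseme, 0.0), w)
    total = sum(weights.values())
    if total > 1.0:
        # Normalize so the combined weights never exceed 1.
        weights = {k: v / total for k, v in weights.items()}
    return weights


# Timings as they might come from aligning the audio with its transcript (made-up values).
phonemes = [Phoneme("m", 0.0, 0.1), Phoneme("aa", 0.1, 0.3)]
frame_mid_m = sample_frame(0.02, phonemes)
frame_boundary = sample_frame(0.1, phonemes)
frame_soft = sample_frame(0.02, phonemes, intensity=0.5)
```

In this sketch, widening `blend` makes neighboring visemes overlap more, and lowering `intensity` reduces how far the mouth shape moves. A rule-based coarticulation model of the kind the abstract mentions would replace the fixed trapezoid with curves chosen per phoneme context.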
Pages: 11