Audio-driven emotional speech animation for interactive virtual characters

被引：13

作者：

Charalambous, Constantinos ^{[1
]}

Yumak, Zerrin ^{[1
]}

van der Stappen, A. Frank ^{[1
]}

机构：

[1] Univ Utrecht, Dept Informat & Comp Sci, NL-3512 JE Utrecht, Netherlands

来源：

COMPUTER ANIMATION AND VIRTUAL WORLDS | 2019年 / 30卷 / 3-4期

基金：

欧盟地平线“2020”;

关键词：

audio-driven speech animation; emotional speech; procedural animation;

D O I：

10.1002/cav.1892

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

We present a procedural audio-driven speech animation method for interactive virtual characters. Given any audio with its respective speech transcript, we automatically generate lip-synchronized speech animation that could drive any three-dimensional virtual character. The realism of the animation is enhanced by studying the emotional features of the audio signal and its effect on mouth movements. We also propose a coarticulation model that takes into account various linguistic rules. The generated animation is configurable by the user by modifying the control parameters, such as viseme types, intensities, and coarticulation curves. We compare our approach against two lip-synchronized speech animation generators. Our results show that our method surpasses them in terms of user preference.

引用

页数：11

共 50 条

[1] VisemeNet: Audio-Driven Animator-Centric Speech Animation
Zhou, Yang
Xu, Zhan
Landreth, Chris
Kalogerakis, Evangelos
Maji, Subhransu
Singh, Karan
[J]. ACM TRANSACTIONS ON GRAPHICS, 2018, 37 (04):
[2] EmoFace: Audio-driven Emotional 3D Face Animation
Liu, Chang
Lin, Qunfen
Zeng, Zijiao
Pan, Ye
[J]. 2024 IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES, VR 2024, 2024, : 387 - 397
[3] Audio2AB: Audio-driven collaborative generation of virtual character animation
Niu, Lichao
Xie, Wenjun
Wang, Dong
Cao, Zhongrui
Liu, Xiaoping
[J]. Virtual Reality and Intelligent Hardware, 2024, 6 (01): : 56 - 70
[4] Audio2AB:Audio-driven collaborative generation of virtual character animation
Lichao NIU
Wenjun XIE
Dong WANG
Zhongrui CAO
Xiaoping LIU
[J]. 虚拟现实与智能硬件(中英文), 2024, 6 (01) : 56 - 70
[5] Audio-Driven Emotional Video Portraits
Ji, Xinya
Zhou, Hang
Wang, Kaisiyuan
Wu, Wayne
Loy, Chen Change
Cao, Xun
Xu, Feng
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 14075 - 14084
[6] Multi-Task Audio-Driven Facial Animation
Kim, Youngsoo
An, Shounan
Jo, Youngbak
Park, Seungje
Kang, Shindong
Oh, Insoo
Kim, Duke Donghyun
[J]. SIGGRAPH '19 - ACM SIGGRAPH 2019 POSTERS, 2019,
[7] Audio-Driven Violin Performance Animation with Clear Fingering and Bowing
Hirata, Asuka
Tanaka, Keitaro
Hamanaka, Masatoshi
Morishima, Shigeo
[J]. PROCEEDINGS OF SIGGRAPH 2022 POSTERS, SIGGRAPH 2022, 2022,
[8] DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation
Shen, Shuai
Zhao, Wenliang
Meng, Zibin
Li, Wanhua
Zhu, Zheng
Zhou, Jie
Lu, Jiwen
[J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 1982 - 1991
[9] Audio-Driven Co-Speech Gesture Video Generation
Liu, Xian
Wu, Qianyi
Zhou, Hang
Du, Yuanqi
Wu, Wayne
Lin, Dahua
Liu, Ziwei
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[10] EmotionGesture: Audio-Driven Diverse Emotional Co-Speech 3D Gesture Generation
Qi, Xingqun
Liu, Chen
Li, Lincheng
Hou, Jie
Xin, Haoran
Yu, Xin
[J]. IEEE Transactions on Multimedia, 2024, 26 : 10420 - 10430

← 1 2 3 4 5 →