Text2Video: Text-driven facial animation using MPEG-4

被引：0

作者：

Rurainsky, J ^{[1
]}

Eisert, P ^{[1
]}

机构：

[1] Heinrich Hertz Inst Nachrichtentech Berlin GmbH, Fraunhofer Inst Telecommun, Image Proc Dept, D-10587 Berlin, Germany

来源：

VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2005, PTS 1-4 | 2005年 / 5960卷

关键词：

MPEG-4; facial animation; text-driven animation; SMS; MMS;

D O I：

10.1117/12.631413

中图分类号：

TB8 [摄影技术];

学科分类号：

0804 ;

摘要：

We present a complete system for the automatic creation of talking head video sequences from text messages. Our system converts the text into MPEG-4 Facial Animation Parameters and synthetic voice. A user selected 3D character will perform lip movements synchronized to the speech data. The 3D models created from a single image vary from realistic people to cartoon characters. A voice selection for different languages and gender as well as a pitch shift component enables a personalization of the animation. The animation can be shown on different displays and devices ranging from 3GPP players on mobile phones to real-time 3D render engines. Therefore, our system can be used in mobile communication for the conversion of regular SMS messages to MMS animations.

引用

页码：492 / 500

页数：9

共 50 条

[21] Inner lip feature extraction for MPEG-4 facial animation
Wu, ZL
Aleksic, PS
[J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 633 - 636
[22] Text2Mesh: Text-Driven Neural Stylization for Meshes
Michel, Oscar
Bar-On, Roi
Liu, Richard
Benaim, Sagie
Hanocka, Rana
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13482 - 13492
[23] Dynamic Facial Expression Analysis and Synthesis With MPEG-4 Facial Animation Parameters
Zhang, Yongmian
Ji, Qiang
Zhu, Zhiwei
Yi, Beifang
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2008, 18 (10) : 1383 - 1396
[24] Text2Palette: Text-Driven Color Palette Generation Using Internet Images
Lei, Kaixiang
Liu, Zhengning
Xu, Kun
[J]. Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2021, 33 (05): : 694 - 703
[25] Text2Video: An End-to-end Learning Framework for Expressing Text With Videos
Yang, Xiaoshan
Zhang, Tianzhu
Xu, Changsheng
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (09) : 2360 - 2370
[26] Text-driven 3D Avatar Animation with Emotional and Expressive Behaviors
Hu, Li
Qi, Jinwei
Zhang, Bang
Pan, Pan
Xu, Yinghui
[J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2816 - 2818
[27] Automatic Facial Animation Parameters extraction in MPEG-4 visual communication
Yang, CG
Gong, WW
Yu, L
[J]. VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2002, PTS 1 AND 2, 2002, 4671 : 396 - 405
[28] Compression of MPEG-4 facial animation parameters for transmission of talking heads
Tao, H
Chen, HH
Wu, W
Huang, TS
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1999, 9 (02) : 264 - 276
[29] An MPEG-4 Quadric-based LoD Simplification for Facial Animation
Duarte, Ricardo Leandro Parreira
El Rhalibi, Abdennour
Carter, Christopher
Cooper, Simon
Merabti, Madjid
[J]. 2012 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2012, : 743 - 748
[30] Realization of 3-D facial animation based on MPEG-4
Jiang, Xiu-Feng
Pu, Xiao-Rong
Zhang, Yi
[J]. Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2007, 36 (03): : 569 - 572

← 1 2 3 4 5 →