Text2Video: Text-driven facial animation using MPEG-4

被引:0
|
作者
Rurainsky, J [1 ]
Eisert, P [1 ]
机构
[1] Heinrich Hertz Inst Nachrichtentech Berlin GmbH, Fraunhofer Inst Telecommun, Image Proc Dept, D-10587 Berlin, Germany
关键词
MPEG-4; facial animation; text-driven animation; SMS; MMS;
D O I
10.1117/12.631413
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
We present a complete system for the automatic creation of talking head video sequences from text messages. Our system converts the text into MPEG-4 Facial Animation Parameters and synthetic voice. A user selected 3D character will perform lip movements synchronized to the speech data. The 3D models created from a single image vary from realistic people to cartoon characters. A voice selection for different languages and gender as well as a pitch shift component enables a personalization of the animation. The animation can be shown on different displays and devices ranging from 3GPP players on mobile phones to real-time 3D render engines. Therefore, our system can be used in mobile communication for the conversion of regular SMS messages to MMS animations.
引用
收藏
页码:492 / 500
页数:9
相关论文
共 50 条
  • [21] Inner lip feature extraction for MPEG-4 facial animation
    Wu, ZL
    Aleksic, PS
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 633 - 636
  • [22] Text2Mesh: Text-Driven Neural Stylization for Meshes
    Michel, Oscar
    Bar-On, Roi
    Liu, Richard
    Benaim, Sagie
    Hanocka, Rana
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13482 - 13492
  • [23] Dynamic Facial Expression Analysis and Synthesis With MPEG-4 Facial Animation Parameters
    Zhang, Yongmian
    Ji, Qiang
    Zhu, Zhiwei
    Yi, Beifang
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2008, 18 (10) : 1383 - 1396
  • [24] Text2Palette: Text-Driven Color Palette Generation Using Internet Images
    Lei, Kaixiang
    Liu, Zhengning
    Xu, Kun
    [J]. Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2021, 33 (05): : 694 - 703
  • [25] Text2Video: An End-to-end Learning Framework for Expressing Text With Videos
    Yang, Xiaoshan
    Zhang, Tianzhu
    Xu, Changsheng
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (09) : 2360 - 2370
  • [26] Text-driven 3D Avatar Animation with Emotional and Expressive Behaviors
    Hu, Li
    Qi, Jinwei
    Zhang, Bang
    Pan, Pan
    Xu, Yinghui
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2816 - 2818
  • [27] Automatic Facial Animation Parameters extraction in MPEG-4 visual communication
    Yang, CG
    Gong, WW
    Yu, L
    [J]. VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2002, PTS 1 AND 2, 2002, 4671 : 396 - 405
  • [28] Compression of MPEG-4 facial animation parameters for transmission of talking heads
    Tao, H
    Chen, HH
    Wu, W
    Huang, TS
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1999, 9 (02) : 264 - 276
  • [29] An MPEG-4 Quadric-based LoD Simplification for Facial Animation
    Duarte, Ricardo Leandro Parreira
    El Rhalibi, Abdennour
    Carter, Christopher
    Cooper, Simon
    Merabti, Madjid
    [J]. 2012 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2012, : 743 - 748
  • [30] Realization of 3-D facial animation based on MPEG-4
    Jiang, Xiu-Feng
    Pu, Xiao-Rong
    Zhang, Yi
    [J]. Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2007, 36 (03): : 569 - 572