Automatic selection of visemes for image-based visual speech synthesis

被引:0
|
作者
Yang, J [1 ]
Xiao, J [1 ]
Ritter, M [1 ]
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An image-based approach provides an efficient way for visual speech synthesis. In an image-based visual speech synthesis system, a few lip images, namely visemes, are used for generating an arbitrary new sentence. Many approaches select visemes manually. In this paper we propose a method for a system to automatically select visemes by minimizing the synthesis error The feasibility of the proposed method has been demonstrated by experiments. We describe an application of image-based visual speech synthesis to a multimodal communication agent for a translation task where two people, who speak different languages, cart talk to each other over the Internet.
引用
收藏
页码:1081 / 1084
页数:4
相关论文
共 50 条
  • [1] Visual Speech Synthesis based on Chinese Dynamic Visemes
    Zhao, Hui
    Tang, Chaojing
    2008 INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, VOLS 1-4, 2008, : 139 - 143
  • [2] Visual speech synthesis by morphing visemes
    Ezzat, Tony
    Poggio, Tomaso
    NTT R and D, 2000, 49 (07): : 372 - 375
  • [3] Visual speech synthesis by morphing visemes
    Ezzat, T
    Poggio, T
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2000, 38 (01) : 45 - 57
  • [4] Visual Speech Synthesis by Morphing Visemes
    Tony Ezzat
    Tomaso Poggio
    International Journal of Computer Vision, 2000, 38 : 45 - 57
  • [5] An Image-Based Visual Speech Animation System
    Zhou, Ziheng
    Zhao, Guoying
    Guo, Yimo
    Pietikainen, Matti
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2012, 22 (10) : 1420 - 1432
  • [6] Visual speech synthesis using dynamic visemes, contextual features and DNNs
    Thangthai, Ausdang
    Milner, Ben
    Taylor, Sarah
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2458 - 2462
  • [7] Automatic Visual Fingerprinting for Indoor Image-Based Localization Applications
    Vedadi, Farhang
    Valaee, Shahrokh
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (01): : 305 - 317
  • [8] Image-Based Visual Servoing Control for Automatic Carrier Landing
    Liu, Simin
    Zheng, Zewei
    Guan, Zhiyuan
    Lin, Kang
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 351 - 356
  • [9] The usefulness of the depth images in image-based speech synthesis
    Lee, Ki-Seung
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2023, 42 (01): : 67 - 74
  • [10] Image-based visual hulls
    Matusik, W
    Buehler, C
    Raskar, R
    Gortler, SJ
    McMillan, L
    SIGGRAPH 2000 CONFERENCE PROCEEDINGS, 2000, : 369 - 374