Visual speech synthesis based on learning model

被引:0
|
作者
Sun, Yan-Feng [1 ]
Lin, Xian-Ping [1 ]
Yin, Bao-Cai [1 ]
Jia, Xi-Bin [1 ]
机构
[1] Beijing Municipal Key Laboratory of Multimedia and Intelligent Software Technology, College of Computer Sciences, Beijing University of Technology, Beijing 100124, China
关键词
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
页码:702 / 707
相关论文
共 50 条
  • [41] Lip contour description based on Fourier descriptors in speech synthesis system driven by visual-speech
    School of Precision Instrument and Opto-Electronics Engineering, Tianjin University, Tianjin 300072, China
    Yi Qi Yi Biao Xue Bao, 2007, 8 (1464-1468):
  • [42] Learning fuzzy rules for visual speech recognition
    Anwar, MA
    Baldwin, JF
    Martin, TP
    ADAPTIVE MULTIMEDIA RETRIEVAL, 2004, 3094 : 164 - 175
  • [43] Deep Learning for Visual Speech Analysis: A Survey
    Sheng, Changchong
    Kuang, Gangyao
    Bai, Liang
    Hou, Chenping
    Guo, Yulan
    Xu, Xin
    Pietikainen, Matti
    Liu, Li
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (09) : 6001 - 6022
  • [44] An Acoustic Model For English Speech Recognition Based On Deep Learning
    Ling, Zhang
    2019 11TH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA 2019), 2019, : 610 - 614
  • [45] Deep learning based Affective Model for Speech Emotion Recognition
    Zhou, Xi
    Guo, Junqi
    Bie, Rongfang
    2016 INT IEEE CONFERENCES ON UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING AND COMMUNICATIONS, CLOUD AND BIG DATA COMPUTING, INTERNET OF PEOPLE, AND SMART WORLD CONGRESS (UIC/ATC/SCALCOM/CBDCOM/IOP/SMARTWORLD), 2016, : 841 - 846
  • [46] The Study of Speech Training and Learning Method Based on DIVA Model
    Zhang Shaobai
    Hu Chenhong
    2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015, : 3890 - 3895
  • [47] Towards Model Compression for Deep Learning Based Speech Enhancement
    Tan, Ke
    Wang, DeLiang
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 1785 - 1794
  • [48] The Role of Visual Speech Information in Supporting Perceptual Learning of Degraded Speech
    Wayne, Rachel V.
    Johnsrude, Ingrid S.
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-APPLIED, 2012, 18 (04) : 419 - 435
  • [49] A Visual Speech Intelligibility Benefit Based on Speech Rhythm
    Kawase, Saya
    Davis, Chris
    Kim, Jeesun
    BRAIN SCIENCES, 2023, 13 (06)
  • [50] Text-to-visual speech synthesis based on parameter generation from HMM
    Masuko, T
    Kobayashi, T
    Tamura, M
    Masubuchi, J
    Tokuda, K
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 3745 - 3748