Trainable prosodic model for standard Chinese Text-to-Speech system

Author
TAO Jianhua
Funding
National Natural Science Foundation of China
Keywords
Trainable prosodic model for standard Chinese Text-to-Speech system; Text;
DOI
10.15949/j.cnki.0217-9776.2001.03.007
Chinese Library Classification (CLC) number
H017 [Experimental phonetics (instrumental phonetics)]
Discipline classification codes
030303; 0501; 050102
Abstract
Putonghua prosody has a hierarchical structure that is shaped by its linguistic environment. Based on this observation, the paper describes a neural network with specially weighted factors and optimized outputs, and applies it to build the Putonghua prosodic model of a Text-to-Speech (TTS) system. Extensive tests show that this network structure characterizes Putonghua prosody more accurately than traditional models. Training is faster and computational precision is higher, which makes the whole prosodic model more efficient. The paper also stylizes Putonghua syllable pitch contours with SPiS parameters (Syllable Pitch Stylized Parameters) and analyzes their use in adjusting syllable pitch. The results show that the SPiS parameters characterize Putonghua syllable pitch contours effectively and facilitate both the construction of the network model and prosodic control.
Pages: 257-265
Page count: 9