Study of the trainable prosodic model for Chinese text to speech system

被引：0

作者：

Tao, J.H. ^{[1
]}

Cai, L.H. ^{[1
]}

Zhao, S.X. ^{[1
]}

Wu, Z.Y. ^{[1
]}

机构：

[1] Dep. of Computer Sci., Tsinghua Univ., Beijing 100084, China

来源：

Shengxue Xuebao/Acta Acustica | 2001年 / 26卷 / 01期

关键词：

Mathematical models - Neural networks;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Mandarin prosody is characterized by its hierarchical structures when it is influenced by the context. An artificial on this, a neural network, with specially weighted factors and optimizing outputs, is described and used to construct the Mandarin prosodic model in a TTS system for Chinese. Extensive tests show that the structure of the artificial neural network characterizes the Mandarin prosody more exactly than traditional models. Learning rate is speeded up and computational precision is improved, which makes the whole prosodic model more efficient. Furthermore, the paper also stylizes the Mandarin syllable pitch contours with SPiS parameters (Syllable Pitch Stylized Parameters), and analyzes them in adjusting the syllable pitch. It shows that the SPiS parameters effectively characterize the Mandarin syllable pitch contours, and facilitate the establishment of the network model and the prosodic controlling.

引用

页码：67 / 72

共 50 条

[1] Trainable prosodic model for standard Chinese Text-to-Speech system
TAO Jianhua
[J]. Chinese Journal of Acoustics, 2001, (03) : 257 - 265
[2] A superposed prosodic model for Chinese text-to-speech synthesis
Chen, GP
Bailly, G
Liu, QF
Wang, RH
[J]. 2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 177 - 180
[3] Whistler: A trainable text-to-speech system
Huang, XD
Acero, A
Adcock, J
Hon, HW
Goldsmith, J
Liu, JS
Plumpe, M
[J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2387 - 2390
[4] A prosodic phrasing model for a Korean text-to-speech synthesis system
Yoon, K
[J]. COMPUTER SPEECH AND LANGUAGE, 2006, 20 (01): : 69 - 79
[5] SFC: A trainable prosodic model
Bailly, G
Holm, B
[J]. SPEECH COMMUNICATION, 2005, 46 (3-4) : 348 - 364
[6] A tree-based model of prosodic phrasing for Chinese text-to-speech systems
Chen, WJ
Lin, FZ
Li, JM
Zhang, B
[J]. ADVANCES IN MUTLIMEDIA INFORMATION PROCESSING - PCM 2001, PROCEEDINGS, 2001, 2195 : 1054 - 1059
[7] Prosodic Annotation in a Thai Text-to-speech System
Potisuk, Siripong
[J]. PACLIC 21: THE 21ST PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION, PROCEEDINGS, 2007, : 405 - 414
[8] A prosodic model for text-to-speech synthesis in French
Di Cristo, A
Di Cristo, P
Campione, E
Véronis, J
[J]. INTONATION: ANALYSIS, MODELLING AND TECHNOLOGY, 2000, 15 : 321 - 355
[9] An Improvement of Prosodic Characteristics in Vietnamese Text to Speech System
Thanh Son Phan
Anh Tuan Dinh
Tat Thang Vu
Chi Mai Luong
[J]. KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2013), VOL 1, 2014, 244 : 99 - 111
[10] A Prosodic Text-to-Speech System for Yoruba Language
Akinwonmi, Akintoba Emmanuel
Alese, Boniface Kayode
[J]. 2013 8TH INTERNATIONAL CONFERENCE FOR INTERNET TECHNOLOGY AND SECURED TRANSACTIONS (ICITST), 2013, : 630 - 635

← 1 2 3 4 5 →