A stochastic knowledge-based Thai text-to-speech system

Cited by: 5

Authors:

Narupiyakul, L [1]

Khumya, A

Sirinaovakul, B

Cercone, N

Affiliations:

[1] King Mongkuts Univ Technol Thonburi, Dept Comp Engn, Bangkok 10140, Thailand

[2] Dalhousie Univ, Fac Comp Sci, Halifax, NS B3H 1W5, Canada

Source:

MATHEMATICAL AND COMPUTER MODELLING | 2005 / Vol. 42 / Issue 1-2

Funding:

Natural Sciences and Engineering Research Council of Canada;

Keywords:

All Open Access; Bronze;

DOI:

10.1016/j.mcm.2002.11.004

Chinese Library Classification (CLC) number:

TP39 [Computer applications];

Discipline classification code:

081203 ; 0835 ;

Abstract:

We describe the development of our Thai text-to-speech (TTS) system. The Thai TTS system transforms Thai text into the sequence of sounds appropriate for Thai speech. Our rule-based approach is grounded in Thai syllable structure analysis; because of the complexity of Thai, it requires approximately four hundred rules, including main and specific rules, to derive most pronunciations. An exception dictionary that covers anomalous pronunciations and a rule inference engine that determines sentence structures are included to improve the quality of Thai TTS. Speech generation by concatenative synthesis is the subsequent step, which transforms sound symbols into synthetic speech. We have tested our system with magazine and internet articles, together with articles from the experiments of other researchers, and we report the results of this informal evaluation. Most syllables from Thai written strings can be converted to phonetic symbols. With a compact Thai unit inventory, the concatenative synthesis system can generate synthetic speech covering many Thai syllables. (c) 2005 Elsevier Ltd. All rights reserved.
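
The pipeline outlined in the abstract (exception dictionary first, then pronunciation rules, then concatenation of stored units) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the names `EXCEPTIONS`, `RULES`, `UNITS`, `to_phonetic`, and `synthesize` are hypothetical, the entries are toy Latin-letter placeholders rather than real Thai rules, and the waveforms are stand-in sample lists. It is not the authors' implementation and does not model the rule inference engine for sentence structure.

```python
from typing import Dict, List, Tuple

# Hypothetical exception dictionary: written form -> sound symbols.
# It is consulted before any rule, as the abstract describes for anomalous cases.
EXCEPTIONS: Dict[str, List[str]] = {"cc": ["C"]}

# Hypothetical pronunciation rules (grapheme sequence, sound symbol),
# listed so that longer grapheme sequences are tried first.
RULES: List[Tuple[str, str]] = [("ch", "C"), ("a", "a"), ("b", "b"), ("c", "k")]

# Hypothetical unit inventory: sound symbol -> stand-in waveform samples.
UNITS: Dict[str, List[float]] = {
    "C": [0.1, 0.2],
    "a": [0.3],
    "b": [0.4, 0.5],
    "k": [0.6],
}


def to_phonetic(word: str) -> List[str]:
    """Convert a written string to sound symbols: dictionary first, then rules."""
    if word in EXCEPTIONS:
        return list(EXCEPTIONS[word])
    symbols: List[str] = []
    i = 0
    while i < len(word):
        for grapheme, symbol in RULES:
            if word.startswith(grapheme, i):
                symbols.append(symbol)
                i += len(grapheme)
                break
        else:
            # No rule covers this position; a real system would need a fallback.
            raise ValueError(f"no rule matches {word[i:]!r}")
    return symbols


def synthesize(symbols: List[str]) -> List[float]:
    """Concatenate the stored unit for each sound symbol into one sample list."""
    samples: List[float] = []
    for symbol in symbols:
        samples.extend(UNITS[symbol])
    return samples
```

In this sketch, `to_phonetic("chab")` is resolved by the rules, while `to_phonetic("cc")` is resolved by the exception dictionary and never reaches them; `synthesize` then joins the unit for each symbol end to end.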


Pages: 1-16

Page count: 16

