A stochastic knowledge-based Thai text-to-speech system

Cited by: 5
Authors
Narupiyakul, L [1]
Khumya, A
Sirinaovakul, B
Cercone, N
Affiliations
[1] King Mongkuts Univ Technol Thonburi, Dept Comp Engn, Bangkok 10140, Thailand
[2] Dalhousie Univ, Fac Comp Sci, Halifax, NS B3H 1W5, Canada
Funding
Natural Sciences and Engineering Research Council of Canada (NSERC)
Keywords
All Open Access; Bronze
DOI
10.1016/j.mcm.2002.11.004
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
We describe the development of our Thai text-to-speech (TTS) system. A Thai TTS system transforms Thai text into the sequence of sounds appropriate for Thai speech. Because of the complexity of Thai, our rule-based approach, which is grounded in Thai syllable structure analysis, requires approximately four hundred rules, both main and specific, to derive most pronunciations. An exception dictionary that covers anomalous pronunciations and a rule inference engine that determines sentence structure are included to improve the quality of the Thai TTS output. Speech generation by concatenative synthesis is the subsequent step, which transforms sound symbols into synthetic speech. We have tested our system with magazine and internet articles, together with articles from the experiments of other researchers, and we report the results of this informal evaluation. Most syllables in written Thai strings can be converted to phonetic symbols. With a compact Thai unit inventory, the concatenative synthesis system can produce synthetic speech covering many Thai syllables. © 2005 Elsevier Ltd. All rights reserved.
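The pipeline outlined in the abstract (exception dictionary, then rule-based conversion, then concatenation of stored units) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the names `EXCEPTIONS`, `RULES`, `to_phonemes`, and `concatenate` are hypothetical, the entries are toy Latin-script placeholders rather than Thai data, and the greedy longest-match lookup is a simple stand-in, not the paper's roughly four hundred rules or its inference engine.

```python
from typing import Dict, List

# Hypothetical toy data: NOT the paper's rules, dictionary, or unit inventory.
EXCEPTIONS: Dict[str, List[str]] = {"colonel": ["k", "er", "n", "el"]}
RULES: Dict[str, str] = {"ph": "f", "a": "aa", "t": "t", "o": "oo"}


def to_phonemes(word: str) -> List[str]:
    """Look up the exception dictionary first, then apply longest-match rules."""
    if word in EXCEPTIONS:
        return list(EXCEPTIONS[word])
    phonemes: List[str] = []
    max_len = max(len(key) for key in RULES)
    i = 0
    while i < len(word):
        # Try the longest candidate chunk first, then shorter ones.
        for size in range(min(max_len, len(word) - i), 0, -1):
            chunk = word[i:i + size]
            if chunk in RULES:
                phonemes.append(RULES[chunk])
                i += size
                break
        else:
            # No rule matched: keep the raw character as a placeholder symbol.
            phonemes.append(word[i])
            i += 1
    return phonemes


def concatenate(phonemes: List[str], inventory: Dict[str, List[float]]) -> List[float]:
    """Join stored unit waveforms (lists of samples) in phoneme order."""
    samples: List[float] = []
    for symbol in phonemes:
        # Symbols missing from the inventory contribute no samples.
        samples.extend(inventory.get(symbol, []))
    return samples
```

Checking the exception dictionary before the rules lets anomalous words bypass the general rules entirely, which is the role the abstract assigns to its exception dictionary. A real concatenative synthesizer would also smooth the joins between units, which this sketch omits.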
Pages: 1 / 16
Page count: 16
Related papers
50 items in total
  • [1] Prosodic Annotation in a Thai Text-to-speech System
    Potisuk, Siripong
    [J]. PACLIC 21: THE 21ST PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION, PROCEEDINGS, 2007, : 405 - 414
  • [2] State of the Art Review on Thai Text-to-Speech System
    Yimngam, Sukanya
    Premchaisawadi, Wichian
    Kreesuradej, Worapoj
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, 2008, : 194 - +
  • [3] Knowledge-based Linguistic Encoding for End-to-End Mandarin Text-to-Speech Synthesis
    Li, Jingbei
    Wu, Zhiyong
    Li, Runnan
    Zhi, Pengpeng
    Yang, Song
    Meng, Helen
    [J]. INTERSPEECH 2019, 2019, : 4494 - 4498
  • [4] PHONETIC KNOWLEDGE IN TEXT-TO-SPEECH SYNTHESIS
    van Santen, Jan P. H.
    [J]. INTEGRATION OF PHONETIC KNOWLEDGE IN SPEECH TECHNOLOGY, 2005, 25 : 149 - 166
  • [5] SIGNIFICANCE OF KNOWLEDGE SOURCES FOR A TEXT-TO-SPEECH SYSTEM FOR INDIAN LANGUAGES
    YEGNANARAYANA, B
    RAJENDRAN, S
    RAMACHANDRAN, VR
    MADHUKUMAR, AS
    [J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 1994, 19 : 147 - 169
  • [6] Slovenian text-to-speech system
    Sef, T
    [J]. ISCAS 2000: IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS - PROCEEDINGS, VOL V: EMERGING TECHNOLOGIES FOR THE 21ST CENTURY, 2000, : 41 - 44
  • [7] A TEXT-TO-SPEECH CONVERSION SYSTEM
    KLATT, DH
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1982, 184 (SEP) : 11 - CINF
  • [8] A Hakka text-to-speech system
    Yu, Hsiu-Min
    Hwang, Hsin-Te
    Lin, Dong-Yi
    Chen, Sin-Horng
    [J]. CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 241 - +
  • [9] A Mandarin text-to-speech system
    Hwang, SH
    Chen, SH
    Wang, YR
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1421 - 1424
  • [10] A stochastic model of intonation for text-to-speech synthesis
    Véronis, J
    Di Cristo, P
    Courtois, F
    Chaumette, C
    [J]. SPEECH COMMUNICATION, 1998, 26 (04) : 233 - 244