Corpus-based Malay Text-to-Speech Synthesis System

被引:0
|
作者
Swee, Tan Tian [1 ]
Salleh, Sheikh Hussain Shaikh [1 ]
机构
[1] Univ Teknol Malaysia, Fac Biomed & Hlth Sci Engn, Utm Skudai 81310, Johor, Malaysia
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The main problem with current Malay text-to-speech (TTS) synthesis system is the poor quality of the generated speech sound. This poor quality is resulted from the inability of traditional TTS system to provide multiple choices of unit for generating more accurate synthesized speech. Most of the current available Malay TTS systems are utilizing diphone concatenation that only support a single unit for each existing diphone, thus it cannot provide more accurate selection of speech unit for concatenation]. This project has implemented a variable length unit selection Malay text to speech system that is capable of providing more natural and accurate unit selection for synthesized speech. This paper proposes a method of combining both linguistic context and feature distance cost for selecting the best match unit. A set of digitized Malay word has been collected from Malay internet news for Malay word frequency count. 381 sentences have been designed which cover around 70 percent of high frequency words from 10 million digitized word obtained from Malay internet news. Then a unit selection method has been implemented to provide the capability of selecting a speech unit not only limited to phoneme, diphone or triphone but also a string of phonemes that can be matched directly to the database. A set of listening test namely Modify Rhythm Test (MRT) has been carried out with 35 participants, which represented 86 percent of accuracy.
引用
收藏
页码:52 / 56
页数:5
相关论文
共 50 条
  • [41] A Performance Improvement Method using Variable Break in Corpus Based Japanese Text-to-Speech System
    Na, Deok-Su
    Min, So-Yeon
    Lee, Jong-Seok
    Bae, Myung-Jin
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2009, 28 (02): : 155 - 163
  • [42] Slovenian text-to-speech system
    Sef, T
    [J]. ISCAS 2000: IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS - PROCEEDINGS, VOL V: EMERGING TECHNOLOGIES FOR THE 21ST CENTURY, 2000, : 41 - 44
  • [43] A TEXT-TO-SPEECH CONVERSION SYSTEM
    KLATT, DH
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1982, 184 (SEP): : 11 - CINF
  • [44] A Hakka text-to-speech system
    Yu, Hsiu-Min
    Hwang, Hsin-Te
    Lin, Dong-Yi
    Chen, Sin-Horng
    [J]. CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 241 - +
  • [45] A Mandarin text-to-speech system
    Hwang, SH
    Chen, SH
    Wang, YR
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1421 - 1424
  • [46] Text-to-speech system for Danish
    [J]. 1600, Publ by Elsevier Science Publishers B.V., Amsterdam, Neth
  • [47] LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech
    Zen, Heiga
    Dang, Viet
    Clark, Rob
    Zhang, Yu
    Weiss, Ron J.
    Jia, Ye
    Chen, Zhifeng
    Wu, Yonghui
    [J]. INTERSPEECH 2019, 2019, : 1526 - 1530
  • [48] Introduction to Multilingual Corpus-Based Concatenative Speech Synthesis
    Deprez, Filip
    Odijk, Jan
    De Moortel, Jan
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 357 - 360
  • [49] Probabilistic Concatenation Modeling for Corpus-Based Speech Synthesis
    Sakai, Shinsuke
    Kawahara, Tatsuya
    Kawai, Hisashi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (10): : 2006 - 2014
  • [50] Segment Connection Networks for Corpus-Based Speech Synthesis
    Coorman, Geert
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2074 - 2077