Corpus-based Malay Text-to-Speech Synthesis System

Authors
Swee, Tan Tian [1]
Salleh, Sheikh Hussain Shaikh [1]
Affiliation
[1] Univ Teknol Malaysia, Fac Biomed & Hlth Sci Engn, Utm Skudai 81310, Johor, Malaysia
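DOI
Not available
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract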
The main problem with current Malay text-to-speech (TTS) synthesis systems is the poor quality of the generated speech. This poor quality results from the inability of traditional TTS systems to offer multiple candidate units when generating synthesized speech. Most currently available Malay TTS systems use diphone concatenation, which supports only a single unit for each existing diphone and therefore cannot select the speech unit that best fits a given context. This project implements a variable-length unit selection Malay TTS system capable of more natural and accurate unit selection for synthesized speech. This paper proposes a method that combines linguistic context and feature distance costs to select the best-matching unit. A set of digitized Malay words was collected from Malay internet news for a word frequency count. From these, 381 sentences were designed that cover around 70 percent of the high-frequency words among 10 million digitized words obtained from Malay internet news. A unit selection method was then implemented that can select speech units not limited to phonemes, diphones, or triphones but also strings of phonemes that can be matched directly against the database. A listening test, the Modified Rhyme Test (MRT), was carried out with 35 participants and yielded an accuracy of 86 percent.
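The selection idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `Unit` fields, the single feature vector per unit, the greedy longest-match search, and the weights `w_ctx` and `w_feat` are all assumptions made for illustration. The sketch picks the longest phoneme string available in the inventory, then chooses among candidates for that string by a weighted sum of a linguistic context cost and a feature distance cost.

```python
from dataclasses import dataclass
from math import dist


@dataclass(frozen=True)
class Unit:
    """A recorded speech unit (hypothetical structure, for illustration only)."""
    phonemes: tuple   # phoneme string covered by this unit
    left: str         # phoneme before the unit in the recording ("#" = silence)
    right: str        # phoneme after the unit in the recording
    features: tuple   # assumed acoustic features, e.g. (mean pitch in Hz,)


def context_cost(unit, left, right):
    """Count how many neighbouring phonemes differ from the target context."""
    return (unit.left != left) + (unit.right != right)


def feature_cost(prev, unit):
    """Euclidean distance between the features of adjacent units."""
    return 0.0 if prev is None else dist(prev.features, unit.features)


def select_units(target, inventory, w_ctx=1.0, w_feat=0.01):
    """Greedy variable-length selection: longest match first, then lowest cost."""
    index = {}
    for u in inventory:
        index.setdefault(u.phonemes, []).append(u)
    max_len = max((len(k) for k in index), default=0)

    chosen, prev, i = [], None, 0
    while i < len(target):
        # Try the longest phoneme string first, then shorter ones.
        for n in range(min(max_len, len(target) - i), 0, -1):
            cands = index.get(tuple(target[i:i + n]))
            if cands:
                break
        else:
            raise KeyError(f"no unit covers phoneme {target[i]!r}")
        left = target[i - 1] if i > 0 else "#"
        right = target[i + n] if i + n < len(target) else "#"
        best = min(
            cands,
            key=lambda u: w_ctx * context_cost(u, left, right)
            + w_feat * feature_cost(prev, u),
        )
        chosen.append(best)
        prev = best
        i += n
    return chosen


# Toy inventory for the Malay word "makan" (values are made up).
inventory = [
    Unit(("m", "a"), "#", "k", (120.0,)),
    Unit(("k", "a", "n"), "i", "#", (121.0,)),
    Unit(("k", "a", "n"), "a", "#", (125.0,)),
    Unit(("k",), "a", "a", (120.0,)),
]
result = select_units(["m", "a", "k", "a", "n"], inventory)
print([u.phonemes for u in result])
```

In this toy inventory the second candidate for "kan" wins even though its features are further from the preceding unit, because its recorded left context matches the target. A fuller system would typically search over all segmentations rather than greedily, which this sketch does not attempt.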
Pages: 52-56
Page count: 5