Corpus-based Malay Text-to-Speech Synthesis System

Cited by: 0

Authors:

Swee, Tan Tian [1]

Salleh, Sheikh Hussain Shaikh [1]

Affiliation:

[1] Univ Teknol Malaysia, Fac Biomed & Hlth Sci Engn, Utm Skudai 81310, Johor, Malaysia

Source:

2008 14TH ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS (APCC), VOLS 1 AND 2 | 2008

DOI:

Not available

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology]

Discipline classification code:

0812

Abstract:

The main problem with current Malay text-to-speech (TTS) synthesis systems is the poor quality of the generated speech. This poor quality results from the inability of traditional TTS systems to offer multiple candidate units when generating synthesized speech. Most currently available Malay TTS systems use diphone concatenation, which supports only a single unit for each diphone and therefore cannot select a more accurate speech unit for concatenation. This project has implemented a variable-length unit selection Malay TTS system that is capable of providing more natural and accurate unit selection for synthesized speech. This paper proposes a method that combines linguistic context and feature distance cost to select the best-matching unit. A set of digitized Malay words was collected from Malay internet news for a word frequency count. 381 sentences were designed, covering around 70 percent of the high-frequency words in 10 million digitized words obtained from Malay internet news. A unit selection method was then implemented so that a selected speech unit is not limited to a phoneme, diphone, or triphone but can also be a string of phonemes matched directly against the database. A listening test, the Modified Rhyme Test (MRT), was carried out with 35 participants and yielded an accuracy of 86 percent.
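The abstract does not give the cost formula or the search procedure, so the following is only a minimal sketch of how variable-length unit selection with a combined cost might look. It assumes a greedy longest-match over phoneme strings and a weighted sum of a context-mismatch cost and a Euclidean feature distance. The record fields (`id`, `left`, `right`, `feat`), the weights `w_ctx` and `w_feat`, the boundary symbol `#`, and the toy inventory are all hypothetical and are not taken from the paper.

```python
import math

def context_cost(unit, left, right):
    # Assumed linguistic-context cost: one point per mismatched neighbor.
    return float(unit["left"] != left) + float(unit["right"] != right)

def feature_cost(unit, target_feat):
    # Assumed feature distance: Euclidean distance between feature tuples.
    return math.dist(unit["feat"], target_feat)

def select_units(phonemes, inventory, target_feat,
                 max_len=4, w_ctx=1.0, w_feat=0.1):
    """Greedy longest-match selection over a phoneme sequence.

    At each position the longest phoneme string found in the inventory is
    taken, and among its recorded instances the lowest combined cost wins.
    """
    chosen = []
    i = 0
    while i < len(phonemes):
        for n in range(min(max_len, len(phonemes) - i), 0, -1):
            candidates = inventory.get(tuple(phonemes[i:i + n]))
            if candidates:
                left = phonemes[i - 1] if i > 0 else "#"
                right = phonemes[i + n] if i + n < len(phonemes) else "#"
                best = min(
                    candidates,
                    key=lambda u: w_ctx * context_cost(u, left, right)
                    + w_feat * feature_cost(u, target_feat),
                )
                chosen.append(best["id"])
                i += n
                break
        else:
            # No unit of any length starts at this phoneme.
            raise KeyError(f"no unit covers phoneme {phonemes[i]!r}")
    return chosen

# Toy inventory (hypothetical): features are (pitch in Hz, duration in s).
inventory = {
    ("s", "a"): [
        {"id": "sa_1", "left": "#", "right": "y", "feat": (120.0, 0.18)},
        {"id": "sa_2", "left": "a", "right": "t", "feat": (110.0, 0.20)},
    ],
    ("y", "a"): [
        {"id": "ya_1", "left": "a", "right": "#", "feat": (115.0, 0.22)},
    ],
}

result = select_units(["s", "a", "y", "a"], inventory, target_feat=(118.0, 0.20))
print(result)
```

A greedy longest-match is used here only because it is short to show. A full unit selection system would more typically search all segmentations jointly (for example with dynamic programming) and add a join cost between adjacent units, neither of which is described in the abstract.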

Pages: 52-56

Page count: 5
