Corpus-based Malay Text-to-Speech Synthesis System

Cited by: 0

Authors:

Swee, Tan Tian [1]

Salleh, Sheikh Hussain Shaikh [1]

Affiliation:

[1] Univ Teknol Malaysia, Fac Biomed & Hlth Sci Engn, Utm Skudai 81310, Johor, Malaysia

Source:

2008 14TH ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS, (APCC), VOLS 1 AND 2 | 2008

DOI:

Not available

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

The main problem with current Malay text-to-speech (TTS) synthesis systems is the poor quality of the generated speech. This poor quality results from the inability of traditional TTS systems to offer multiple candidate units for generating more accurate synthesized speech. Most currently available Malay TTS systems use diphone concatenation, which supports only a single unit for each existing diphone and therefore cannot provide a more accurate selection of speech units for concatenation. This project has implemented a variable-length unit selection Malay text-to-speech system that is capable of providing more natural and accurate unit selection for synthesized speech. This paper proposes a method that combines linguistic context cost and feature distance cost to select the best-matching unit. A set of digitized Malay words was collected from Malay internet news for a Malay word frequency count. A total of 381 sentences were designed, covering around 70 percent of the high-frequency words among 10 million digitized words obtained from Malay internet news. A unit selection method was then implemented that can select speech units not limited to phonemes, diphones or triphones but also strings of phonemes that can be matched directly to the database. A listening test, the Modified Rhyme Test (MRT), was carried out with 35 participants and yielded 86 percent accuracy.
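The abstract does not give the selection procedure in detail, so the following is only a minimal sketch of how variable-length unit selection with a combined cost might look. It assumes a greedy longest-match over phoneme strings and a weighted sum of a linguistic context cost and a feature distance cost. The `Unit` fields, the pitch-based distance, the weights `w_ctx` and `w_feat`, and the `"#"` boundary symbol are all hypothetical choices made for illustration and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Unit:
    """One recorded speech unit in the database (hypothetical layout)."""
    phonemes: tuple  # phoneme string covered by this unit
    left: str        # phoneme before the unit in the recording ("#" = boundary)
    right: str       # phoneme after the unit in the recording ("#" = boundary)
    pitch: float     # illustrative acoustic feature, in Hz


def context_cost(unit, left, right):
    """Linguistic context cost: number of mismatched neighbouring phonemes."""
    return (unit.left != left) + (unit.right != right)


def feature_cost(unit, prev):
    """Feature distance cost: pitch jump from the previously chosen unit."""
    if prev is None:
        return 0.0
    return abs(unit.pitch - prev.pitch) / 100.0


def select_units(target, database, w_ctx=1.0, w_feat=1.0):
    """Greedy longest-match selection over a target phoneme sequence."""
    if not database:
        raise ValueError("empty unit database")
    # Index candidate units by the phoneme string they cover.
    index = {}
    for unit in database:
        index.setdefault(unit.phonemes, []).append(unit)
    longest = max(len(key) for key in index)

    chosen, prev, i = [], None, 0
    while i < len(target):
        # Try the longest phoneme string first, then progressively shorter ones.
        for n in range(min(longest, len(target) - i), 0, -1):
            candidates = index.get(tuple(target[i:i + n]))
            if candidates:
                break
        else:
            raise KeyError(f"no unit covers phoneme {target[i]!r}")
        left = target[i - 1] if i > 0 else "#"
        right = target[i + n] if i + n < len(target) else "#"
        # Pick the candidate with the lowest combined cost.
        best = min(
            candidates,
            key=lambda u: w_ctx * context_cost(u, left, right)
            + w_feat * feature_cost(u, prev),
        )
        chosen.append(best)
        prev = best
        i += n
    return chosen
```

A greedy pass is used here only to keep the sketch short; a dynamic-programming search over all candidate sequences is a common alternative when the join cost between adjacent units matters more.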

Pages: 52-56

Page count: 5
