Corpus-based Malay Text-to-Speech Synthesis System

被引：0

作者：

Swee, Tan Tian ^{[1
]}

Salleh, Sheikh Hussain Shaikh ^{[1
]}

机构：

[1] Univ Teknol Malaysia, Fac Biomed & Hlth Sci Engn, Utm Skudai 81310, Johor, Malaysia

来源：

2008 14TH ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS, (APCC), VOLS 1 AND 2 | 2008年

关键词：

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

The main problem with current Malay text-to-speech (TTS) synthesis system is the poor quality of the generated speech sound. This poor quality is resulted from the inability of traditional TTS system to provide multiple choices of unit for generating more accurate synthesized speech. Most of the current available Malay TTS systems are utilizing diphone concatenation that only support a single unit for each existing diphone, thus it cannot provide more accurate selection of speech unit for concatenation]. This project has implemented a variable length unit selection Malay text to speech system that is capable of providing more natural and accurate unit selection for synthesized speech. This paper proposes a method of combining both linguistic context and feature distance cost for selecting the best match unit. A set of digitized Malay word has been collected from Malay internet news for Malay word frequency count. 381 sentences have been designed which cover around 70 percent of high frequency words from 10 million digitized word obtained from Malay internet news. Then a unit selection method has been implemented to provide the capability of selecting a speech unit not only limited to phoneme, diphone or triphone but also a string of phonemes that can be matched directly to the database. A set of listening test namely Modify Rhythm Test (MRT) has been carried out with 35 participants, which represented 86 percent of accuracy.

引用

页码：52 / 56

页数：5

共 50 条

[41] A Performance Improvement Method using Variable Break in Corpus Based Japanese Text-to-Speech System
Na, Deok-Su
Min, So-Yeon
Lee, Jong-Seok
Bae, Myung-Jin
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2009, 28 (02): : 155 - 163
[42] Slovenian text-to-speech system
Sef, T
[J]. ISCAS 2000: IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS - PROCEEDINGS, VOL V: EMERGING TECHNOLOGIES FOR THE 21ST CENTURY, 2000, : 41 - 44
[43] A TEXT-TO-SPEECH CONVERSION SYSTEM
KLATT, DH
[J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1982, 184 (SEP): : 11 - CINF
[44] A Hakka text-to-speech system
Yu, Hsiu-Min
Hwang, Hsin-Te
Lin, Dong-Yi
Chen, Sin-Horng
[J]. CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 241 - +
[45] A Mandarin text-to-speech system
Hwang, SH
Chen, SH
Wang, YR
[J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1421 - 1424
[46] Text-to-speech system for Danish
[J]. 1600, Publ by Elsevier Science Publishers B.V., Amsterdam, Neth
[47] LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech
Zen, Heiga
Dang, Viet
Clark, Rob
Zhang, Yu
Weiss, Ron J.
Jia, Ye
Chen, Zhifeng
Wu, Yonghui
[J]. INTERSPEECH 2019, 2019, : 1526 - 1530
[48] Introduction to Multilingual Corpus-Based Concatenative Speech Synthesis
Deprez, Filip
Odijk, Jan
De Moortel, Jan
[J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 357 - 360
[49] Probabilistic Concatenation Modeling for Corpus-Based Speech Synthesis
Sakai, Shinsuke
Kawahara, Tatsuya
Kawai, Hisashi
[J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (10): : 2006 - 2014
[50] Segment Connection Networks for Corpus-Based Speech Synthesis
Coorman, Geert
[J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2074 - 2077

← 1 2 3 4 5 →