Corpus-based Malay Text-to-Speech Synthesis System

Authors
Swee, Tan Tian [1]
Salleh, Sheikh Hussain Shaikh [1]
Affiliations
[1] Univ Teknol Malaysia, Fac Biomed & Hlth Sci Engn, Utm Skudai 81310, Johor, Malaysia
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The main problem with current Malay text-to-speech (TTS) synthesis systems is the poor quality of the generated speech. This poor quality results from the inability of traditional TTS systems to offer multiple candidate units from which to generate more accurate synthesized speech. Most currently available Malay TTS systems use diphone concatenation, which supports only a single unit for each existing diphone and therefore cannot provide a more accurate selection of speech units for concatenation. This project implements a variable-length unit selection Malay TTS system that is capable of more natural and accurate unit selection for synthesized speech. This paper proposes a method that combines linguistic context with a feature distance cost to select the best-matching unit. A set of digitized Malay words was collected from Malay internet news for a word frequency count. A total of 381 sentences were designed that cover around 70 percent of the high-frequency words among 10 million digitized words obtained from Malay internet news. A unit selection method was then implemented that can select not only a phoneme, diphone, or triphone but also a longer string of phonemes that matches the database directly. A listening test, the Modified Rhyme Test (MRT), was carried out with 35 participants and yielded an accuracy of 86 percent.
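The variable-length selection idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the cost functions or the search strategy, so everything below is an assumption made for illustration: the `Unit` fields, the `pitch` feature, the weights `w_context` and `w_feature`, the `#` boundary symbol, and the greedy longest-match search are all hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Unit:
    """A recorded speech unit (hypothetical structure, not from the paper)."""
    phonemes: tuple       # phoneme string covered by this unit
    left_context: str     # phoneme before the unit in the recording
    right_context: str    # phoneme after the unit in the recording
    pitch: float          # assumed acoustic feature, mean F0 in Hz


def context_cost(unit, left, right):
    # Linguistic context cost: one point per mismatching neighbour.
    return (unit.left_context != left) + (unit.right_context != right)


def feature_cost(unit, prev):
    # Feature distance cost: pitch jump relative to the previous unit.
    if prev is None:
        return 0.0
    return abs(unit.pitch - prev.pitch) / 100.0


def select_units(target, database, w_context=1.0, w_feature=1.0):
    """Greedy longest-match selection over a phoneme sequence.

    At each position the longest phoneme string found in the database is
    taken, and among its candidates the one with the lowest combined cost
    is chosen.
    """
    selected = []
    i = 0
    while i < len(target):
        # Try the longest remaining string first, then shorter ones.
        for length in range(len(target) - i, 0, -1):
            candidates = database.get(tuple(target[i:i + length]))
            if candidates:
                break
        else:
            raise KeyError(f"no unit covers phoneme {target[i]!r}")
        left = target[i - 1] if i > 0 else "#"
        right = target[i + length] if i + length < len(target) else "#"
        prev = selected[-1] if selected else None
        best = min(
            candidates,
            key=lambda u: w_context * context_cost(u, left, right)
            + w_feature * feature_cost(u, prev),
        )
        selected.append(best)
        i += length
    return selected


# Toy database keyed by phoneme string (illustrative values only).
db = {
    ("m", "a"): [
        Unit(("m", "a"), "#", "k", 118.0),
        Unit(("m", "a"), "a", "k", 180.0),
    ],
    ("k", "a", "n"): [Unit(("k", "a", "n"), "a", "#", 125.0)],
}
result = select_units(list("makan"), db)
```

In this toy run the word "makan" has no whole-word entry, so the search falls back to the two-phoneme unit "ma" followed by the three-phoneme unit "kan". A full system would more likely search all segmentations jointly, for example with dynamic programming, instead of committing greedily.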
Pages: 52-56
Page count: 5