MYANMAR SPEECH SYNTHESIS SYSTEM BY USING PHONEME CONCATENATION METHOD

被引：0

作者：

Hlaing, Chaw Su ^{[1
]}

Thida, Aye ^{[1
]}

机构：

[1] Univ Comp Studies, Mandalay, Myanmar

来源：

PROCEEDINGS OF 2017 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION (ICSPC'17) | 2017年

关键词：

Myanmar Text to Speech; Phoneme; Concatenative Speech Synthesis; SELECTION;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

For Myanmar language, there has been great effort in speech processing so that Myanmar text to speech (MTTS) is one of the interesting research topics for Myanmar natural language processing field. Generally, Myanmar language is the syllabic language and the combination of consonant and vowel phonemes can make syllable. In this paper, we proposed new phoneme concatenation algorithm for MTTS system. Consequently, firstly, we created phoneme speech database in which there are only 125 phoneme units that can speech out for any Myanmar texts. It is very suitable for resource limited devices, such as mobile phones. In our proposed system, firstly, the system accepts input Myanmar texts and then these texts are normalized for next processing. After that, the standized texts are converted into phoneme sequences by using proposed phonological rules that can get the high quality MTTS system. Detecting phrase boundary is also considered to assign pause duration for the purpose of getting more natural sounding MTTS. In the case of speech generation, we concatenate the phoneme speech units with our proposed algorithm. According to the experimental result, our proposed phoneme concatenation algorithm achieves the acceptable level of intelligibility and naturalness for Myanmar speech output.

引用

页码：399 / 404

页数：6

共 50 条

[41] Development of a Taiwanese Speech Synthesis System UsingHidden Markov Models and aRobust Tonal Phoneme Corpus
Sher, Yung-Ji
Hsu, Ming-Chun
Chiu, Yu-Hsien
Chen, Yeou-Jiunn
Wu, Chung-Hsien
Wu, Jiunn-Liang
[J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2024, 40 (02) : 283 - 302
[42] HMM Based Myanmar Text to Speech System
Thu, Ye Kyaw
Pa, Win Pa
Ni, Jinfu
Shiga, Yoshinori
Finch, Andrew
Hori, Chiori
Kawai, Hisashi
Sumita, Eiichiro
[J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2237 - 2241
[43] Symbol based concatenation approach for Text to Speech System for Hindi using vowel classification technique
Chaudhury, Pamela
Rao, Madhuri
Kumar, KVinod
[J]. 2009 WORLD CONGRESS ON NATURE & BIOLOGICALLY INSPIRED COMPUTING (NABIC 2009), 2009, : 1081 - +
[44] Phoneme Duration Modeling Using Speech Rhythm-Based Speaker Embeddings for Multi-Speaker Speech Synthesis
Fujita, Kenichi
Ando, Atsushi
Ijima, Yusuke
[J]. INTERSPEECH 2021, 2021, : 3141 - 3145
[45] Speech bandwidth extension method using speech recognition and speech synthesis
Takashina, Masashi
Kuroiwa, Shingo
Tsuge, Satoru
Ren, Fuji
[J]. 2006 10TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY, VOLS 1 AND 2, PROCEEDINGS, 2006, : 1273 - +
[46] ERROR DETECTION OF GRAPHEME-TO-PHONEME CONVERSION IN TEXT-TO-SPEECH SYNTHESIS USING SPEECH SIGNAL AND LEXICAL CONTEXT
Vythelingum, Kevin
Esteve, Yannick
Rosec, Olivier
[J]. 2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 692 - 697
[47] The Effects of Phoneme Errors in Speaker Adaptation for HMM Speech Synthesis
Toth, Balint
Fegyo, Tibor
Nemeth, Geza
[J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2816 - +
[48] Speech coding and phoneme classification using MATLAB and NeuralWorks
StGeorge, BA
Wooten, EC
Sellami, L
[J]. FRONTIERS IN EDUCATION 1997 - 27TH ANNUAL CONFERENCE, PROCEEDINGS, BOLS I - III, 1997, : 12 - 12
[49] Speech Emotion Recognition Using Spectrogram & Phoneme Embedding
Yenigalla, Promod
Kumar, Abhay
Tripathi, Suraj
Singh, Chirag
Kar, Sibsambhu
Vepa, Jithendra
[J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3688 - 3692
[50] Expressive Speech Animation Synthesis with Phoneme-Level Controls
Deng, Z.
Neumann, U.
[J]. COMPUTER GRAPHICS FORUM, 2008, 27 (08) : 2096 - 2113

← 1 2 3 4 5 →