MYANMAR SPEECH SYNTHESIS SYSTEM BY USING PHONEME CONCATENATION METHOD

被引:0
|
作者
Hlaing, Chaw Su [1 ]
Thida, Aye [1 ]
机构
[1] Univ Comp Studies, Mandalay, Myanmar
关键词
Myanmar Text to Speech; Phoneme; Concatenative Speech Synthesis; SELECTION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
For Myanmar language, there has been great effort in speech processing so that Myanmar text to speech (MTTS) is one of the interesting research topics for Myanmar natural language processing field. Generally, Myanmar language is the syllabic language and the combination of consonant and vowel phonemes can make syllable. In this paper, we proposed new phoneme concatenation algorithm for MTTS system. Consequently, firstly, we created phoneme speech database in which there are only 125 phoneme units that can speech out for any Myanmar texts. It is very suitable for resource limited devices, such as mobile phones. In our proposed system, firstly, the system accepts input Myanmar texts and then these texts are normalized for next processing. After that, the standized texts are converted into phoneme sequences by using proposed phonological rules that can get the high quality MTTS system. Detecting phrase boundary is also considered to assign pause duration for the purpose of getting more natural sounding MTTS. In the case of speech generation, we concatenate the phoneme speech units with our proposed algorithm. According to the experimental result, our proposed phoneme concatenation algorithm achieves the acceptable level of intelligibility and naturalness for Myanmar speech output.
引用
收藏
页码:399 / 404
页数:6
相关论文
共 50 条
  • [41] Development of a Taiwanese Speech Synthesis System UsingHidden Markov Models and aRobust Tonal Phoneme Corpus
    Sher, Yung-Ji
    Hsu, Ming-Chun
    Chiu, Yu-Hsien
    Chen, Yeou-Jiunn
    Wu, Chung-Hsien
    Wu, Jiunn-Liang
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2024, 40 (02) : 283 - 302
  • [42] HMM Based Myanmar Text to Speech System
    Thu, Ye Kyaw
    Pa, Win Pa
    Ni, Jinfu
    Shiga, Yoshinori
    Finch, Andrew
    Hori, Chiori
    Kawai, Hisashi
    Sumita, Eiichiro
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2237 - 2241
  • [43] Symbol based concatenation approach for Text to Speech System for Hindi using vowel classification technique
    Chaudhury, Pamela
    Rao, Madhuri
    Kumar, KVinod
    [J]. 2009 WORLD CONGRESS ON NATURE & BIOLOGICALLY INSPIRED COMPUTING (NABIC 2009), 2009, : 1081 - +
  • [44] Phoneme Duration Modeling Using Speech Rhythm-Based Speaker Embeddings for Multi-Speaker Speech Synthesis
    Fujita, Kenichi
    Ando, Atsushi
    Ijima, Yusuke
    [J]. INTERSPEECH 2021, 2021, : 3141 - 3145
  • [45] Speech bandwidth extension method using speech recognition and speech synthesis
    Takashina, Masashi
    Kuroiwa, Shingo
    Tsuge, Satoru
    Ren, Fuji
    [J]. 2006 10TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY, VOLS 1 AND 2, PROCEEDINGS, 2006, : 1273 - +
  • [46] ERROR DETECTION OF GRAPHEME-TO-PHONEME CONVERSION IN TEXT-TO-SPEECH SYNTHESIS USING SPEECH SIGNAL AND LEXICAL CONTEXT
    Vythelingum, Kevin
    Esteve, Yannick
    Rosec, Olivier
    [J]. 2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 692 - 697
  • [47] The Effects of Phoneme Errors in Speaker Adaptation for HMM Speech Synthesis
    Toth, Balint
    Fegyo, Tibor
    Nemeth, Geza
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2816 - +
  • [48] Speech coding and phoneme classification using MATLAB and NeuralWorks
    StGeorge, BA
    Wooten, EC
    Sellami, L
    [J]. FRONTIERS IN EDUCATION 1997 - 27TH ANNUAL CONFERENCE, PROCEEDINGS, BOLS I - III, 1997, : 12 - 12
  • [49] Speech Emotion Recognition Using Spectrogram & Phoneme Embedding
    Yenigalla, Promod
    Kumar, Abhay
    Tripathi, Suraj
    Singh, Chirag
    Kar, Sibsambhu
    Vepa, Jithendra
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3688 - 3692
  • [50] Expressive Speech Animation Synthesis with Phoneme-Level Controls
    Deng, Z.
    Neumann, U.
    [J]. COMPUTER GRAPHICS FORUM, 2008, 27 (08) : 2096 - 2113