Memory-based Data-driven Approach for Grapheme-to-Phoneme Conversion in Bengali Text-to-Speech Synthesis System

被引:0
|
作者
Ghosh, Krishnendu [1 ]
Rao, K. Sreenivasa [1 ]
机构
[1] Indian Inst Technol Kharagpur, Sch Informat Technol, Kharagpur, W Bengal, India
关键词
Grapheme-to-phoneme conversion; Bengali; Alignment problem; Text-to-speech synthesis; Data-driven method;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, we propose a memory-based data-driven model for grapheme-to-phoneme (G2P) conversion for Bengali text-to-speech synthesis (TTS) system. Previous studies have stated the significance of the linguistic and phonetic features for rule-based Bengali G2P conversion techniques. But due to the lack of proper morphological analyzer, the scope of rule-based approaches is bounded. The proposed method overcomes the limitation of rule-based methods by exploiting the variety of contexts present in the text corpus built in the current study. The model has been trained with a memory-base showing the relation between graphs and phones based on contexts. The model has been tested with 300 random words and it achieved accuracy of 79.33% at word-level and 96.28% at graph-level. This performance has been compared with a related rule-based approach to prove the effectiveness of a data-driven method. Furthermore, the model doesn't require any morphological knowledge of the words.
引用
收藏
页数:4
相关论文
共 50 条
  • [41] A System for Transforming the Emotion in Speech: Combining Data-Driven Conversion Techniques for Prosody and Voice Quality
    Inanoglu, Zeynep
    Young, Steve
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 457 - 460
  • [42] Optimal state duration assignment in hidden Markov model-based text-to-speech synthesis system
    Khan, Najeeb Ullah
    Lee, Jung-Chul
    [J]. ELECTRONICS LETTERS, 2015, 51 (12) : 941 - 942
  • [43] OCR BASED SPEECH SYNTHESIS SYSTEM USING LAB VIEW Text to Speech Conversion System using OCR
    Mullani, J. J.
    Sankar, M.
    Khade, Priyanka S.
    Sonalkar, Snehal H.
    Patil, Nikita L.
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2018), 2018, : 7 - 14
  • [44] Knowledge-based and Data-driven Approach based Fault Diagnosis for Power-Electronics Energy Conversion System
    Liu, Chuang
    Kou, Lei
    Cai, Guo-wei
    Zhou, Jia-ning
    Meng, Yi-qun
    Yan, Yu-heng
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CONTROL, AND COMPUTING TECHNOLOGIES FOR SMART GRIDS (SMARTGRIDCOMM), 2019,
  • [45] A Data-Driven Approach for Sensor Fault Diagnosis in Gearbox of Wind Energy Conversion System
    Krueger, Minjia
    Ding, Steven X.
    Haghani, Adel
    Engel, Peter
    Jeinsch, Torsten
    [J]. 2013 10TH IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2013, : 227 - 232
  • [46] A Data-Driven Approach for Blockchain-Based Smart Grid System
    Zeng, Zeng
    Dong, Meiya
    Miao, Weiwei
    Zhang, Mingming
    Tang, Hao
    [J]. IEEE ACCESS, 2021, 9 : 70061 - 70070
  • [47] Data-Driven Pause Prediction for Synthesis of Storytelling Style Speech based on Discourse Modes
    Sarkar, Parakrant
    Rao, K. Sreenivasa
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTING AND COMMUNICATION TECHNOLOGIES (CONECCT), 2015,
  • [48] DNN-based Speech Synthesis for Small Data Sets Considering Bidirectional Speech-Text Conversion
    Sone, Kentaro
    Nakashika, Toru
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2519 - 2523
  • [49] A Joint Model and Data-driven Based Power System Disturbance Localization Approach
    Huang D.
    Liu H.
    Bi T.
    Yang Q.
    [J]. Dianwang Jishu/Power System Technology, 2023, 47 (03): : 1206 - 1217
  • [50] Deep learning-based speaker-adaptive postfiltering with limited adaptation data for embedded text-to-speech synthesis systems
    Eren, Eray
    Demiroglu, Cenk
    [J]. COMPUTER SPEECH AND LANGUAGE, 2023, 81