Memory-based Data-driven Approach for Grapheme-to-Phoneme Conversion in Bengali Text-to-Speech Synthesis System

被引:0
|
作者
Ghosh, Krishnendu [1 ]
Rao, K. Sreenivasa [1 ]
机构
[1] Indian Inst Technol Kharagpur, Sch Informat Technol, Kharagpur, W Bengal, India
关键词
Grapheme-to-phoneme conversion; Bengali; Alignment problem; Text-to-speech synthesis; Data-driven method;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, we propose a memory-based data-driven model for grapheme-to-phoneme (G2P) conversion for Bengali text-to-speech synthesis (TTS) system. Previous studies have stated the significance of the linguistic and phonetic features for rule-based Bengali G2P conversion techniques. But due to the lack of proper morphological analyzer, the scope of rule-based approaches is bounded. The proposed method overcomes the limitation of rule-based methods by exploiting the variety of contexts present in the text corpus built in the current study. The model has been trained with a memory-base showing the relation between graphs and phones based on contexts. The model has been tested with 300 random words and it achieved accuracy of 79.33% at word-level and 96.28% at graph-level. This performance has been compared with a related rule-based approach to prove the effectiveness of a data-driven method. Furthermore, the model doesn't require any morphological knowledge of the words.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] A unified approach to grapheme-to-phoneme conversion for the PLATTOS Slovenian text-to-speech system
    Rojc, Matej
    Kacic, Zdravko
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2007, 21 (06) : 563 - 603
  • [2] DNN-based grapheme-to-phoneme conversion for Arabic text-to-speech synthesis
    Ikbel Hadj Ali
    Zied Mnasri
    Zied Lachiri
    [J]. International Journal of Speech Technology, 2020, 23 : 569 - 584
  • [3] DNN-based grapheme-to-phoneme conversion for Arabic text-to-speech synthesis
    Ali, Ikbel Hadj
    Mnasri, Zied
    Lachiri, Zied
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (03) : 569 - 584
  • [4] ERROR DETECTION OF GRAPHEME-TO-PHONEME CONVERSION IN TEXT-TO-SPEECH SYNTHESIS USING SPEECH SIGNAL AND LEXICAL CONTEXT
    Vythelingum, Kevin
    Esteve, Yannick
    Rosec, Olivier
    [J]. 2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 692 - 697
  • [5] Objective evaluation of grapheme to phoneme conversion for text-to-speech synthesis in French
    Yvon, F
    de Mareuil, PB
    d'Alessandro, C
    Auberge, V
    Bagein, M
    Bailly, G
    Bechet, F
    Foukia, S
    Goldman, JF
    Keller, E
    O'Shaughnessy, D
    Pagel, V
    Sannier, F
    Veronis, J
    Zellner, B
    [J]. COMPUTER SPEECH AND LANGUAGE, 1998, 12 (04): : 393 - 410
  • [6] Data-Driven Phrase Break Prediction for Bengali Text-to-Speech System
    Ghosh, Krishnendu
    Rao, K. Sreenivasa
    [J]. CONTEMPORARY COMPUTING, 2012, 306 : 118 - 129
  • [7] Acoustic data-driven grapheme-to-phoneme conversion in the probabilistic lexical modeling framework
    Razavi, Marzieh
    Rasipuram, Ramya
    Magimai-Doss, Mathew
    [J]. SPEECH COMMUNICATION, 2016, 80 : 1 - 21
  • [8] ACOUSTIC DATA-DRIVEN GRAPHEME-TO-PHONEME CONVERSION USING KL-HMM
    Rasipuram, Ramya
    Magimai-Doss, Mathew
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4841 - 4844
  • [9] ACOUSTIC DATA-DRIVEN GRAPHEME-TO-PHONEME CONVERSION USING KL-HMM
    Rasipuram, Ramya
    Magimai-Doss, Mathew
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4841 - 4844
  • [10] Text-To-Speech with cross-lingual Neural Network-based grapheme-to-phoneme models
    Gonzalvo, Xavi
    Podsiadlo, Monika
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 765 - 769