Memory-based Data-driven Approach for Grapheme-to-Phoneme Conversion in Bengali Text-to-Speech Synthesis System

被引:0
|
作者
Ghosh, Krishnendu [1 ]
Rao, K. Sreenivasa [1 ]
机构
[1] Indian Inst Technol Kharagpur, Sch Informat Technol, Kharagpur, W Bengal, India
关键词
Grapheme-to-phoneme conversion; Bengali; Alignment problem; Text-to-speech synthesis; Data-driven method;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, we propose a memory-based data-driven model for grapheme-to-phoneme (G2P) conversion for Bengali text-to-speech synthesis (TTS) system. Previous studies have stated the significance of the linguistic and phonetic features for rule-based Bengali G2P conversion techniques. But due to the lack of proper morphological analyzer, the scope of rule-based approaches is bounded. The proposed method overcomes the limitation of rule-based methods by exploiting the variety of contexts present in the text corpus built in the current study. The model has been trained with a memory-base showing the relation between graphs and phones based on contexts. The model has been tested with 300 random words and it achieved accuracy of 79.33% at word-level and 96.28% at graph-level. This performance has been compared with a related rule-based approach to prove the effectiveness of a data-driven method. Furthermore, the model doesn't require any morphological knowledge of the words.
引用
收藏
页数:4
相关论文
共 50 条
  • [31] A Comparison of Speaker-based and Utterance-based Data Selection for Text-to-Speech Synthesis
    Lee, Kai-Zhan
    Cooper, Erica
    Hirschberg, Julia
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2873 - 2877
  • [32] A Smart Control System for the Oil Industry Using Text-to-Speech Synthesis Based on IIoT
    Mandeel, Ali Raheem
    Aggar, Ammar Abdullah
    Al-Radhi, Mohammed Salah
    Csapo, Tamas Gabor
    [J]. ELECTRONICS, 2023, 12 (16)
  • [33] Integration of rule-based formant synthesis and waveform concatenation: A hybrid approach to text-to-speech synthesis
    Hertz, SR
    [J]. PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON SPEECH SYNTHESIS, 2002, : 87 - 90
  • [34] Development of a rule based approach for Bangla text to speech conversion system
    Anam, A. S. M. Iftekhar
    Osman, Sowkot
    Chowdhury, Asif Jamil
    Ali, Muhammad Masroor
    [J]. ICECE 2006: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, 2006, : 181 - +
  • [35] A Novel Text-to-Speech Synthesis System Using Syllable-Based HMM for Tamil Language
    Manoharan, J. Samuel
    [J]. PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON SUSTAINABLE EXPERT SYSTEMS (ICSES 2021), 2022, 351 : 305 - 314
  • [36] Towards designing a high intelligibility rule based Standard Malay text-to-speech synthesis system
    Ahmad, Zakiah Hanim
    Khalifa, Othman
    [J]. 2008 INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING, VOLS 1-3, 2008, : 89 - 94
  • [37] Time and space-efficient architecture for a corpus-based text-to-speech synthesis system
    Rojc, Matej
    Kacic, Zdravko
    [J]. SPEECH COMMUNICATION, 2007, 49 (03) : 230 - 249
  • [38] A Data-Driven Approach for Fault Diagnosis in Gearbox of Wind Energy Conversion System
    Krueger, Minjia
    Ding, Steven X.
    Haghani, Adel
    Engel, Peter
    Jeinsch, Torsten
    [J]. 2013 2ND INTERNATIONAL CONFERENCE ON CONTROL AND FAULT-TOLERANT SYSTEMS (SYSTOL), 2013, : 359 - 364
  • [39] A System for Transforming the Emotion in Speech: Combining Data-Driven Conversion Techniques for Prosody and Voice Quality
    Inanoglu, Zeynep
    Young, Steve
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 457 - 460
  • [40] Optimal state duration assignment in hidden Markov model-based text-to-speech synthesis system
    Khan, Najeeb Ullah
    Lee, Jung-Chul
    [J]. ELECTRONICS LETTERS, 2015, 51 (12) : 941 - 942