Memory-based Data-driven Approach for Grapheme-to-Phoneme Conversion in Bengali Text-to-Speech Synthesis System

被引：0

作者：

Ghosh, Krishnendu ^{[1
]}

Rao, K. Sreenivasa ^{[1
]}

机构：

[1] Indian Inst Technol Kharagpur, Sch Informat Technol, Kharagpur, W Bengal, India

来源：

2011 ANNUAL IEEE INDIA CONFERENCE (INDICON-2011): ENGINEERING SUSTAINABLE SOLUTIONS | 2011年

关键词：

Grapheme-to-phoneme conversion; Bengali; Alignment problem; Text-to-speech synthesis; Data-driven method;

D O I：

暂无

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

In this paper, we propose a memory-based data-driven model for grapheme-to-phoneme (G2P) conversion for Bengali text-to-speech synthesis (TTS) system. Previous studies have stated the significance of the linguistic and phonetic features for rule-based Bengali G2P conversion techniques. But due to the lack of proper morphological analyzer, the scope of rule-based approaches is bounded. The proposed method overcomes the limitation of rule-based methods by exploiting the variety of contexts present in the text corpus built in the current study. The model has been trained with a memory-base showing the relation between graphs and phones based on contexts. The model has been tested with 300 random words and it achieved accuracy of 79.33% at word-level and 96.28% at graph-level. This performance has been compared with a related rule-based approach to prove the effectiveness of a data-driven method. Furthermore, the model doesn't require any morphological knowledge of the words.

引用

页数：4

共 50 条

[31] A Comparison of Speaker-based and Utterance-based Data Selection for Text-to-Speech Synthesis
Lee, Kai-Zhan
Cooper, Erica
Hirschberg, Julia
[J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2873 - 2877
[32] A Smart Control System for the Oil Industry Using Text-to-Speech Synthesis Based on IIoT
Mandeel, Ali Raheem
Aggar, Ammar Abdullah
Al-Radhi, Mohammed Salah
Csapo, Tamas Gabor
[J]. ELECTRONICS, 2023, 12 (16)
[33] Integration of rule-based formant synthesis and waveform concatenation: A hybrid approach to text-to-speech synthesis
Hertz, SR
[J]. PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON SPEECH SYNTHESIS, 2002, : 87 - 90
[34] Development of a rule based approach for Bangla text to speech conversion system
Anam, A. S. M. Iftekhar
Osman, Sowkot
Chowdhury, Asif Jamil
Ali, Muhammad Masroor
[J]. ICECE 2006: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, 2006, : 181 - +
[35] A Novel Text-to-Speech Synthesis System Using Syllable-Based HMM for Tamil Language
Manoharan, J. Samuel
[J]. PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON SUSTAINABLE EXPERT SYSTEMS (ICSES 2021), 2022, 351 : 305 - 314
[36] Towards designing a high intelligibility rule based Standard Malay text-to-speech synthesis system
Ahmad, Zakiah Hanim
Khalifa, Othman
[J]. 2008 INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING, VOLS 1-3, 2008, : 89 - 94
[37] Time and space-efficient architecture for a corpus-based text-to-speech synthesis system
Rojc, Matej
Kacic, Zdravko
[J]. SPEECH COMMUNICATION, 2007, 49 (03) : 230 - 249
[38] A Data-Driven Approach for Fault Diagnosis in Gearbox of Wind Energy Conversion System
Krueger, Minjia
Ding, Steven X.
Haghani, Adel
Engel, Peter
Jeinsch, Torsten
[J]. 2013 2ND INTERNATIONAL CONFERENCE ON CONTROL AND FAULT-TOLERANT SYSTEMS (SYSTOL), 2013, : 359 - 364
[39] A System for Transforming the Emotion in Speech: Combining Data-Driven Conversion Techniques for Prosody and Voice Quality
Inanoglu, Zeynep
Young, Steve
[J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 457 - 460
[40] Optimal state duration assignment in hidden Markov model-based text-to-speech synthesis system
Khan, Najeeb Ullah
Lee, Jung-Chul
[J]. ELECTRONICS LETTERS, 2015, 51 (12) : 941 - 942

← 1 2 3 4 5 →