Customizing Grapheme-to-Phoneme System for Non-Trivial Transcription Problems in Bangla Language

被引:0
|
作者
Shubha, Sudipta Saha [1 ]
Sadeq, Nafis [1 ]
Ahmed, Shafayat [1 ]
Islam, Md Nahidul [1 ]
Adnan, Muhammad Abdullah [1 ]
Khan, Md Yasin Ali [2 ]
Islam, Mohammad Zuberul [2 ]
机构
[1] Bangladesh Univ Engn & Technol BUET, Dhaka, Bangladesh
[2] Samsung R&D Inst, Dhaka, Bangladesh
关键词
MODELS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Grapheme to phoneme (G2P) conversion is an integral part in various text and speech processing systems, such as: Text to Speech system, Speech Recognition system, etc. The existing methodologies for G2P conversion in Bangla language are mostly rule-based. However, data-driven approaches have proved their superiority over rule-based approaches for largescale G2P conversion in other languages, such as: English, German, etc. As the performance of data-driven approaches for G2P conversion depend largely on pronunciation lexicon on which the system is trained, in this paper, we investigate on developing an improved training lexicon by identifying and categorizing the critical cases in Bangla language and include those critical cases in training lexicon for developing a robust G2P conversion system in Bangla language. Additionally, we have incorporated nasal vowels in our proposed phoneme list. Our methodology outperforms other stateof-the-art approaches for G2P conversion in Bangla language.
引用
收藏
页码:3191 / 3200
页数:10
相关论文
共 50 条
  • [1] Grapheme-to-Phoneme Models for (Almost) Any Language
    Deri, Aliya
    Knight, Kevin
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2016, : 399 - 408
  • [2] An encoder-decoder based grapheme-to-phoneme converter for Bangla speech synthesis
    Ahmad, Arif
    Selim, Mohammad Reza
    Iqbal, Muhammed Zafar
    Rahman, Mohammad Shahidur
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2019, 40 (06) : 374 - 381
  • [3] Grapheme-to-phoneme conversion in Chinese TTS system
    Dong, HH
    Tao, JH
    Xu, B
    2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 165 - 168
  • [4] Automated Grapheme-to-Phoneme Conversion System for Romanian
    Jozsef, Domokos
    Ovidiu, Buza
    Gavril, Toderean
    2011 6TH CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2011,
  • [5] GRAPHEME-TO-PHONEME CONVERSION METHODS FOR MINORITY LANGUAGE CONDITIONS
    Cao, Mengxue
    Renals, Steve
    Bell, Peter
    Li, Aijun
    Fang, Qiang
    2012 INTERNATIONAL CONFERENCE ON SPEECH DATABASE AND ASSESSMENTS, 2012, : 151 - 156
  • [6] A new definition of Xhosa grapheme-to-phoneme rules for automatic transcription
    Louw, Philippa H.
    SOUTH AFRICAN JOURNAL OF AFRICAN LANGUAGES, 2005, 25 (02) : 71 - 91
  • [7] A Rule-Based Grapheme-to-Phoneme Conversion System
    Klosowski, Piotr
    APPLIED SCIENCES-BASEL, 2022, 12 (05):
  • [8] An evaluation of non-standard features for grapheme-to-phoneme conversion
    Webster, Gabriel
    Braunschweiler, Norbert
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1845 - 1848
  • [9] Unsupervised, language-independent grapheme-to-phoneme conversion by latent analogy
    Bellegarda, JR
    SPEECH COMMUNICATION, 2005, 46 (02) : 140 - 152
  • [10] Unsupervised, language-independent grapheme-to-phoneme conversion by latent analogy
    Bellegarda, JR
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 244 - 247