Incorporating syllabification points into a model of grapheme-to-phoneme conversion

Cited by: 5
Authors
Suyanto, Suyanto [1]
Affiliations
[1] Telkom Univ, Sch Comp, Bandung 40257, West Java, Indonesia
Keywords
Bahasa Indonesia; Grapheme-to-phoneme conversion; Syllabification points; Nearest neighbour; Probabilistic-based approach
DOI
10.1007/s10772-019-09619-4
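Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract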
A grapheme-to-phoneme (G2P) conversion model is crucial in natural language processing. In general, such a model is developed using a probabilistic data-driven approach and applied directly to a sequence of graphemes with no other information. Prior research shows that incorporating syllabification-point information can improve a probabilistic English G2P model. However, that information must be supplied accurately by a perfect orthographic syllabification; noise or errors in the syllabification significantly reduce G2P performance. This paper investigates the incorporation of syllabification points into a probabilistic G2P model for Bahasa Indonesia. This information is important because Bahasa Indonesia is richer than English in terms of syllables. Five-fold cross-validation on 50k words shows that incorporating syllabification points significantly improves the performance of the G2P model, with a relative reduction in phoneme error rate (PER) of 10.75%. This PER is much lower than that of a G2P model based on an inductive learning algorithm. An important contribution of this research is that the proposed G2P model is quite robust to syllabification errors. A syllable error rate (SER) of 2.5% from an orthographic syllabification model only slightly increases the PER of the proposed G2P model, from 0.83% to 0.90%. A higher SER of up to 10% increases the PER only to 1.14%.
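The abstract reports results as phoneme error rates and relative changes between them. The following is a minimal Python sketch of how such figures are commonly computed, assuming PER is the total edit distance between predicted and reference phoneme sequences divided by the number of reference phonemes. The abstract does not state the paper's exact scoring procedure, so this definition is an assumption, and the function names `edit_distance`, `phoneme_error_rate`, and `relative_change` are hypothetical.

```python
from typing import Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance: minimum substitutions, insertions, and deletions."""
    prev = list(range(len(hyp) + 1))  # distances from an empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to an empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def phoneme_error_rate(refs: Sequence[Sequence[str]],
                       hyps: Sequence[Sequence[str]]) -> float:
    """PER in percent (assumed definition: total edits / total reference phonemes)."""
    errors = sum(edit_distance(r, h) for r, h in zip(refs, hyps))
    total = sum(len(r) for r in refs)
    return 100.0 * errors / total


def relative_change(baseline: float, new: float) -> float:
    """Relative change of `new` with respect to `baseline`, in percent."""
    return 100.0 * (new - baseline) / baseline


# Toy phoneme sequences for illustration only; they are not data from the paper.
toy_refs = [["a", "b", "c", "d", "e"]]
toy_hyps = [["a", "b", "c", "d", "x"]]
toy_per = phoneme_error_rate(toy_refs, toy_hyps)  # one substitution in five phonemes

# PER values quoted in the abstract: 0.83% with perfect syllabification,
# 0.90% at 2.5% SER, and 1.14% at up to 10% SER.
rise_at_low_ser = relative_change(0.83, 0.90)
rise_at_high_ser = relative_change(0.83, 1.14)
```

Under this definition, the rise from 0.83% to 0.90% PER is a relative increase of roughly 8.4%, and the rise to 1.14% is roughly 37%, while both remain small in absolute terms.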
Pages: 459-470
Number of pages: 12
Related papers
50 in total
  • [1] Incorporating syllabification points into a model of grapheme-to-phoneme conversion
    Suyanto Suyanto
    [J]. International Journal of Speech Technology, 2019, 22 : 459 - 470
  • [2] Probabilistic context-free grammars for syllabification and grapheme-to-phoneme conversion
    Müller, K
    [J]. PROCEEDINGS OF THE 2001 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, 2001, : 143 - 150
  • [3] Grapheme-to-Phoneme Conversion with a Multilingual Transformer Model
    ElSaadany, Omnia
    Suter, Benjamin
    [J]. 17TH SIGMORPHON WORKSHOP ON COMPUTATIONAL RESEARCH IN PHONETICS PHONOLOGY, AND MORPHOLOGY (SIGMORPHON 2020), 2020, : 85 - 89
  • [4] Boosting Rule-Based Grapheme-to-Phoneme Conversion with Morphological Segmentation and Syllabification in Bengali
    Ghosh, Krishnendu
    Mandal, Sandipan
    Roy, Nilay
    [J]. SPEECH AND COMPUTER, SPECOM 2023, PT I, 2023, 14338 : 415 - 429
  • [5] Fast Bilingual Grapheme-To-Phoneme Conversion
    Kim, Hwa-Yeon
    Kim, Jong-Hwan
    Kim, Jae-Min
    [J]. 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, NAACL-HLT 2022, 2022, : 289 - 296
  • [6] Transformer based Grapheme-to-Phoneme Conversion
    Yolchuyeva, Sevinj
    Nemeth, Geza
    Gyires-Toth, Balint
    [J]. INTERSPEECH 2019, 2019, : 2095 - 2099
  • [7] Grapheme-to-Phoneme Conversion with Convolutional Neural Networks
    Yolchuyeva, Sevinj
    Nemeth, Geza
    Gyires-Toth, Balint
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (06):
  • [8] Grapheme-to-phoneme conversion in Chinese TTS system
    Dong, HH
    Tao, JH
    Xu, B
    [J]. 2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 165 - 168
  • [9] Frustratingly Easy Multilingual Grapheme-to-Phoneme Conversion
    Prabhu, Nikhil
    Kann, Katharina
    [J]. 17TH SIGMORPHON WORKSHOP ON COMPUTATIONAL RESEARCH IN PHONETICS PHONOLOGY, AND MORPHOLOGY (SIGMORPHON 2020), 2020, : 123 - 127
  • [10] Label Embedding for Chinese Grapheme-to-Phoneme Conversion
    Choi, Eunbi
    Kim, Hwa-Yeon
    Kim, Jong-Hwan
    Kim, Jae-Min
    [J]. INTERSPEECH 2021, 2021, : 4094 - 4098