Grapheme-to-phoneme conversion of Arabic numeral expressions for embedded TTS systems

被引:2
|
作者
Jung, Youngim [1 ]
Yoon, Aesun
Kwon, Hyuk-Chul
机构
[1] Pusan Natl Univ, Dept Comp Sci & Engn, Korean Language Proc Lab, Pusan 609735, South Korea
[2] Pusan Natl Univ, Dept French, Korean Language Proc Lab, Interdisciplinary Program Cognit Sci, Pusan 609735, South Korea
关键词
Arabic numeral; embedded text-to-speech (TTS) systems; rule-based grapheme-to-phoneme conversion systems; word sense disambiguation;
D O I
10.1109/TASL.2006.876761
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Despite the increasing need for accuracy, current text-to-speech (TTS) systems are still poor at generating the correct pronunciation of Arabic numerals due to their high ambiguity and various interpretations. In this paper, we propose a mini-transliteration system for Arabic-numeral expressions, which can efficiently and correctly convert Arabic numeral expressions found in Korean text into phonemes for embedded TTS systems. For the purpose of building grapheme-to-phoneme rules, we deduced the components of ANEs, and investigated their pattern and arithmetic features based on the analyzed corpus. A word sense disambiguation based on lexical hierarchies in KorLex 1.0 was developed to resolve ambi guities caused by the homographic components of the ANEs. Our system minimized the amount of memory used by 1) separating the morphological analysis module from the transliteration system, 2) compacting the lexicon size, and 3) removing named entities. It reduced the process time dramatically without any serious loss of accuracy, and showed an accuracy of 97.2%-98.3%, which was 21.4%-22.5% higher than that of the baseline, and 5.5%-19.5% higher than current commercial Korean TTS systems.
引用
收藏
页码:296 / 309
页数:14
相关论文
共 50 条
  • [31] NEURAL GRAPHEME-TO-PHONEME CONVERSION WITH PRE-TRAINED GRAPHEME MODELS
    Dong, Lu
    Guo, Zhi-Qiang
    Tan, Chao-Hong
    Hu, Ya-Jun
    Jiang, Yuan
    Ling, Zhen-Hua
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6202 - 6206
  • [32] Grapheme-to-Phoneme Conversion using Conditional Random Fields
    Illina, Irina
    Fohr, Dominique
    Jouvet, Denis
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2324 - 2327
  • [33] A Maximum Entropy Approach to Chinese Grapheme-to-Phoneme Conversion
    Tsai, Richard Tzong-Han
    Wang, Yu-Chun
    [J]. PROCEEDINGS OF THE 2009 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION, 2008, : 411 - +
  • [34] Incorporating syllabification points into a model of grapheme-to-phoneme conversion
    Suyanto, Suyanto
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (02) : 459 - 470
  • [35] Incorporating syllabification points into a model of grapheme-to-phoneme conversion
    Suyanto Suyanto
    [J]. International Journal of Speech Technology, 2019, 22 : 459 - 470
  • [36] A Rule-Based Grapheme-to-Phoneme Conversion System
    Klosowski, Piotr
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (05):
  • [37] Multilingual grapheme-to-phoneme conversion with global character vectors
    Ni, Jinfu
    Shiga, Yoshinori
    Kawai, Hisashi
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2823 - 2827
  • [38] GRAPHEME-TO-PHONEME CONVERSION METHODS FOR MINORITY LANGUAGE CONDITIONS
    Cao, Mengxue
    Renals, Steve
    Bell, Peter
    Li, Aijun
    Fang, Qiang
    [J]. 2012 INTERNATIONAL CONFERENCE ON SPEECH DATABASE AND ASSESSMENTS, 2012, : 151 - 156
  • [39] Example-Based Grapheme-to-Phoneme Conversion for Thai
    Charoenpornsawat, Paisarn
    Schultz, Tanja
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1268 - 1271
  • [40] Grapheme-to-phoneme Conversion based on Adaptive Regularization of Weight Vectors
    Kubo, Keigo
    Sakti, Sakriani
    Neubig, Graham
    Toda, Tomoki
    Nakamura, Satoshi
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1945 - 1949