Grapheme-to-phoneme conversion of Arabic numeral expressions for embedded TTS systems

被引:2
|
作者
Jung, Youngim [1 ]
Yoon, Aesun
Kwon, Hyuk-Chul
机构
[1] Pusan Natl Univ, Dept Comp Sci & Engn, Korean Language Proc Lab, Pusan 609735, South Korea
[2] Pusan Natl Univ, Dept French, Korean Language Proc Lab, Interdisciplinary Program Cognit Sci, Pusan 609735, South Korea
关键词
Arabic numeral; embedded text-to-speech (TTS) systems; rule-based grapheme-to-phoneme conversion systems; word sense disambiguation;
D O I
10.1109/TASL.2006.876761
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Despite the increasing need for accuracy, current text-to-speech (TTS) systems are still poor at generating the correct pronunciation of Arabic numerals due to their high ambiguity and various interpretations. In this paper, we propose a mini-transliteration system for Arabic-numeral expressions, which can efficiently and correctly convert Arabic numeral expressions found in Korean text into phonemes for embedded TTS systems. For the purpose of building grapheme-to-phoneme rules, we deduced the components of ANEs, and investigated their pattern and arithmetic features based on the analyzed corpus. A word sense disambiguation based on lexical hierarchies in KorLex 1.0 was developed to resolve ambi guities caused by the homographic components of the ANEs. Our system minimized the amount of memory used by 1) separating the morphological analysis module from the transliteration system, 2) compacting the lexicon size, and 3) removing named entities. It reduced the process time dramatically without any serious loss of accuracy, and showed an accuracy of 97.2%-98.3%, which was 21.4%-22.5% higher than that of the baseline, and 5.5%-19.5% higher than current commercial Korean TTS systems.
引用
收藏
页码:296 / 309
页数:14
相关论文
共 50 条
  • [1] Grapheme-to-Phoneme and Phoneme-to-Grapheme Conversion in Belarusian with NooJ for TTS and STT Systems
    Zahariev, Vadim
    Lysy, Stanislau
    Hiuntar, Alena
    Hetsevich, Yury
    [J]. AUTOMATIC PROCESSING OF NATURAL-LANGUAGE ELECTRONIC TEXTS WITH NOOJ, 2016, 607 : 137 - 150
  • [2] Grapheme-to-phoneme conversion in Chinese TTS system
    Dong, HH
    Tao, JH
    Xu, B
    [J]. 2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 165 - 168
  • [3] Improved Grapheme-to-Phoneme Conversion for Mandarin TTS
    易立夫
    李健
    郝杰
    熊子瑜
    [J]. Tsinghua Science and Technology, 2009, 14 (05) : 606 - 611
  • [4] Main issues in grapheme-to-phoneme conversion for TTS
    Polyakova, Tatyana
    Bonafonte, Antonio
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2005, (35): : 29 - 34
  • [5] Automatic Grapheme-to-Phoneme Conversion of Arabic Text
    Al-Daradkah, Belal
    Al-Diri, Bashir
    [J]. 2015 SCIENCE AND INFORMATION CONFERENCE (SAI), 2015, : 468 - 473
  • [6] Grapheme-to-phoneme conversion based on a fast TBL algorithm in mandarin TTS systems
    Zheng, M
    Shi, Q
    Zhang, W
    Cai, LH
    [J]. FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PT 2, PROCEEDINGS, 2005, 3614 : 600 - 609
  • [7] A grapheme-to-phoneme translator for TTS synthesis in Greek
    Stathopoulou-Zois, P
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2005, 14 (06) : 901 - 918
  • [8] Fast Bilingual Grapheme-To-Phoneme Conversion
    Kim, Hwa-Yeon
    Kim, Jong-Hwan
    Kim, Jae-Min
    [J]. 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, NAACL-HLT 2022, 2022, : 289 - 296
  • [9] Transformer based Grapheme-to-Phoneme Conversion
    Yolchuyeva, Sevinj
    Nemeth, Geza
    Gyires-Toth, Balint
    [J]. INTERSPEECH 2019, 2019, : 2095 - 2099
  • [10] Arabic grapheme-to-phoneme conversion based on joint multi-gram model
    Cherifi, El-Hadi
    Guerti, Mhania
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 24 (01) : 173 - 182