Grapheme-to-phoneme conversion of Arabic numeral expressions for embedded TTS systems

被引:2
|
作者
Jung, Youngim [1 ]
Yoon, Aesun
Kwon, Hyuk-Chul
机构
[1] Pusan Natl Univ, Dept Comp Sci & Engn, Korean Language Proc Lab, Pusan 609735, South Korea
[2] Pusan Natl Univ, Dept French, Korean Language Proc Lab, Interdisciplinary Program Cognit Sci, Pusan 609735, South Korea
关键词
Arabic numeral; embedded text-to-speech (TTS) systems; rule-based grapheme-to-phoneme conversion systems; word sense disambiguation;
D O I
10.1109/TASL.2006.876761
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Despite the increasing need for accuracy, current text-to-speech (TTS) systems are still poor at generating the correct pronunciation of Arabic numerals due to their high ambiguity and various interpretations. In this paper, we propose a mini-transliteration system for Arabic-numeral expressions, which can efficiently and correctly convert Arabic numeral expressions found in Korean text into phonemes for embedded TTS systems. For the purpose of building grapheme-to-phoneme rules, we deduced the components of ANEs, and investigated their pattern and arithmetic features based on the analyzed corpus. A word sense disambiguation based on lexical hierarchies in KorLex 1.0 was developed to resolve ambi guities caused by the homographic components of the ANEs. Our system minimized the amount of memory used by 1) separating the morphological analysis module from the transliteration system, 2) compacting the lexicon size, and 3) removing named entities. It reduced the process time dramatically without any serious loss of accuracy, and showed an accuracy of 97.2%-98.3%, which was 21.4%-22.5% higher than that of the baseline, and 5.5%-19.5% higher than current commercial Korean TTS systems.
引用
收藏
页码:296 / 309
页数:14
相关论文
共 50 条
  • [21] MULTILINGUAL GRAPHEME-TO-PHONEME CONVERSION WITH BYTE REPRESENTATION
    Yu, Mingzhi
    Hieu Duy Nguyen
    Sokolov, Alex
    Lepird, Jack
    Sathyendra, Kanthashree Mysore
    Choudhary, Samridhi
    Mouchtaris, Athanasios
    Kunzmann, Siegfried
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8234 - 8238
  • [22] DNN-based grapheme-to-phoneme conversion for Arabic text-to-speech synthesis
    Ikbel Hadj Ali
    Zied Mnasri
    Zied Lachiri
    [J]. International Journal of Speech Technology, 2020, 23 : 569 - 584
  • [23] Adapting grapheme-to-phoneme conversion for name recognition
    Li, Xiao
    Gunawardana, Asela
    Acero, Alex
    [J]. 2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 130 - 135
  • [24] Grapheme-to-Phoneme Conversion with a Multilingual Transformer Model
    ElSaadany, Omnia
    Suter, Benjamin
    [J]. 17TH SIGMORPHON WORKSHOP ON COMPUTATIONAL RESEARCH IN PHONETICS PHONOLOGY, AND MORPHOLOGY (SIGMORPHON 2020), 2020, : 85 - 89
  • [25] DNN-based grapheme-to-phoneme conversion for Arabic text-to-speech synthesis
    Ali, Ikbel Hadj
    Mnasri, Zied
    Lachiri, Zied
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (03) : 569 - 584
  • [26] Linguistic Knowledge in Multilingual Grapheme-to-Phoneme Conversion
    Lo, Roger Yu-Hsiang
    Nicolai, Garrett
    [J]. SIGMORPHON 2021: 18TH SIGMORPHON WORKSHOP ON COMPUTATIONAL RESEARCH IN PHONETICS, PHONOLOGY, AND MORPHOLOGY, 2021, : 131 - 140
  • [27] A data-driven grapheme-to-phoneme conversion method using dynamic contextual converting rules for Korean TTS systems
    Lee, Jinsik
    Lee, Gary Geunbae
    [J]. COMPUTER SPEECH AND LANGUAGE, 2009, 23 (04): : 423 - 434
  • [28] Neural Machine Translation for Multilingual Grapheme-to-Phoneme Conversion
    Sokolov, Alex
    Rohlin, Tracy
    Rastrow, Ariya
    [J]. INTERSPEECH 2019, 2019, : 2065 - 2069
  • [29] Joint-sequence models for grapheme-to-phoneme conversion
    Bisani, Maximilian
    Ney, Hermann
    [J]. SPEECH COMMUNICATION, 2008, 50 (05) : 434 - 451
  • [30] A linguistically motivated approach to grapheme-to-phoneme conversion for Korean
    Yoon, Kyuchul
    Brew, Chris
    [J]. COMPUTER SPEECH AND LANGUAGE, 2006, 20 (04): : 357 - 381