New Grapheme Generation Rules for Two-Stage Model-based Grapheme-to-Phoneme Conversion

被引:2
|
作者
Kheang, Seng [1 ]
Katsurada, Kouichi [1 ]
Iribe, Yurie [2 ]
Nitta, Tsuneo [1 ,3 ]
机构
[1] Toyohashi Univ Technol, 1-1 Tempaku, Toyohashi, Aichi 4418580, Japan
[2] Aichi Prefectural Univ, Nagakute, Aichi 4801198, Japan
[3] Waseda Univ, Shinjuku Ku, Tokyo 1698050, Japan
关键词
grapheme generation rules (GGR); combined grapheme-phoneme information; two-stage model; grapheme-to-phoneme (G2P); automatic text-to-phonetic transcription;
D O I
10.5614/itbj.ict.res.appl.2014.8.2.6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The precise conversion of arbitrary text into its corresponding phoneme sequence (grapheme-to-phoneme or G2P conversion) is implemented in speech synthesis and recognition, pronunciation learning software, spoken term detection and spoken document retrieval systems. Because the quality of this module plays an important role in the performance of such systems and many problems regarding G2P conversion have been reported, we propose a novel two-stage model-based approach, which is implemented using an existing weighted finite-state transducer-based G2P conversion framework, to improve the performance of the G2P conversion model. The first-stage model is built for automatic conversion of words to phonemes, while the second-stage model utilizes the input graphemes and output phonemes obtained from the first stage to determine the best final output phoneme sequence. Additionally, we designed new grapheme generation rules, which enable extra detail for the vowel and consonant graphemes appearing within a word. When compared with previous approaches, the evaluation results indicate that our approach using rules focusing on the vowel graphemes slightly improved the accuracy of the out-of-vocabulary dataset and consistently increased the accuracy of the in-vocabulary dataset.
引用
收藏
页码:157 / 174
页数:18
相关论文
共 50 条
  • [1] Novel Two-Stage Model for Grapheme-to-Phoneme Conversion using New Grapheme Generation Rules
    Kheang, Seng
    Katsurada, Kouichi
    Iribe, Yurie
    Nitta, Tsuneo
    [J]. 2014 INTERNATIONAL CONFERENCE OF ADVANCED INFORMATICS: CONCEPT, THEORY AND APPLICATION (ICAICTA), 2014, : 97 - 102
  • [2] Efficient two-stage processing for joint sequence model-based Thai grapheme-to-phoneme conversion
    Rugchatjaroen, Anocha
    Saychum, Sittipong
    Kongyoung, Sarawoot
    Chootrakool, Patcharika
    Kasuriya, Sawit
    Wutiwiwatchai, Chai
    [J]. SPEECH COMMUNICATION, 2019, 106 : 105 - 111
  • [3] The generation of letter-to-sound rules for grapheme-to-phoneme conversion
    Przybysz, Pawel
    Kasprzak, Wlodzimierz
    [J]. 2013 6TH INTERNATIONAL CONFERENCE ON HUMAN SYSTEM INTERACTIONS (HSI), 2013, : 292 - 297
  • [4] Using Reversed Sequences and Grapheme Generation Rules to Extend the Feasibility of a Phoneme Transition Network-Based Grapheme-to-Phoneme Conversion
    Kheang, Seng
    Katsurada, Kouichi
    Iribe, Yurie
    Nitta, Tsuneo
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (04): : 1182 - 1192
  • [5] Transformer based Grapheme-to-Phoneme Conversion
    Yolchuyeva, Sevinj
    Nemeth, Geza
    Gyires-Toth, Balint
    [J]. INTERSPEECH 2019, 2019, : 2095 - 2099
  • [6] Solving the Phoneme Conflict in Grapheme-to-Phoneme Conversion Using a Two-Stage Neural Network-Based Approach
    Kheang, Seng
    Katsurada, Kouichi
    Iribe, Yurie
    Nitta, Tsuneo
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (04): : 901 - 910
  • [7] Grapheme-to-Phoneme Conversion with a Multilingual Transformer Model
    ElSaadany, Omnia
    Suter, Benjamin
    [J]. 17TH SIGMORPHON WORKSHOP ON COMPUTATIONAL RESEARCH IN PHONETICS PHONOLOGY, AND MORPHOLOGY (SIGMORPHON 2020), 2020, : 85 - 89
  • [8] Fast Bilingual Grapheme-To-Phoneme Conversion
    Kim, Hwa-Yeon
    Kim, Jong-Hwan
    Kim, Jae-Min
    [J]. 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, NAACL-HLT 2022, 2022, : 289 - 296
  • [9] Incorporating syllabification points into a model of grapheme-to-phoneme conversion
    Suyanto Suyanto
    [J]. International Journal of Speech Technology, 2019, 22 : 459 - 470
  • [10] Incorporating syllabification points into a model of grapheme-to-phoneme conversion
    Suyanto, Suyanto
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (02) : 459 - 470