A Maximum Entropy Approach to Chinese Grapheme-to-Phoneme Conversion

被引:0
|
作者
Tsai, Richard Tzong-Han [1 ]
Wang, Yu-Chun [2 ]
机构
[1] Yuan Ze Univ, Dept Comp Sci & Engn, Tao Yuan, Taiwan
[2] Chunghwa Telecommun Labs, Taipei, Taiwan
关键词
Speech Synthesis; Chinese Grapheme-to-Phoneme Conversion; Maximum Entropy Model;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Grapheme-to-phoneme (G2P) conversion plays an important role in speech synthesis. The main difficulty facing Chinese G2P conversion is that many Chinese characters are polyphonic, having more than one pronunciation. A Chinese G2P system must be able to pick the correct pronunciation from among several candidates. Contextual information on neighboring characters such as character n-grams, phonetic information, or position of the polyphone in a word or sentence is the key to correct prediction. Most previous works employed rule-based or rule-learning methods, which often suffered from data sparseness. In this paper, we propose a novel G2P approach to avoid data sparseness. Our method uses the maximum entropy (ME) model framework to represent contextual information as ME features. Our system achieves a top accuracy of 99.84%, which is significantly higher than other state-of-the-art rule-based and rule-learning methods. In addition, our approach consistently improves accuracy regardless of a character's main pronunciation ratio. Further analysis also shows that the ME model is fast and efficient, requiring much less training and labeling time.
引用
收藏
页码:411 / +
页数:2
相关论文
共 50 条
  • [1] Grapheme-to-phoneme conversion in Chinese TTS system
    Dong, HH
    Tao, JH
    Xu, B
    [J]. 2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 165 - 168
  • [2] Label Embedding for Chinese Grapheme-to-Phoneme Conversion
    Choi, Eunbi
    Kim, Hwa-Yeon
    Kim, Jong-Hwan
    Kim, Jae-Min
    [J]. INTERSPEECH 2021, 2021, : 4094 - 4098
  • [3] Polyphone Disambiguation Based on Maximum Entropy Model in Mandarin Grapheme-to-Phoneme Conversion
    Liu, Fangzhou
    Zhou, You
    [J]. MATERIALS ENGINEERING FOR ADVANCED TECHNOLOGIES, PTS 1 AND 2, 2011, 480-481 : 1043 - +
  • [4] A linguistically motivated approach to grapheme-to-phoneme conversion for Korean
    Yoon, Kyuchul
    Brew, Chris
    [J]. COMPUTER SPEECH AND LANGUAGE, 2006, 20 (04): : 357 - 381
  • [5] Fast Bilingual Grapheme-To-Phoneme Conversion
    Kim, Hwa-Yeon
    Kim, Jong-Hwan
    Kim, Jae-Min
    [J]. 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, NAACL-HLT 2022, 2022, : 289 - 296
  • [6] A Survey of Grapheme-to-Phoneme Conversion Methods
    Cheng, Shiyang
    Zhu, Pengcheng
    Liu, Jueting
    Wang, Zehua
    [J]. Applied Sciences (Switzerland), 2024, 14 (24):
  • [7] Transformer based Grapheme-to-Phoneme Conversion
    Yolchuyeva, Sevinj
    Nemeth, Geza
    Gyires-Toth, Balint
    [J]. INTERSPEECH 2019, 2019, : 2095 - 2099
  • [8] Grapheme-to-Phoneme Conversion with Convolutional Neural Networks
    Yolchuyeva, Sevinj
    Nemeth, Geza
    Gyires-Toth, Balint
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (06):
  • [9] Frustratingly Easy Multilingual Grapheme-to-Phoneme Conversion
    Prabhu, Nikhil
    Kann, Katharina
    [J]. 17TH SIGMORPHON WORKSHOP ON COMPUTATIONAL RESEARCH IN PHONETICS PHONOLOGY, AND MORPHOLOGY (SIGMORPHON 2020), 2020, : 123 - 127
  • [10] Automatic Grapheme-to-Phoneme Conversion of Arabic Text
    Al-Daradkah, Belal
    Al-Diri, Bashir
    [J]. 2015 SCIENCE AND INFORMATION CONFERENCE (SAI), 2015, : 468 - 473