A THREE-STAGE TEXT NORMALIZATION STRATEGY FOR MANDARIN TEXT-TO-SPEECH SYSTEMS

Authors
Zhou, Tao [1]
Dong, Yuan [1, 2]
Huang, Dezhi [2]
Liu, Wu [2]
Wang, Haila [2]
Affiliations
[1] Beijing Univ Posts & Telecommun, Beijing 100088, Peoples R China
[2] France Telecom R & D, Beijing, Peoples R China
Keywords
Text-to-Speech; Text Normalization; Finite State Automata (FSA); Maximum Entropy (ME) Classifier; Standard Word Conversion;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Text normalization is an important component of a Mandarin text-to-speech (TTS) system. This paper develops a taxonomy of non-standard words (NSWs) based on a large-scale Chinese corpus and proposes a three-stage text normalization strategy: finite state automata (FSA) for initial classification, a maximum entropy (ME) classifier combined with rules for further classification, and general rules for standard word conversion. In experiments, the three-stage approach achieves a precision of 96.02%, which is 5.21% higher than a simple rule-based approach and 2.21% higher than a simple machine learning method. The experimental results show that the three-stage disambiguation strategy considerably improves text normalization and works well in a real TTS system.
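The three stages named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation and makes several assumptions: regular expressions stand in for the finite state automata, one hand-written context rule stands in for the maximum entropy classifier, and the categories (`percent`, `number`, `year`, `cardinal`) and all function names are hypothetical rather than taken from the paper's taxonomy. Cardinal reading is only handled below 10000.

```python
import re

DIGITS = "零一二三四五六七八九"

# Stage 1 (assumption: regexes stand in for the FSA).
# Each pattern assigns a coarse, possibly ambiguous category.
COARSE_PATTERNS = [
    ("percent", re.compile(r"[0-9]+(?:\.[0-9]+)?%")),
    ("number", re.compile(r"[0-9]+")),
]


def find_nsws(text):
    """Return (start, end, token, coarse_type) for each non-standard word."""
    spans, pos = [], 0
    while pos < len(text):
        for label, pattern in COARSE_PATTERNS:
            m = pattern.match(text, pos)
            if m:
                spans.append((m.start(), m.end(), m.group(), label))
                pos = m.end()
                break
        else:
            pos += 1
    return spans


# Stage 2 (assumption: one context rule stands in for the ME classifier).
def refine(text, end, token, coarse):
    """Resolve the ambiguous 'number' category using the following character."""
    if coarse == "number":
        if len(token) == 4 and text[end:end + 1] == "年":
            return "year"
        return "cardinal"
    return coarse


# Stage 3: general rules that convert each category to standard words.
def read_digits(token):
    """Read a digit string one digit at a time."""
    return "".join(DIGITS[int(c)] for c in token)


def read_cardinal(token):
    """Read an integer below 10000 as a cardinal number."""
    n = int(token)
    if n == 0:
        return "零"
    if n >= 10000:
        return read_digits(token)  # outside the scope of this sketch
    units = ["", "十", "百", "千"]
    out, pending_zero = "", False
    for power in range(3, -1, -1):
        d = (n // 10 ** power) % 10
        if d == 0:
            pending_zero = bool(out)  # only interior zeros are spoken
        else:
            if pending_zero:
                out += "零"
                pending_zero = False
            out += DIGITS[d] + units[power]
    if 10 <= n < 20:
        out = out[1:]  # drop the leading 一 in 一十五
    return out


def read_percent(token):
    """Read a percentage such as 96.02%."""
    integer, _, fraction = token[:-1].partition(".")
    spoken = "百分之" + read_cardinal(integer)
    if fraction:
        spoken += "点" + read_digits(fraction)
    return spoken


CONVERTERS = {"year": read_digits, "cardinal": read_cardinal, "percent": read_percent}


def normalize(text):
    """Run the three stages and return text containing only standard words."""
    out, last = [], 0
    for start, end, token, coarse in find_nsws(text):
        out.append(text[last:start])
        out.append(CONVERTERS[refine(text, end, token, coarse)](token))
        last = end
    out.append(text[last:])
    return "".join(out)
```

Under these assumptions, `normalize("2008年准确率为96.02%")` reads the four-digit token before 年 digit by digit as a year and expands the percentage with 百分之. The sketch keeps detection, disambiguation, and conversion separate so that the stage 2 rule could be replaced by a trained classifier without touching the other stages.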
Pages: 125 - 128
Number of pages: 4
Related papers
  • [1] Text normalization in mandarin Text-to-Speech system
    Jia, Yuxiang
    Huang, Dezhi
    Liu, Wu
    Dong, Yuan
    Yu, Shiwen
    Wang, Haila
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4693 - +
  • [2] A two-stage prosodic structure generation strategy for Mandarin text-to-speech systems
    Dong, Yuan
    Zhou, Tao
    Dong, Cheng-Yu
    Wang, Hai-La
    [J]. Zidonghua Xuebao/Acta Automatica Sinica, 2010, 36 (11) : 1569 - 1574
  • [3] NORMALIZATION OF TEXT MESSAGES FOR TEXT-TO-SPEECH
    Pennell, Deana L.
    Liu, Yang
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4842 - 4845
  • [4] A Mandarin text-to-speech system
    Hwang, SH
    Chen, SH
    Wang, YR
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1421 - 1424
  • [5] Pitch models of Mandarin text-to-speech
    邵艳秋
    穗志方
    韩纪庆
    [J]. Journal of Harbin Institute of Technology, 2009, 16 (02) : 179 - 184
  • [6] Pitch models of Mandarin text-to-speech
    邵艳秋
    穗志方
    韩纪庆
    [J]. Journal of Harbin Institute of Technology(New series), 2009, 16 (02) : 179 - 184
  • [7] Myanmar Number Normalization for Text-to-Speech
    Hlaing, Aye Mya
    Pa, Win Pa
    Thu, Ye Kyaw
    [J]. COMPUTATIONAL LINGUISTICS, PACLING 2017, 2018, 781 : 263 - 274
  • [8] Hierarchical Stress Modeling in Mandarin Text-to-Speech
    Li, Ya
    Tao, Jianhua
    Xu, Xiaoying
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2024 - +
  • [9] An enhanced text analysis approach in text-to-speech synthesis for mandarin chinese
    Jiang, Wei
    Wang, Xiao-Long
    Guan, Yi
    Pang, Xiu-Li
    [J]. ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 5, PROCEEDINGS, 2007, : 410 - +
  • [10] A text analyzer for Korean text-to-speech systems
    Lee, SH
    Oh, YH
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1692 - 1695