Text normalization in mandarin Text-to-Speech system

被引:0
|
作者
Jia, Yuxiang [1 ,2 ]
Huang, Dezhi [2 ]
Liu, Wu [2 ]
Dong, Yuan [2 ,3 ]
Yu, Shiwen [1 ]
Wang, Haila [2 ]
机构
[1] Peking Univ, Inst Computat Linguist, Beijing 100871, Peoples R China
[2] France Telecom R&D Beijing, Speech & Nat Language Proc Unit, Beijing, Peoples R China
[3] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
关键词
Text-to-Speech (TTS); text normalization; finite state automata; maximum entropy classifier;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Text normalization is an important component in Text-to-Speech system and the difficulty in text normalization is to disambiguate the Non-Standard Words (NSWs). This paper develops a taxonomy of NSWs on the basis of a large scale Chinese corpus, and proposes a two-stage NSWs disambiguation strategy, Finite State Automata (FSA) for initial classification and Maximum Entropy (ME) classifiers for subclass disambiguation. Based on the above NSWs taxonomy, the two-stage approach achieves an F-score of 98.53% in open test, 5.23% higher than that of FSA based approach. Experiments show that the NSWs taxonomy ensures FSA a high baseline performance and ME classifiers make considerable improvement, and the two-stage approach adapts well to new domains.
引用
下载
收藏
页码:4693 / +
页数:2
相关论文
共 50 条
  • [1] A Mandarin text-to-speech system
    Hwang, SH
    Chen, SH
    Wang, YR
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1421 - 1424
  • [2] A THREE-STAGE TEXT NORMALIZATION STRATEGY FOR MANDARIN TEXT-TO-SPEECH SYSTEMS
    Zhou, Tao
    Dong, Yuan
    Huang, Dezhi
    Liu, Wu
    Wang, Haila
    2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 125 - 128
  • [3] NORMALIZATION OF TEXT MESSAGES FOR TEXT-TO-SPEECH
    Pennell, Deana L.
    Liu, Yang
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4842 - 4845
  • [4] An efficient Mandarin text-to-speech system on time domain
    Lin, YJ
    Yu, MS
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1998, E81D (06): : 545 - 555
  • [5] A Prosodic Mandarin Text-to-Speech System Based on Tacotron
    Zhang, Chuxiong
    Zhang, Sheng
    Zhong, Haibing
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 165 - 169
  • [6] The pause duration prediction for mandarin text-to-speech system
    Yu, J
    Tao, JH
    Proceedings of the 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE'05), 2005, : 204 - 208
  • [7] Pitch models of Mandarin text-to-speech
    邵艳秋
    穗志方
    韩纪庆
    Journal of Harbin Institute of Technology, 2009, 16 (02) : 179 - 184
  • [8] Pitch models of Mandarin text-to-speech
    邵艳秋
    穗志方
    韩纪庆
    Journal of Harbin Institute of Technology(New series), 2009, 16 (02) : 179 - 184
  • [9] Myanmar Number Normalization for Text-to-Speech
    Hlaing, Aye Mya
    Pa, Win Pa
    Thu, Ye Kyaw
    COMPUTATIONAL LINGUISTICS, PACLING 2017, 2018, 781 : 263 - 274
  • [10] An HMM-based Mandarin Chinese Text-to-Speech system
    Qian, Yao
    Soong, Frank
    Chen, Yining
    Chu, Min
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 223 - +