Text normalization in mandarin Text-to-Speech system

被引:0
|
作者
Jia, Yuxiang [1 ,2 ]
Huang, Dezhi [2 ]
Liu, Wu [2 ]
Dong, Yuan [2 ,3 ]
Yu, Shiwen [1 ]
Wang, Haila [2 ]
机构
[1] Peking Univ, Inst Computat Linguist, Beijing 100871, Peoples R China
[2] France Telecom R&D Beijing, Speech & Nat Language Proc Unit, Beijing, Peoples R China
[3] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
关键词
Text-to-Speech (TTS); text normalization; finite state automata; maximum entropy classifier;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Text normalization is an important component in Text-to-Speech system and the difficulty in text normalization is to disambiguate the Non-Standard Words (NSWs). This paper develops a taxonomy of NSWs on the basis of a large scale Chinese corpus, and proposes a two-stage NSWs disambiguation strategy, Finite State Automata (FSA) for initial classification and Maximum Entropy (ME) classifiers for subclass disambiguation. Based on the above NSWs taxonomy, the two-stage approach achieves an F-score of 98.53% in open test, 5.23% higher than that of FSA based approach. Experiments show that the NSWs taxonomy ensures FSA a high baseline performance and ME classifiers make considerable improvement, and the two-stage approach adapts well to new domains.
引用
下载
收藏
页码:4693 / +
页数:2
相关论文
共 50 条
  • [31] COSEGMENTATION IN THE IBM TEXT-TO-SPEECH SYSTEM
    PICKERING, JB
    PROCEEDINGS : INSTITUTE OF ACOUSTICS, VOL 8, PART 7: SPEECH & HEARING, 1986, 8 : 385 - 392
  • [32] TOWARD AN ARABIC TEXT-TO-SPEECH SYSTEM
    AHMED, ME
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 1991, 16 (04): : 565 - 583
  • [33] Study on Cantonese text-to-speech system
    Long, Qinghua
    Jing, Huisheng
    Ren, Ping
    Situ, Xikang
    Shengxue Xuebao/Acta Acustica, 1993, 18 (02): : 143 - 147
  • [34] Burmese Speech Corpus, Finite-State Text Normalization and Pronunciation Grammars with an Application to Text-to-Speech
    Oo, Yin May
    Wattanavekin, Theeraphol
    Li, Chenfang
    De Silva, Pasindu
    Sarin, Supheakmungkol
    Pipatsrisawat, Knot
    Jansche, Martin
    Kjartansson, Oddur
    Gutkin, Alexander
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6328 - 6339
  • [35] JAPANESE TEXT-TO-SPEECH CONVERSION SYSTEM
    SATO, H
    REVIEW OF THE ELECTRICAL COMMUNICATIONS LABORATORIES, 1984, 32 (02): : 179 - 187
  • [36] Whistler: A trainable text-to-speech system
    Huang, XD
    Acero, A
    Adcock, J
    Hon, HW
    Goldsmith, J
    Liu, JS
    Plumpe, M
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2387 - 2390
  • [37] Japanese Text-to-Speech Conversion System
    1600, (The International Society for Computers and Their Applications (ISCA)):
  • [38] Slovenian text-to-speech system GOVOREC
    Šef, Tomaž
    Elektrotehniski Vestnik/Electrotechnical Review, 2002, 69 (3-4): : 165 - 170
  • [39] TEXT-TO-SPEECH TRANSLATION SYSTEM FOR ITALIAN
    LESMO, L
    MEZZALAMA, M
    TORASSO, P
    INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1978, 10 (05): : 569 - 591
  • [40] MORPHOPHONOLOGY IN THE CSTR TEXT-TO-SPEECH SYSTEM
    SHOCKEY, L
    PROCEEDINGS : INSTITUTE OF ACOUSTICS, VOL 8, PART 7: SPEECH & HEARING, 1986, 8 : 393 - 398