Text normalization in mandarin Text-to-Speech system

被引:0
|
作者
Jia, Yuxiang [1 ,2 ]
Huang, Dezhi [2 ]
Liu, Wu [2 ]
Dong, Yuan [2 ,3 ]
Yu, Shiwen [1 ]
Wang, Haila [2 ]
机构
[1] Peking Univ, Inst Computat Linguist, Beijing 100871, Peoples R China
[2] France Telecom R&D Beijing, Speech & Nat Language Proc Unit, Beijing, Peoples R China
[3] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
关键词
Text-to-Speech (TTS); text normalization; finite state automata; maximum entropy classifier;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Text normalization is an important component in Text-to-Speech system and the difficulty in text normalization is to disambiguate the Non-Standard Words (NSWs). This paper develops a taxonomy of NSWs on the basis of a large scale Chinese corpus, and proposes a two-stage NSWs disambiguation strategy, Finite State Automata (FSA) for initial classification and Maximum Entropy (ME) classifiers for subclass disambiguation. Based on the above NSWs taxonomy, the two-stage approach achieves an F-score of 98.53% in open test, 5.23% higher than that of FSA based approach. Experiments show that the NSWs taxonomy ensures FSA a high baseline performance and ME classifiers make considerable improvement, and the two-stage approach adapts well to new domains.
引用
下载
收藏
页码:4693 / +
页数:2
相关论文
共 50 条
  • [41] TTTS: TURKISH TEXT-TO-SPEECH SYSTEM
    Gormez, Zeliha
    Orhan, Zeynep
    PROCEEDINGS OF THE 12TH WSEAS INTERNATIONAL CONFERENCE ON COMPUTERS , PTS 1-3: NEW ASPECTS OF COMPUTERS, 2008, : 977 - +
  • [42] EXPERIMENTAL TEXT-TO-SPEECH SYSTEM FOR HANDICAPPED
    CARLSON, R
    GRANSTROM, B
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 64 : S163 - S163
  • [43] Dealing with prosody in a text-to-speech system
    Goldsmith J.
    International Journal of Speech Technology, 1999, 3 (1) : 51 - 63
  • [44] Dealing with prosody in a text-to-speech system
    Goldsmith, John
    International Journal of Speech Technology, 1999, 3 (01): : 51 - 63
  • [45] EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model
    Cui, Chenye
    Ren, Yi
    Liu, Jinglin
    Chen, Feiyang
    Huang, Rongjie
    Lei, Ming
    Zhao, Zhou
    INTERSPEECH 2021, 2021, : 2766 - 2770
  • [46] A novel prosody adaptation method for Mandarin concatenation-based text-to-speech system
    National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, China
    Acoust. Sci. Technol., 1 (33-41):
  • [47] A novel prosody adaptation method for Mandarin concatenation-based text-to-speech system
    Yu, Jian
    Tao, Jianhua
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2009, 30 (01) : 33 - 41
  • [48] Automatic conversion from lexical words to prosodic words for mandarin text-to-speech system
    Shao, Yanqiu
    Han, Jiqing
    Liu, Ting
    Zhao, Yongzhen
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2007, 10 (01) : 45 - 55
  • [49] Text and Speech Corpora for Text-To-Speech Synthesis of Tales
    Doukhan, David
    Rosset, Sophie
    Rilliard, Albert
    d'Alessandro, Christophe
    Adda-Decker, Martine
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 1003 - 1010
  • [50] Software text-to-speech
    Hallahan W.I.
    International Journal of Speech Technology, 1997, 1 (2) : 121 - 134