Text normalization in Mandarin Text-to-Speech system

Cited by: 0

Authors:

Jia, Yuxiang ^{[1,2]}

Huang, Dezhi ^{[2]}

Liu, Wu ^{[2]}

Dong, Yuan ^{[2,3]}

Yu, Shiwen ^{[1]}

Wang, Haila ^{[2]}

Affiliations:

[1] Peking Univ, Inst Computat Linguist, Beijing 100871, Peoples R China

[2] France Telecom R&D Beijing, Speech & Nat Language Proc Unit, Beijing, Peoples R China

[3] Beijing Univ Posts & Telecommun, Beijing, Peoples R China

Source:

2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008

Keywords:

Text-to-Speech (TTS); text normalization; finite state automata; maximum entropy classifier

DOI:

Not available

Chinese Library Classification number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

Text normalization is an important component of a Text-to-Speech (TTS) system, and its main difficulty is disambiguating Non-Standard Words (NSWs). This paper develops a taxonomy of NSWs based on a large-scale Chinese corpus and proposes a two-stage NSW disambiguation strategy: Finite State Automata (FSA) for initial classification, followed by Maximum Entropy (ME) classifiers for subclass disambiguation. Based on this taxonomy, the two-stage approach achieves an F-score of 98.53% in the open test, 5.23% higher than that of the FSA-based approach. Experiments show that the NSW taxonomy gives the FSA a high baseline performance, that the ME classifiers improve on it considerably, and that the two-stage approach adapts well to new domains.
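
The two-stage strategy described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: regular expressions stand in for the FSA, a small gradient-trained log-linear model stands in for the ME classifiers, and the class names (`ratio_or_time`, `time`, `score`), the context features, and the four training examples are all hypothetical toy assumptions chosen only to show how the stages connect.

```python
import math
import re
from collections import defaultdict

# Stage 1: regular expressions stand in for finite state automata.
# Class names and patterns are illustrative, not the paper's taxonomy.
COARSE_PATTERNS = [
    ("date", re.compile(r"\d{4}-\d{1,2}-\d{1,2}")),
    ("ratio_or_time", re.compile(r"\d{1,2}:\d{2}")),  # ambiguous class
    ("number", re.compile(r"\d+")),
]


def coarse_class(token):
    """Return the first coarse class whose pattern matches the whole token."""
    for name, pattern in COARSE_PATTERNS:
        if pattern.fullmatch(token):
            return name
    return "other"


def context_features(left, right):
    """Hypothetical features: the words just left and right of the NSW."""
    return ["L=" + left, "R=" + right]


class MaxEnt:
    """Tiny maximum entropy (multinomial logistic) classifier."""

    def __init__(self, labels):
        self.labels = labels
        self.w = defaultdict(float)  # one weight per (feature, label) pair

    def probs(self, feats):
        scores = [sum(self.w[(f, y)] for f in feats) for y in self.labels]
        top = max(scores)
        exps = [math.exp(s - top) for s in scores]  # stable softmax
        total = sum(exps)
        return [e / total for e in exps]

    def train(self, data, epochs=200, lr=0.5):
        # Stochastic gradient ascent on the conditional log-likelihood.
        for _ in range(epochs):
            for feats, gold in data:
                for y, p in zip(self.labels, self.probs(feats)):
                    grad = (1.0 if y == gold else 0.0) - p
                    for f in feats:
                        self.w[(f, y)] += lr * grad

    def predict(self, feats):
        p = self.probs(feats)
        return self.labels[p.index(max(p))]


# Toy training data invented for this sketch (not from the paper's corpus).
TRAIN = [
    (context_features("下午", "开会"), "time"),   # "afternoon", "hold a meeting"
    (context_features("上午", "出发"), "time"),   # "morning", "set off"
    (context_features("比分", "领先"), "score"),  # "score", "lead"
    (context_features("以", "战胜"), "score"),    # "by", "defeat"
]

# Stage 2: one ME classifier per ambiguous coarse class.
SUBCLASSIFIERS = {"ratio_or_time": MaxEnt(["time", "score"])}
SUBCLASSIFIERS["ratio_or_time"].train(TRAIN)


def classify_nsw(token, left, right):
    """FSA-style coarse class first, then ME subclass if it is ambiguous."""
    coarse = coarse_class(token)
    model = SUBCLASSIFIERS.get(coarse)
    if model is None:
        return coarse
    return model.predict(context_features(left, right))


print(classify_nsw("3:15", "下午", "到达"))  # time
print(classify_nsw("3:15", "比分", "结束"))  # score
```

In this sketch only the ambiguous coarse class is routed to a second-stage classifier, so unambiguous tokens such as dates are resolved by the pattern stage alone.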

Pages: 4693 / +

Page count: 2
