Text normalization in mandarin Text-to-Speech system

被引：0

作者：

Jia, Yuxiang ^{[1
,2
]}

Huang, Dezhi ^{[2
]}

Liu, Wu ^{[2
]}

Dong, Yuan ^{[2
,3
]}

Yu, Shiwen ^{[1
]}

Wang, Haila ^{[2
]}

机构：

[1] Peking Univ, Inst Computat Linguist, Beijing 100871, Peoples R China

[2] France Telecom R&D Beijing, Speech & Nat Language Proc Unit, Beijing, Peoples R China

[3] Beijing Univ Posts & Telecommun, Beijing, Peoples R China

来源：

2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008年

关键词：

Text-to-Speech (TTS); text normalization; finite state automata; maximum entropy classifier;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Text normalization is an important component in Text-to-Speech system and the difficulty in text normalization is to disambiguate the Non-Standard Words (NSWs). This paper develops a taxonomy of NSWs on the basis of a large scale Chinese corpus, and proposes a two-stage NSWs disambiguation strategy, Finite State Automata (FSA) for initial classification and Maximum Entropy (ME) classifiers for subclass disambiguation. Based on the above NSWs taxonomy, the two-stage approach achieves an F-score of 98.53% in open test, 5.23% higher than that of FSA based approach. Experiments show that the NSWs taxonomy ensures FSA a high baseline performance and ME classifiers make considerable improvement, and the two-stage approach adapts well to new domains.

引用

下载

页码：4693 / +

页数：2

共 50 条

[31] COSEGMENTATION IN THE IBM TEXT-TO-SPEECH SYSTEM
PICKERING, JB
PROCEEDINGS : INSTITUTE OF ACOUSTICS, VOL 8, PART 7: SPEECH & HEARING, 1986, 8 : 385 - 392
[32] TOWARD AN ARABIC TEXT-TO-SPEECH SYSTEM
AHMED, ME
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 1991, 16 (04): : 565 - 583
[33] Study on Cantonese text-to-speech system
Long, Qinghua
Jing, Huisheng
Ren, Ping
Situ, Xikang
Shengxue Xuebao/Acta Acustica, 1993, 18 (02): : 143 - 147
[34] Burmese Speech Corpus, Finite-State Text Normalization and Pronunciation Grammars with an Application to Text-to-Speech
Oo, Yin May
Wattanavekin, Theeraphol
Li, Chenfang
De Silva, Pasindu
Sarin, Supheakmungkol
Pipatsrisawat, Knot
Jansche, Martin
Kjartansson, Oddur
Gutkin, Alexander
PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6328 - 6339
[35] JAPANESE TEXT-TO-SPEECH CONVERSION SYSTEM
SATO, H
REVIEW OF THE ELECTRICAL COMMUNICATIONS LABORATORIES, 1984, 32 (02): : 179 - 187
[36] Whistler: A trainable text-to-speech system
Huang, XD
Acero, A
Adcock, J
Hon, HW
Goldsmith, J
Liu, JS
Plumpe, M
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2387 - 2390
[37] Japanese Text-to-Speech Conversion System
1600, (The International Society for Computers and Their Applications (ISCA)):
[38] Slovenian text-to-speech system GOVOREC
Šef, Tomaž
Elektrotehniski Vestnik/Electrotechnical Review, 2002, 69 (3-4): : 165 - 170
[39] TEXT-TO-SPEECH TRANSLATION SYSTEM FOR ITALIAN
LESMO, L
MEZZALAMA, M
TORASSO, P
INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1978, 10 (05): : 569 - 591
[40] MORPHOPHONOLOGY IN THE CSTR TEXT-TO-SPEECH SYSTEM
SHOCKEY, L
PROCEEDINGS : INSTITUTE OF ACOUSTICS, VOL 8, PART 7: SPEECH & HEARING, 1986, 8 : 393 - 398

← 1 2 3 4 5 →