Pitch models of Mandarin text-to-speech

被引：0

作者：

邵艳秋 ^{[1
,2
]}

穗志方 ^{[1
]}

韩纪庆 ^{[2
]}

机构：

[1] Institute of Computational Linguistics,Peking University

[2] School of Computer Science and Technology,Harbin Institute of Technology

来源：

Journal of Harbin Institute of Technology(New series) | 2009年 / 16卷 / 02期

基金：

中国国家自然科学基金;

关键词：

speech synthesis; prosody model; pitch model; pitch pattern;

D O I：

暂无

中图分类号：

TP391.41 [];

学科分类号：

080203 ;

摘要：

The function of prosody model will directly affect the naturalness of synthesized speech.Aimed at the difficulty in generating the pitch contour in prosody model,two pitch models namely corpus-based pitch model and pitch pattern model are deeply studied in this paper.Key problems in the corpus-based model are calculation of the distance and searching of the optimal path with dynamic programming algorithm.For the pitch pattern model,parameters such as pitch pattern,pitch average and pitch range are used to describe the pitch contour,and six pitch patterns are presented.For the generation of pitch contour,the pitch pattern model is more flexible than the corpus-based model.Both of the two models are linked to the real TTS system,and the MOS results of synthesized Mandarin speech show that the pitch pattern model is better than the corpus-based pitch model.

引用

页码：179 / 184

页数：6

共 50 条

[1] A Mandarin text-to-speech system
Hwang, SH
Chen, SH
Wang, YR
[J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1421 - 1424
[2] Text normalization in mandarin Text-to-Speech system
Jia, Yuxiang
Huang, Dezhi
Liu, Wu
Dong, Yuan
Yu, Shiwen
Wang, Haila
[J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4693 - +
[3] Hierarchical Stress Modeling in Mandarin Text-to-Speech
Li, Ya
Tao, Jianhua
Xu, Xiaoying
[J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2024 - +
[4] FASTPITCH: PARALLEL TEXT-TO-SPEECH WITH PITCH PREDICTION
Lancucki, Adrian
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6588 - 6592
[5] A consistency analysis on an acoustic module for Mandarin text-to-speech
Yeh, Cheng-Yu
Chang, Shun-Chieh
Hwang, Shaw-Hwa
[J]. SPEECH COMMUNICATION, 2013, 55 (02) : 266 - 277
[6] Refining Unit Boundaries for Mandarin Text-to-Speech Database
Dong, Minghui
Cen, Ling
Chan, Paul
Li, Haizhou
[J]. 2009 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, 2009, : 245 - 248
[7] The pause duration prediction for mandarin text-to-speech system
Yu, J
Tao, JH
[J]. Proceedings of the 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE'05), 2005, : 204 - 208
[8] A Prosodic Mandarin Text-to-Speech System Based on Tacotron
Zhang, Chuxiong
Zhang, Sheng
Zhong, Haibing
[J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 165 - 169
[9] An efficient Mandarin text-to-speech system on time domain
Lin, YJ
Yu, MS
[J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1998, E81D (06): : 545 - 555
[10] An enhanced text analysis approach in text-to-speech synthesis for mandarin chinese
Jiang, Wei
Wang, Xiao-Long
Guan, Yi
Pang, Xiu-Li
[J]. ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 5, PROCEEDINGS, 2007, : 410 - +

← 1 2 3 4 5 →