Pitch models of Mandarin text-to-speech

Cited by: 0
Authors
Shao Yanqiu (邵艳秋) [1, 2]
Sui Zhifang (穗志方) [1]
Han Jiqing (韩纪庆) [2]
Affiliations
[1] Institute of Computational Linguistics, Peking University
[2] School of Computer Science and Technology, Harbin Institute of Technology
Funding
National Natural Science Foundation of China
Keywords
speech synthesis; prosody model; pitch model; pitch pattern
DOI
Not available
Chinese Library Classification number
TP391.41
Discipline classification code
080203
Abstract
The performance of the prosody model directly affects the naturalness of synthesized speech. To address the difficulty of generating the pitch contour within the prosody model, this paper studies two pitch models in depth: a corpus-based pitch model and a pitch pattern model. The key problems in the corpus-based model are computing the distance and searching for the optimal path with a dynamic programming algorithm. In the pitch pattern model, parameters such as pitch pattern, pitch average, and pitch range describe the pitch contour, and six pitch patterns are presented. For pitch contour generation, the pitch pattern model is more flexible than the corpus-based model. Both models were integrated into a real TTS system, and the MOS results for synthesized Mandarin speech show that the pitch pattern model performs better than the corpus-based pitch model.
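The abstract names distance calculation and a dynamic-programming path search as the key problems of the corpus-based model but does not give the formulas. The following is a minimal Python sketch of a generic search of that kind, not the authors' implementation. It assumes each syllable has a target pitch contour and several candidate contours from a corpus, that the distance is a mean squared difference in Hz, and that joining two candidates costs the pitch jump at their boundary. The names `contour_distance`, `join_cost`, `best_path`, and `join_weight` are hypothetical.

```python
from typing import List, Sequence

Contour = Sequence[float]  # pitch values in Hz for one syllable (assumed representation)


def contour_distance(target: Contour, candidate: Contour) -> float:
    """Assumed target distance: mean squared difference of equal-length contours."""
    return sum((t - c) ** 2 for t, c in zip(target, candidate)) / len(target)


def join_cost(left: Contour, right: Contour) -> float:
    """Assumed join cost: absolute pitch jump at the syllable boundary."""
    return abs(left[-1] - right[0])


def best_path(targets: List[Contour],
              candidates: List[List[Contour]],
              join_weight: float = 1.0) -> List[int]:
    """Return one candidate index per syllable minimising the total cost."""
    # cost[i][k]: cheapest total cost of any path ending at candidate k of syllable i
    cost = [[contour_distance(targets[0], c) for c in candidates[0]]]
    back: List[List[int]] = [[-1] * len(candidates[0])]

    for i in range(1, len(targets)):
        row_cost, row_back = [], []
        for cand in candidates[i]:
            # Cost of reaching this candidate from each predecessor.
            reach = [cost[i - 1][j] + join_weight * join_cost(prev, cand)
                     for j, prev in enumerate(candidates[i - 1])]
            j_best = min(range(len(reach)), key=reach.__getitem__)
            row_cost.append(reach[j_best] + contour_distance(targets[i], cand))
            row_back.append(j_best)
        cost.append(row_cost)
        back.append(row_back)

    # Backtrack from the cheapest final candidate.
    k = min(range(len(cost[-1])), key=cost[-1].__getitem__)
    path = [k]
    for i in range(len(targets) - 1, 0, -1):
        k = back[i][k]
        path.append(k)
    path.reverse()
    return path


targets = [[200.0, 210.0], [210.0, 190.0]]
candidates = [
    [[200.0, 212.0], [150.0, 160.0]],
    [[140.0, 130.0], [211.0, 188.0]],
]
print(best_path(targets, candidates))  # [0, 1]
```

Raising `join_weight` makes the search prefer smooth boundaries over close matches to each target, which is the trade-off a greedy per-syllable choice cannot capture.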
Pages: 179-184
Page count: 6