Pitch models of Mandarin text-to-speech

被引:0
|
作者
邵艳秋 [1 ,2 ]
穗志方 [1 ]
韩纪庆 [2 ]
机构
[1] Institute of Computational Linguistics,Peking University
[2] School of Computer Science and Technology,Harbin Institute of
关键词
D O I
暂无
中图分类号
学科分类号
摘要
The function of prosody model will directly affect the naturalness of synthesized speech.Aimed at the difficulty in generating the pitch contour in prosody model,two pitch models namely corpus-based pitch model and pitch pattern model are deeply studied in this paper.Key problems in the corpus-based model are calculation of the distance and searching of the optimal path with dynamic programming algorithm.For the pitch pattern model,parameters such as pitch pattern,pitch average and pitch range are used to describe the pitch contour,and six pitch patterns are presented.For the generation of pitch contour,the pitch pattern model is more flexible than the corpus-based model.Both of the two models are linked to the real TTS system,and the MOS results of synthesized Mandarin speech show that the pitch pattern model is better than the corpus-based pitch model.
引用
收藏
页码:179 / 184
页数:6
相关论文
共 50 条
  • [1] Pitch models of Mandarin text-to-speech
    邵艳秋
    穗志方
    韩纪庆
    Journal of Harbin Institute of Technology(New series), 2009, 16 (02) : 179 - 184
  • [2] A Mandarin text-to-speech system
    Hwang, SH
    Chen, SH
    Wang, YR
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1421 - 1424
  • [3] Text normalization in mandarin Text-to-Speech system
    Jia, Yuxiang
    Huang, Dezhi
    Liu, Wu
    Dong, Yuan
    Yu, Shiwen
    Wang, Haila
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4693 - +
  • [4] Hierarchical Stress Modeling in Mandarin Text-to-Speech
    Li, Ya
    Tao, Jianhua
    Xu, Xiaoying
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2024 - +
  • [5] FASTPITCH: PARALLEL TEXT-TO-SPEECH WITH PITCH PREDICTION
    Lancucki, Adrian
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6588 - 6592
  • [6] A consistency analysis on an acoustic module for Mandarin text-to-speech
    Yeh, Cheng-Yu
    Chang, Shun-Chieh
    Hwang, Shaw-Hwa
    SPEECH COMMUNICATION, 2013, 55 (02) : 266 - 277
  • [7] Refining Unit Boundaries for Mandarin Text-to-Speech Database
    Dong, Minghui
    Cen, Ling
    Chan, Paul
    Li, Haizhou
    2009 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, 2009, : 245 - 248
  • [8] The pause duration prediction for mandarin text-to-speech system
    Yu, J
    Tao, JH
    Proceedings of the 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE'05), 2005, : 204 - 208
  • [9] An efficient Mandarin text-to-speech system on time domain
    Lin, YJ
    Yu, MS
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1998, E81D (06): : 545 - 555
  • [10] A Prosodic Mandarin Text-to-Speech System Based on Tacotron
    Zhang, Chuxiong
    Zhang, Sheng
    Zhong, Haibing
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 165 - 169