Pitch models of Mandarin text-to-speech

被引:0
|
作者
邵艳秋 [1 ,2 ]
穗志方 [1 ]
韩纪庆 [2 ]
机构
[1] Institute of Computational Linguistics,Peking University
[2] School of Computer Science and Technology,Harbin Institute of
关键词
D O I
暂无
中图分类号
学科分类号
摘要
The function of prosody model will directly affect the naturalness of synthesized speech.Aimed at the difficulty in generating the pitch contour in prosody model,two pitch models namely corpus-based pitch model and pitch pattern model are deeply studied in this paper.Key problems in the corpus-based model are calculation of the distance and searching of the optimal path with dynamic programming algorithm.For the pitch pattern model,parameters such as pitch pattern,pitch average and pitch range are used to describe the pitch contour,and six pitch patterns are presented.For the generation of pitch contour,the pitch pattern model is more flexible than the corpus-based model.Both of the two models are linked to the real TTS system,and the MOS results of synthesized Mandarin speech show that the pitch pattern model is better than the corpus-based pitch model.
引用
收藏
页码:179 / 184
页数:6
相关论文
共 50 条
  • [31] Mandarin Text-to-Speech Front-End With Lightweight Distilled Convolution Network
    Zhao, Wei
    Wang, Zuyi
    Xu, Li
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 249 - 253
  • [32] Software text-to-speech
    Hallahan W.I.
    International Journal of Speech Technology, 1997, 1 (2) : 121 - 134
  • [33] The Art of Text-to-Speech
    Lindquist, Benjamin
    CRITICAL INQUIRY, 2024, 50 (02) : 225 - 251
  • [34] TEXT-TO-SPEECH SYNTHESIS
    SPROAT, RW
    OLIVE, JP
    AT&T TECHNICAL JOURNAL, 1995, 74 (02): : 35 - 44
  • [35] Text-to-speech for customers
    不详
    EXPERT SYSTEMS, 1998, 15 (01) : 66 - 66
  • [36] IMPROVED MODELS FOR MANDARIN SPEECH-TO-TEXT TRANSCRIPTION
    Lamel, Lori
    Gauvain, Jean-Luc
    Viet Bac Le
    Oparin, Ilya
    Meng, Sha
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4660 - 4663
  • [38] On a cepstral technique for pitch control in the high quality text-to-speech type system
    Bae, MJ
    Lee, SH
    PROCEEDINGS OF THE 39TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS I-III, 1996, : 803 - 806
  • [39] An Improved Pitch Contour Formulation for Malay Language Storytelling Text-to-Speech (TTS)
    Ramli, Izzad
    Jamil, Nursuriati
    Seman, Noraini
    Ardi, Norizah
    2016 IEEE INDUSTRIAL ELECTRONICS AND APPLICATIONS CONFERENCE (IEACON), 2016, : 250 - 255
  • [40] NORMALIZATION OF TEXT MESSAGES FOR TEXT-TO-SPEECH
    Pennell, Deana L.
    Liu, Yang
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4842 - 4845