HMM-Based Vietnamese Speech Synthesis

Authors
Trinh Quoc Son [1]
Affiliations
[1] Univ Informat Technol, Fac Comp Sci, Hochiminh City, Vietnam
Keywords
Vietnamese speech synthesis; Tonal language; improving naturalness; HMM-based; STRAIGHT; TTS; SYSTEM
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
This paper describes an HMM-based speech synthesis system for Vietnamese designed to improve naturalness. In this synthesis method, trajectories of speech parameters are generated from trained hidden Markov models (HMMs), and the final speech waveform is synthesized from those parameters. The main objective of the development is to achieve maximum naturalness in the output speech, which is pursued through three key points. First, the system uses a high-quality recorded Vietnamese speech database suited to training, particularly for the statistical parametric approach. Second, prosodic information such as tone and part of speech (POS), together with features based on characteristics of the Vietnamese language, is added to ensure the quality of the synthetic speech. Third, the system uses STRAIGHT, which has shown its ability to produce high-quality voice manipulation and has been successfully incorporated into HMM-based speech synthesis. The collected results show that the speech produced by our system achieves the best results when compared with the other Vietnamese TTS systems trained on the same speech data.
Pages: 349-353
Page count: 5