An HMM-based speech synthesis system applied to English

Cited by: 0
Authors
Tokuda, K [1]
Zen, H [1]
Black, AW [1]
Affiliation
[1] Nagoya Inst Technol, Dept Comp Sci, Nagoya, Aichi, Japan
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
This paper describes an HMM-based speech synthesis system (HTS), in which the speech waveform is generated from the HMMs themselves, and applies it to English speech synthesis using the general speech synthesis architecture of Festival. Like other data-driven speech synthesis approaches, HTS has a compact language-dependent module: a list of contextual factors. Thus, it can easily be extended to other languages, although the first version of HTS was implemented for Japanese. The resulting run-time engine of HTS has the advantage of being small: less than 1 MB, excluding the text analysis part. Furthermore, HTS can easily change the voice characteristics of synthesized speech by using a speaker adaptation technique developed for speech recognition. The relation between the HMM-based approach and other unit selection approaches is also discussed.
Pages: 227-230
Page count: 4