Arabic HMM-based Speech Synthesis

被引:0
|
作者
Khalil, Krichi Mohamed [1 ]
Adnan, Cherif [1 ]
机构
[1] Sci Sci Fac Tunis, Signal Proc Lab, Sft 1060, Tunisia
关键词
HMM; Speech Synthesis; Text to Speech; Arabic Language; Statistical Parametric Speech Synthesis; Hidden Markov Model;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
this paper describes the Arabic system synthesis on hidden Markov models (HTS). Our developed synthesis system uses phonemes as HMM synthesis unit, Arabic database was developed for the first test. The main objective is to maintain the consolidated text coherence which is interpreted by concatenating HMM phoneme. In our experiments, spectral properties were represented by Mel cepstrum coefficients. For the waveform synthesis, a noise or pulse excited corresponding MLSA filter was utilized. Besides that basic setup, a high-quality analysis/synthesis system STRAIGHT was employed for more sophisticated speech representation. This method has several advantages. As it is parametric, it is possible to play on the HMM parameters, change the producer voice characteristics. The developed model improves the speech synthesis, naturalness and intelligibility quality in the Arabic language environment.
引用
收藏
页码:450 / 454
页数:5
相关论文
共 50 条
  • [41] An acoustic model adaptation using hmm-based speech synthesis
    Tanaka, K
    Kuroiwa, S
    Tsuge, S
    Ren, F
    2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 368 - 373
  • [42] A Covariance-Tying Technique for HMM-Based Speech Synthesis
    Oura, Keiichiro
    Zen, Heiga
    Nankaku, Yoshihiko
    Lee, Akinobu
    Tokuda, Keiichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (03): : 595 - 601
  • [43] Two-band excitation for HMM-based speech synthesis
    Kim, Sang-Jin
    Hahn, Minsoo
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2007, E90D (01) : 378 - 381
  • [44] FACTOR ANALYZED VOICE MODELS FOR HMM-BASED SPEECH SYNTHESIS
    Kazumi, Kyosuke
    Nankaku, Yoshihiko
    Tokuda, Keiichi
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4234 - 4237
  • [45] Data Selection and Adaptation for Naturalness in HMM-based Speech Synthesis
    Cooper, Erica
    Chang, Alison
    Levitan, Yocheved
    Hirschberg, Julia
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 357 - +
  • [46] Emotion transplantation through adaptation in HMM-based speech synthesis
    Lorenzo-Trueba, Jaime
    Barra-Chicote, Roberto
    San-Segundo, Ruben
    Ferreiros, Javier
    Yamagishi, Junichi
    Montero, Juan M.
    COMPUTER SPEECH AND LANGUAGE, 2015, 34 (01): : 292 - 307
  • [47] CONTEXTUAL PARTIAL ADDITIVE STRUCTURE FOR HMM-BASED SPEECH SYNTHESIS
    Takaki, Shinji
    Nankaku, Yoshihiko
    Tokuda, Keiichi
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7878 - 7882
  • [48] Speaker adaptation of pitch and spectrum for HMM-based speech synthesis
    Tamura, M., 1600, John Wiley and Sons Inc. (35):
  • [49] Frequency Warping for Speaker Adaptation in HMM-based Speech Synthesis
    Gao, Weixun
    Cao, Qiying
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2014, 30 (04) : 1149 - 1166
  • [50] Bidirectional HMM-based Arabic POS tagging
    Kadim, Ayoub
    Lazrek, Azzeddine
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2016, 19 (02) : 303 - 312