Statistical parametric speech synthesis for Arabic language using ANN

被引:0
|
作者
Ilyes, Rebai [1 ]
BenAyed, Yassine [1 ]
机构
[1] Sfax Univ, MIRACL Multimedia InfoRmat Syst & Adv Comp Lab, Sfax, Tunisia
关键词
Statistical parametric; speech synthesis; neural networks;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Statistical parametric approach for speech synthesis becomes more popular over the concatenative approach due to the low size of the system and the high-quality speech. Moreover, few researches have been done in the field of speech synthesis for Arabic language with a poor quality of speech. In this paper, we propose a statistical parametric synthesis system for Arabic based on Artificial Neural Networks (ANN). Mel frequency Cepstral coefficients (MFCC), F0, energy and duration are the main components of our system. Speech waveform is generated from the predicted parameters F0, energy and MFCC. Different methods are proposed for this development process. In addition, we propose a method to solve the problem of discontinuities between neighboring segment boundaries in order to improve the speech quality. Experimental results of cepstral and prosodic parameters are given in this paper as well as the subjective evaluation.
引用
收藏
页码:452 / 457
页数:6
相关论文
共 50 条
  • [1] Duration modelling and evaluation for Arabic statistical parametric speech synthesis
    Zangar, Imene
    Mnasri, Zied
    Colotte, Vincent
    Jouvet, Denis
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (06) : 8331 - 8353
  • [2] Duration modelling and evaluation for Arabic statistical parametric speech synthesis
    Imene Zangar
    Zied Mnasri
    Vincent Colotte
    Denis Jouvet
    [J]. Multimedia Tools and Applications, 2021, 80 : 8331 - 8353
  • [3] Statistical Parametric Speech Synthesis Based on Speaker and Language Factorization
    Zen, Heiga
    Braunschweiler, Norbert
    Buchholz, Sabine
    Gales, Mark J. F.
    Knill, Kate
    Krstulovic, Sacha
    Latorre, Javier
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (06): : 1713 - 1724
  • [4] Statistical parametric speech synthesis
    Black, Alan W.
    Zen, Heiga
    Tokuda, Keiichi
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1229 - +
  • [5] Statistical parametric speech synthesis
    Zen, Heiga
    Tokuda, Keiichi
    Black, Alan W.
    [J]. SPEECH COMMUNICATION, 2009, 51 (11) : 1039 - 1064
  • [6] Statistical Formant Speech Synthesis for Arabic
    Afshan Jafri
    Ibrahim Sobh
    Ashraf Alkhairy
    [J]. Arabian Journal for Science and Engineering, 2015, 40 : 3151 - 3159
  • [7] Statistical Formant Speech Synthesis for Arabic
    Jafri, Afshan
    Sobh, Ibrahim
    Alkhairy, Ashraf
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2015, 40 (11) : 3151 - 3159
  • [8] IMPLEMENTATION AND EVALUATION OF STATISTICAL PARAMETRIC SPEECH SYNTHESIS METHODS FOR THE PERSIAN LANGUAGE
    Bahaadini, Sara
    Sameti, Hossein
    Khorram, Soheil
    [J]. 2011 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2011,
  • [9] STATISTICAL PARAMETRIC SPEECH SYNTHESIS USING DEEP NEURAL NETWORKS
    Zen, Heiga
    Senior, Andrew
    Schuster, Mike
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7962 - 7966
  • [10] Statistical parametric speech synthesis using a hidden trajectory model
    Cai, Ming-Qi
    Ling, Zhen-Hua
    Dai, Li-Rong
    [J]. SPEECH COMMUNICATION, 2015, 72 : 149 - 159