Statistical parametric speech synthesis for Arabic language using ANN

被引：0

作者：

Ilyes, Rebai ^{[1
]}

BenAyed, Yassine ^{[1
]}

机构：

[1] Sfax Univ, MIRACL Multimedia InfoRmat Syst & Adv Comp Lab, Sfax, Tunisia

来源：

2014 1ST INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP 2014) | 2014年

关键词：

Statistical parametric; speech synthesis; neural networks;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Statistical parametric approach for speech synthesis becomes more popular over the concatenative approach due to the low size of the system and the high-quality speech. Moreover, few researches have been done in the field of speech synthesis for Arabic language with a poor quality of speech. In this paper, we propose a statistical parametric synthesis system for Arabic based on Artificial Neural Networks (ANN). Mel frequency Cepstral coefficients (MFCC), F0, energy and duration are the main components of our system. Speech waveform is generated from the predicted parameters F0, energy and MFCC. Different methods are proposed for this development process. In addition, we propose a method to solve the problem of discontinuities between neighboring segment boundaries in order to improve the speech quality. Experimental results of cepstral and prosodic parameters are given in this paper as well as the subjective evaluation.

引用

页码：452 / 457

页数：6

共 50 条

[1] Duration modelling and evaluation for Arabic statistical parametric speech synthesis
Zangar, Imene
Mnasri, Zied
Colotte, Vincent
Jouvet, Denis
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (06) : 8331 - 8353
[2] Duration modelling and evaluation for Arabic statistical parametric speech synthesis
Imene Zangar
Zied Mnasri
Vincent Colotte
Denis Jouvet
[J]. Multimedia Tools and Applications, 2021, 80 : 8331 - 8353
[3] Statistical Parametric Speech Synthesis Based on Speaker and Language Factorization
Zen, Heiga
Braunschweiler, Norbert
Buchholz, Sabine
Gales, Mark J. F.
Knill, Kate
Krstulovic, Sacha
Latorre, Javier
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (06): : 1713 - 1724
[4] Statistical parametric speech synthesis
Black, Alan W.
Zen, Heiga
Tokuda, Keiichi
[J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1229 - +
[5] Statistical parametric speech synthesis
Zen, Heiga
Tokuda, Keiichi
Black, Alan W.
[J]. SPEECH COMMUNICATION, 2009, 51 (11) : 1039 - 1064
[6] Statistical Formant Speech Synthesis for Arabic
Afshan Jafri
Ibrahim Sobh
Ashraf Alkhairy
[J]. Arabian Journal for Science and Engineering, 2015, 40 : 3151 - 3159
[7] Statistical Formant Speech Synthesis for Arabic
Jafri, Afshan
Sobh, Ibrahim
Alkhairy, Ashraf
[J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2015, 40 (11) : 3151 - 3159
[8] IMPLEMENTATION AND EVALUATION OF STATISTICAL PARAMETRIC SPEECH SYNTHESIS METHODS FOR THE PERSIAN LANGUAGE
Bahaadini, Sara
Sameti, Hossein
Khorram, Soheil
[J]. 2011 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2011,
[9] STATISTICAL PARAMETRIC SPEECH SYNTHESIS USING DEEP NEURAL NETWORKS
Zen, Heiga
Senior, Andrew
Schuster, Mike
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7962 - 7966
[10] Statistical parametric speech synthesis using a hidden trajectory model
Cai, Ming-Qi
Ling, Zhen-Hua
Dai, Li-Rong
[J]. SPEECH COMMUNICATION, 2015, 72 : 149 - 159

← 1 2 3 4 5 →