Statistical Formant Speech Synthesis for Arabic

被引：4

作者：

Jafri, Afshan ^{[1
]}

Sobh, Ibrahim ^{[1
]}

Alkhairy, Ashraf ^{[2
]}

机构：

[1] King Saud Univ, Riyadh, Saudi Arabia

[2] King Abdul Aziz City Sci & Technol, Riyadh, Saudi Arabia

来源：

ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING | 2015年 / 40卷 / 11期

关键词：

Arabic speech synthesis; Rule based; Formants; Parametric synthesis; Pronunciation algorithm; HMM; PITCH;

D O I：

10.1007/s13369-015-1771-1

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

This work constructs a hybrid system that integrates formant synthesis and context-dependent Hidden Semi-Markov Models (HSMM). HSMM parameters comprise of formants, fundamental frequency, voicing/frication amplitude, and duration. For HSMM training, formants, fundamental frequency, and voicing/frication amplitude are extracted from waveforms using the Snack toolbox and a decomposition algorithm, and duration is calculated using HMM modeled by multivariate Gaussian distribution. The acoustic features are then generated from the trained HSMM models and combined with default values of complementary acoustic features such as glottal waveform parameters to produce speech waveforms utilizing the Klatt synthesizer. We construct the text processor for phonetic transcription required at the training and synthesis phases by utilizing phonemic pronunciation algorithms. A perceptual test reveals that the statistical formant speech text-to-speech system produces good-quality speech while utilizing features that are small in dimension and close to speech perception cues.

引用

页码：3151 / 3159

页数：9

共 50 条

[1] Statistical Formant Speech Synthesis for Arabic
Afshan Jafri
Ibrahim Sobh
Ashraf Alkhairy
[J]. Arabian Journal for Science and Engineering, 2015, 40 : 3151 - 3159
[2] Pitch detection and formant analysis of Arabic speech processing
Cherif, A
Bouafif, L
Dabbabi, T
[J]. APPLIED ACOUSTICS, 2001, 62 (10) : 1129 - 1140
[3] FORMANT BASED SPEECH SYNTHESIS
HUGHES, PM
[J]. BRITISH TELECOM TECHNOLOGY JOURNAL, 1988, 6 (02): : 84 - 90
[4] ASPECTS OF FORMANT SPEECH SYNTHESIS
SAPOZHKO.MA
[J]. TELECOMMUNICATIONS AND RADIO ENGINEER-USSR, 1971, (03): : 4 - +
[5] Statistical Vowelization of Arabic Text for Speech Synthesis in Speech-to-Speech Translation Systems
Gu, Liang
Zhang, Wei
Tahir, Lazkin
Gao, Yuqing
[J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 509 - 512
[6] Statistical Analysis of the Prosodic Parameters of a Spontaneous Arabic Speech Corpus for Speech Synthesis
Ali, Ikbel Hadj
Mnasri, Zied
[J]. STATISTICAL LANGUAGE AND SPEECH PROCESSING, SLSP 2016, 2016, 9918 : 57 - 67
[7] Duration modelling and evaluation for Arabic statistical parametric speech synthesis
Zangar, Imene
Mnasri, Zied
Colotte, Vincent
Jouvet, Denis
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (06) : 8331 - 8353
[8] Duration modelling and evaluation for Arabic statistical parametric speech synthesis
Imene Zangar
Zied Mnasri
Vincent Colotte
Denis Jouvet
[J]. Multimedia Tools and Applications, 2021, 80 : 8331 - 8353
[9] Statistical parametric speech synthesis for Arabic language using ANN
Ilyes, Rebai
BenAyed, Yassine
[J]. 2014 1ST INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP 2014), 2014, : 452 - 457
[10] Formant analysis in dysphonic patients and automatic Arabic digit speech recognition
Muhammad, Ghulam
Mesallam, Tamer A.
Malki, Khalid H.
Farahat, Mohamed
Alsulaiman, Mansour
Bukhari, Manal
[J]. BIOMEDICAL ENGINEERING ONLINE, 2011, 10

← 1 2 3 4 5 →