Statistical Formant Speech Synthesis for Arabic

被引:4
|
作者
Jafri, Afshan [1 ]
Sobh, Ibrahim [1 ]
Alkhairy, Ashraf [2 ]
机构
[1] King Saud Univ, Riyadh, Saudi Arabia
[2] King Abdul Aziz City Sci & Technol, Riyadh, Saudi Arabia
关键词
Arabic speech synthesis; Rule based; Formants; Parametric synthesis; Pronunciation algorithm; HMM; PITCH;
D O I
10.1007/s13369-015-1771-1
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This work constructs a hybrid system that integrates formant synthesis and context-dependent Hidden Semi-Markov Models (HSMM). HSMM parameters comprise of formants, fundamental frequency, voicing/frication amplitude, and duration. For HSMM training, formants, fundamental frequency, and voicing/frication amplitude are extracted from waveforms using the Snack toolbox and a decomposition algorithm, and duration is calculated using HMM modeled by multivariate Gaussian distribution. The acoustic features are then generated from the trained HSMM models and combined with default values of complementary acoustic features such as glottal waveform parameters to produce speech waveforms utilizing the Klatt synthesizer. We construct the text processor for phonetic transcription required at the training and synthesis phases by utilizing phonemic pronunciation algorithms. A perceptual test reveals that the statistical formant speech text-to-speech system produces good-quality speech while utilizing features that are small in dimension and close to speech perception cues.
引用
收藏
页码:3151 / 3159
页数:9
相关论文
共 50 条
  • [1] Statistical Formant Speech Synthesis for Arabic
    Afshan Jafri
    Ibrahim Sobh
    Ashraf Alkhairy
    [J]. Arabian Journal for Science and Engineering, 2015, 40 : 3151 - 3159
  • [2] Pitch detection and formant analysis of Arabic speech processing
    Cherif, A
    Bouafif, L
    Dabbabi, T
    [J]. APPLIED ACOUSTICS, 2001, 62 (10) : 1129 - 1140
  • [3] FORMANT BASED SPEECH SYNTHESIS
    HUGHES, PM
    [J]. BRITISH TELECOM TECHNOLOGY JOURNAL, 1988, 6 (02): : 84 - 90
  • [4] ASPECTS OF FORMANT SPEECH SYNTHESIS
    SAPOZHKO.MA
    [J]. TELECOMMUNICATIONS AND RADIO ENGINEER-USSR, 1971, (03): : 4 - +
  • [5] Statistical Vowelization of Arabic Text for Speech Synthesis in Speech-to-Speech Translation Systems
    Gu, Liang
    Zhang, Wei
    Tahir, Lazkin
    Gao, Yuqing
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 509 - 512
  • [6] Statistical Analysis of the Prosodic Parameters of a Spontaneous Arabic Speech Corpus for Speech Synthesis
    Ali, Ikbel Hadj
    Mnasri, Zied
    [J]. STATISTICAL LANGUAGE AND SPEECH PROCESSING, SLSP 2016, 2016, 9918 : 57 - 67
  • [7] Duration modelling and evaluation for Arabic statistical parametric speech synthesis
    Zangar, Imene
    Mnasri, Zied
    Colotte, Vincent
    Jouvet, Denis
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (06) : 8331 - 8353
  • [8] Duration modelling and evaluation for Arabic statistical parametric speech synthesis
    Imene Zangar
    Zied Mnasri
    Vincent Colotte
    Denis Jouvet
    [J]. Multimedia Tools and Applications, 2021, 80 : 8331 - 8353
  • [9] Statistical parametric speech synthesis for Arabic language using ANN
    Ilyes, Rebai
    BenAyed, Yassine
    [J]. 2014 1ST INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP 2014), 2014, : 452 - 457
  • [10] Formant analysis in dysphonic patients and automatic Arabic digit speech recognition
    Muhammad, Ghulam
    Mesallam, Tamer A.
    Malki, Khalid H.
    Farahat, Mohamed
    Alsulaiman, Mansour
    Bukhari, Manal
    [J]. BIOMEDICAL ENGINEERING ONLINE, 2011, 10