Speech Processing for Arabic Speech Synthesis Based on Concatenation Rules

被引:0
|
作者
Imedjdouben F. [1 ]
机构
[1] Scientific and Technical Research Center for the Development of Arabic Language (CRSTDLA), Algiers
关键词
Concatenative synthesis; Diphones; Overlap-add (OLA) method; Pitch marks; Speech processing; Text-to-speech;
D O I
10.1007/s42979-024-02649-z
中图分类号
学科分类号
摘要
The purpose of this paper is to address speech processing phase of the synthesizer to produce artificial speech from the phonetic sequences generated at the linguistic processing level. This research work is part of the realization of a text-to-speech synthesizer based on concatenation rules for standard Arabic language. In this paper, we will detail the different steps we followed to generate the synthetic voice. These steps consist in selecting the prerecorded acoustic units to be concatenated, stored in an acoustic database by using the selection rules. Then these acoustic units undergo specific processing at the concatenation points according to the nature of sounds to be concatenated (voiced, unvoiced) to generate a synthetic speech signal as natural and intelligible as possible. This innovative method that we have developed specifically for the Arabic language acts directly on the acoustic units at the concatenation points (less signal processing on the selected acoustic units, less execution time) and reconstitute at the same time the synthetic voice using concatenation rules based on the overlap-add (OLA) method with a specific processing at the concatenation points. © The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd 2024.
引用
收藏
相关论文
共 50 条
  • [31] A Close Look into the Probablistic Concatenation Model for Corpus-based Speech Synthesis
    Sakai, Shinsuke
    Maia, Ranniery
    Kawai, Hisashi
    Nakamura, Satoshi
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 744 - 747
  • [32] Vowel Onset Point based Waveform Concatenation Technique for Intelligible Speech Synthesis
    Panda, Soumya Priyadarsini
    Nayak, Ajit Kumar
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC), 2017, : 622 - 626
  • [33] Modern Standard Arabic speech disorders corpus for digital speech processing applications
    Alqudah A.A.M.
    Alshraideh M.A.M.
    Abushariah M.A.M.
    Sharieh A.A.S.
    [J]. International Journal of Speech Technology, 2024, 27 (01) : 157 - 170
  • [34] A TDPSOLA Based Concatenation Technique for Bengali Text to Speech Synthesis System Subachan
    Swarna, Kamrunnahar
    Naser, Abu
    [J]. 2016 9TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (ICECE), 2016, : 102 - 105
  • [35] SYNTHESIS OF ARABIC SPEECH USING PHONEME-BASED SYNTHESIZERS
    MANDURAH, MM
    [J]. JOURNAL OF ENGINEERING SCIENCES, 1984, 10 (1-2): : 9 - 14
  • [36] DNN-Based Speech Synthesis for Arabic: Modelling and Evaluation
    Houidhek, Amal
    Colotte, Vincent
    Mnasri, Zied
    Jouvet, Denis
    [J]. STATISTICAL LANGUAGE AND SPEECH PROCESSING, SLSP 2018, 2018, 11171 : 9 - 20
  • [37] RULE-SYNTHESIS OF SPEECH BY WORD CONCATENATION - FIRST STEP
    OLIVE, JP
    NAKATANI, LH
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1974, 55 (03): : 660 - 666
  • [38] MYANMAR SPEECH SYNTHESIS SYSTEM BY USING PHONEME CONCATENATION METHOD
    Hlaing, Chaw Su
    Thida, Aye
    [J]. PROCEEDINGS OF 2017 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION (ICSPC'17), 2017, : 399 - 404
  • [39] COMPUTER SYNTHESIS OF SPEECH BY CONCATENATION OF FORMANT-CODED WORDS
    RABINER, LR
    SCHAFER, RW
    FLANAGAN, JL
    [J]. BELL SYSTEM TECHNICAL JOURNAL, 1971, 50 (05): : 1541 - +
  • [40] Pitch detection and formant analysis of Arabic speech processing
    Cherif, A
    Bouafif, L
    Dabbabi, T
    [J]. APPLIED ACOUSTICS, 2001, 62 (10) : 1129 - 1140