Evaluation of speech unit modelling for HMM-based speech synthesis for Arabic

Cited by: 2
Authors
Houidhek, Amal [1, 2]
Colotte, Vincent [2]
Mnasri, Zied [1]
Jouvet, Denis [2]
Affiliations
[1] Univ Tunis El Manar, Ecole Natl Ingenieurs Tunis, Elect Engn Dept, Tunis, Tunisia
[2] Univ Lorraine, CNRS, INRIA, LORIA, F-54000 Nancy, France
Keywords
Parametric speech synthesis; Statistical modelling; Arabic language; Speech unit modelling; Vowel quantity; Gemination
DOI
10.1007/s10772-018-09558-6
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic and Communication Technology]
Discipline classification codes
0808; 0809
Abstract
This paper investigates the use of hidden Markov models (HMMs) for Modern Standard Arabic speech synthesis. HMM-based speech synthesis systems require each speech unit to be described by a set of contextual features that specify its phonetic, phonological and linguistic aspects. To apply this method to the Arabic language, a study of its particularities was conducted to extract suitable contextual features. Two phenomena are highlighted: vowel quantity and gemination. This work focuses on how to model geminated consonants (resp. long vowels): either as fully-fledged phonemes, or as the same phonemes as their simple (resp. short) counterparts but with a different duration. Four modelling approaches are proposed for this purpose. Subjective and objective evaluations show no important difference between modelling geminated consonants (resp. long vowels) as units distinct from simple consonants (resp. short vowels) and merging them, as long as gemination and vowel quantity information is included in the feature set.
Pages: 895-906
Number of pages: 12