Evaluation of speech unit modelling for HMM-based speech synthesis for Arabic

Cited by: 2

Authors:

Houidhek, Amal ^{[1,2]}

Colotte, Vincent ^{[2]}

Mnasri, Zied ^{[1]}

Jouvet, Denis ^{[2]}

Affiliations:

[1] Univ Tunis El Manar, Ecole Natl Ingenieurs Tunis, Elect Engn Dept, Tunis, Tunisia

[2] Univ Lorraine, CNRS, INRIA, LORIA, F-54000 Nancy, France

Source:

INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY | 2018, Vol. 21, No. 4

Keywords:

Parametric speech synthesis; Statistical modelling; Arabic language; Speech unit modelling; Vowel quantity; Gemination

DOI:

10.1007/s10772-018-09558-6

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronics and communication technology]

Discipline classification codes:

0808; 0809

Abstract:

This paper investigates the use of hidden Markov models (HMMs) for Modern Standard Arabic speech synthesis. HMM-based speech synthesis systems require each speech unit to be described by a set of contextual features that specify phonetic, phonological and linguistic aspects. To apply this method to the Arabic language, a study of its particularities was conducted to extract suitable contextual features. Two phenomena are highlighted: vowel quantity and gemination. This work focuses on how to model geminated consonants (resp. long vowels): either as fully-fledged phonemes, or as the same phonemes as their simple (resp. short) counterparts but with a different duration. Four modelling approaches are proposed for this purpose. Subjective and objective evaluations show no important difference between using separate modelling units for geminated consonants (resp. long vowels) and simple consonants (resp. short vowels) and merging them into shared units, as long as gemination and vowel quantity information is included in the set of features.
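The contrast between separate and merged modelling units can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not spell out the four approaches, so only the two extremes are shown. The names `Unit`, `to_units`, `VOWELS`, the `":"` suffix for separate units, and the toy phone sequence are all assumptions made for illustration.

```python
from dataclasses import dataclass

# Hypothetical, deliberately tiny vowel inventory (assumption for this example).
VOWELS = {"a", "i", "u"}


@dataclass(frozen=True)
class Unit:
    label: str        # name of the modelling unit
    geminated: bool   # contextual feature: geminated consonant
    long_vowel: bool  # contextual feature: long vowel


def to_units(phones, split_units):
    """Map (base_symbol, is_doubled) pairs to modelling units.

    split_units=True : doubled phones get their own unit label (e.g. "r:").
    split_units=False: doubled phones share the unit of their simple/short
                       counterpart; the distinction lives only in the features.
    """
    units = []
    for base, doubled in phones:
        is_vowel = base in VOWELS
        label = base + ":" if (split_units and doubled) else base
        units.append(
            Unit(
                label=label,
                geminated=doubled and not is_vowel,
                long_vowel=doubled and is_vowel,
            )
        )
    return units


# Toy sequence: d, a, geminated r, long a (illustrative only).
phones = [("d", False), ("a", False), ("r", True), ("a", True)]
split = to_units(phones, split_units=True)
merged = to_units(phones, split_units=False)

print([u.label for u in split])
print([u.label for u in merged])
```

In both variants the `geminated` and `long_vowel` flags stay in the feature set, which is the condition under which the abstract reports no important difference between the two choices.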


Pages: 895-906

Page count: 12
