A computational model of intonation for Yoruba text-to-speech synthesis:: Design and analysis

被引：0

作者：

Odéjobí, OA ^{[1
]}

Beaumont, AJ ^{[1
]}

Wong, SHS ^{[1
]}

机构：

[1] Aston Univ, Birmingham B4 7ET, W Midlands, England

来源：

TEXT, SPEECH AND DIALOGUE, PROCEEDINGS | 2004年 / 3206卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper we present the design and analysis of an intonation model for text-to-speech (TTS) synthesis applications using a combination of Relational Tree (RT) and Fuzzy Logic (FL) technologies. The model is demonstrated using the Standard Yoruba (SY) language. In the proposed intonation model, phonological information extracted from text is converted into an RT. RT is a sophisticated data structure that represents the peaks and valleys as well as the spatial structure of a waveform symbolically in the form of trees. An initial approximation to the RT, called Skeletal Tree (ST), is first generated algorithmically. The exact numerical values of the peaks and valleys on the ST is then computed using FL. Quantitative analysis of the result gives RMSE of 0.56 and 0.71 for peak and valley respectively. Mean Opinion Scores (MOS) of 9.5 and 6.8, on a scale of 1 - - 10, was obtained for intelligibility and naturalness respectively.

引用

页码：409 / 416

页数：8

共 50 条

[1] Intonation contour realisation for Standard Yoruba text-to-speech synthesis:: A fuzzy computational approach
Odejobi, Odetunji A.
Beaumont, Anthony J.
Wong, Shun Ha Sylvia
[J]. COMPUTER SPEECH AND LANGUAGE, 2006, 20 (04): : 563 - 588
[2] A stochastic model of intonation for text-to-speech synthesis
Véronis, J
Di Cristo, P
Courtois, F
Chaumette, C
[J]. SPEECH COMMUNICATION, 1998, 26 (04) : 233 - 244
[3] FUJISAKI INTONATION MODEL IN TURKISH TEXT-TO-SPEECH SYNTHESIS
Uslu, Baran
Ilk, H. Goekhan
[J]. 2009 IEEE 17TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2009, : 133 - 136
[4] Design of a Yoruba Language Speech Corpus for the Purposes of Text-to-Speech (TTS) Synthesis
Dagba, Theophile K.
Aoga, John O. R.
Fanou, Codjo C.
[J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2016, PT I, 2016, 9621 : 161 - 169
[5] AN ACCENT-UNIT MODEL OF INTONATION FOR TEXT-TO-SPEECH SYNTHESIS
JOHNSON, M
HOUSE, J
[J]. PROCEEDINGS : INSTITUTE OF ACOUSTICS, VOL 8, PART 7: SPEECH & HEARING, 1986, 8 : 409 - 416
[6] INTONATION IN TEXT-TO-SPEECH SYNTHESIS - EVALUATION OF ALGORITHMS
AKERS, G
LENNIG, M
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1985, 77 (06): : 2157 - 2165
[7] A Novel Intonation Model to Improve the Quality of Tamil Text-to-Speech Synthesis System
Rajeswari, K. C.
UmaMaheswari, P.
[J]. 2014 SIXTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING, 2014, : 335 - 340
[8] THE INTONATION OF TEXTUAL ANOMALIES IN TEXT-TO-SPEECH
MONAGHAN, AIC
[J]. SPEECH COMMUNICATION, 1993, 12 (04) : 371 - 382
[9] A Prosodic Text-to-Speech System for Yoruba Language
Akinwonmi, Akintoba Emmanuel
Alese, Boniface Kayode
[J]. 2013 8TH INTERNATIONAL CONFERENCE FOR INTERNET TECHNOLOGY AND SECURED TRANSACTIONS (ICITST), 2013, : 630 - 635
[10] SPEAKER INTONATION ADAPTATION FOR TRANSFORMING TEXT-TO-SPEECH SYNTHESIS SPEAKER IDENTITY
Langarani, Mahsa Sadat Elyasi
van Santen, Jan
[J]. 2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 116 - 123

← 1 2 3 4 5 →