A computational model of intonation for Yoruba text-to-speech synthesis: Design and analysis

Cited by: 0
Authors
Odéjobí, OA [1]
Beaumont, AJ [1]
Wong, SHS [1]
Affiliation
[1] Aston Univ, Birmingham B4 7ET, W Midlands, England
Source
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper we present the design and analysis of an intonation model for text-to-speech (TTS) synthesis applications using a combination of Relational Tree (RT) and Fuzzy Logic (FL) technologies. The model is demonstrated using the Standard Yoruba (SY) language. In the proposed intonation model, phonological information extracted from text is converted into an RT. An RT is a sophisticated data structure that symbolically represents the peaks and valleys, as well as the spatial structure, of a waveform in the form of a tree. An initial approximation to the RT, called the Skeletal Tree (ST), is first generated algorithmically. The exact numerical values of the peaks and valleys on the ST are then computed using FL. Quantitative analysis of the results gives RMSE values of 0.56 and 0.71 for peaks and valleys, respectively. Mean Opinion Scores (MOS) of 9.5 and 6.8, on a scale of 1 to 10, were obtained for intelligibility and naturalness, respectively.
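As a rough illustration of the quantities the abstract mentions, the sketch below picks out the local peaks and valleys of a sampled contour and computes a root-mean-square error between two sequences of values. It is not the authors' Relational Tree or Fuzzy Logic procedure, which the abstract does not spell out. The function names `turning_points` and `rmse` and all sample values are hypothetical and chosen only for illustration.

```python
import math


def turning_points(contour):
    """Return (peaks, valleys) as lists of (index, value) pairs.

    Hypothetical helper: only strict interior local extrema are reported,
    so endpoints and flat runs are ignored.
    """
    peaks, valleys = [], []
    for i in range(1, len(contour) - 1):
        prev, cur, nxt = contour[i - 1], contour[i], contour[i + 1]
        if cur > prev and cur > nxt:
            peaks.append((i, cur))
        elif cur < prev and cur < nxt:
            valleys.append((i, cur))
    return peaks, valleys


def rmse(predicted, observed):
    """Root-mean-square error between two equal-length sequences."""
    if len(predicted) != len(observed) or not predicted:
        raise ValueError("sequences must be non-empty and of equal length")
    squared = [(p - o) ** 2 for p, o in zip(predicted, observed)]
    return math.sqrt(sum(squared) / len(squared))


# Made-up contour samples, not data from the paper.
peaks, valleys = turning_points([1, 3, 2, 5, 4])
```

In an evaluation of this kind, `rmse` would presumably be applied separately to the predicted and measured peak values and to the predicted and measured valley values, which would yield one error figure for each.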
Pages: 409 - 416
Number of pages: 8
Related papers
50 in total
  • [1] Intonation contour realisation for Standard Yoruba text-to-speech synthesis: A fuzzy computational approach
    Odejobi, Odetunji A.
    Beaumont, Anthony J.
    Wong, Shun Ha Sylvia
    [J]. COMPUTER SPEECH AND LANGUAGE, 2006, 20 (04) : 563 - 588
  • [2] A stochastic model of intonation for text-to-speech synthesis
    Véronis, J
    Di Cristo, P
    Courtois, F
    Chaumette, C
    [J]. SPEECH COMMUNICATION, 1998, 26 (04) : 233 - 244
  • [3] FUJISAKI INTONATION MODEL IN TURKISH TEXT-TO-SPEECH SYNTHESIS
    Uslu, Baran
    Ilk, H. Goekhan
    [J]. 2009 IEEE 17TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2009, : 133 - 136
  • [4] Design of a Yoruba Language Speech Corpus for the Purposes of Text-to-Speech (TTS) Synthesis
    Dagba, Theophile K.
    Aoga, John O. R.
    Fanou, Codjo C.
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2016, PT I, 2016, 9621 : 161 - 169
  • [5] AN ACCENT-UNIT MODEL OF INTONATION FOR TEXT-TO-SPEECH SYNTHESIS
    JOHNSON, M
    HOUSE, J
    [J]. PROCEEDINGS : INSTITUTE OF ACOUSTICS, VOL 8, PART 7: SPEECH & HEARING, 1986, 8 : 409 - 416
  • [6] INTONATION IN TEXT-TO-SPEECH SYNTHESIS - EVALUATION OF ALGORITHMS
    AKERS, G
    LENNIG, M
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1985, 77 (06) : 2157 - 2165
  • [7] A Novel Intonation Model to Improve the Quality of Tamil Text-to-Speech Synthesis System
    Rajeswari, K. C.
    UmaMaheswari, P.
    [J]. 2014 SIXTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING, 2014, : 335 - 340
  • [8] THE INTONATION OF TEXTUAL ANOMALIES IN TEXT-TO-SPEECH
    MONAGHAN, AIC
    [J]. SPEECH COMMUNICATION, 1993, 12 (04) : 371 - 382
  • [9] A Prosodic Text-to-Speech System for Yoruba Language
    Akinwonmi, Akintoba Emmanuel
    Alese, Boniface Kayode
    [J]. 2013 8TH INTERNATIONAL CONFERENCE FOR INTERNET TECHNOLOGY AND SECURED TRANSACTIONS (ICITST), 2013, : 630 - 635
  • [10] SPEAKER INTONATION ADAPTATION FOR TRANSFORMING TEXT-TO-SPEECH SYNTHESIS SPEAKER IDENTITY
    Langarani, Mahsa Sadat Elyasi
    van Santen, Jan
    [J]. 2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 116 - 123