A Novel Intonation Model to Improve the Quality of Tamil Text-to-Speech Synthesis System

被引:0
|
作者
Rajeswari, K. C. [1 ]
UmaMaheswari, P. [2 ]
机构
[1] Sona Coll Technol, Dept Comp Sci & Engn, Salem, India
[2] Anna Univ, Madras Inst Technol, Dept Comp Technol, Madras, Tamil Nadu, India
关键词
speech synthesis; intonation; prosody; corpora; FEATURES;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The global growth of Information and Communication technologies has a greater impact towards the research focus on speech technologies. Especially visually impaired people, vocally challenged people can utilize speech technology enabled devices as it helps them as a lifeline. In the broad sense, Speech technology has two major applications, speech synthesis and speech recognition. Speech synthesis is a popular technique to produce synthetic speech given the input text, whereas speech recognition is the technique that understands human speech and can produce either text or speech as output. Tamil Nadu is one of state in southern region of INDIA, the eleventh biggest state and it also stands seventh as a most populous state in India, has over 74 million population. Among these huge population, only 58 million people are literates, 0.3 million people are visually differently-abler. In spite of the tremendous growth in information and communication technology, there is a great demand for speech technology enabled devices in the regional language Tamil to facilitate the illiterates and especially visually challenged people to avail the technological and communicative facilities at par with others. But, speech applications in Tamil put forward a greater demand for the quality of speech. The government of Tamil Nadu encourages the researchers to innovate software's and devices that in fact help the visually challenged people to enlighten their lives. Researchers are still making efforts to produce highly intelligible and natural sounding speech irrespective of the language. This paper investigates the available prosodic models and details on the prosodic parameters that contribute towards the improvement in quality of speech synthesis systems. This study streamlines the method to develop an intonation model, which is one of the important prosodic parameter to accomplish the quality in terms of naturalness of the speech in Tamil TTS. The subjective evaluation of the proposed method shows the significant improvement in the quality in terms of naturalness of the produced speech.
引用
收藏
页码:335 / 340
页数:6
相关论文
共 50 条
  • [1] A stochastic model of intonation for text-to-speech synthesis
    Véronis, J
    Di Cristo, P
    Courtois, F
    Chaumette, C
    [J]. SPEECH COMMUNICATION, 1998, 26 (04) : 233 - 244
  • [2] A complete text-to-speech synthesis system in Tamil
    Rama, GLJ
    Ramakrishnan, AG
    Muralishankar, R
    Prathibha, R
    [J]. PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON SPEECH SYNTHESIS, 2002, : 191 - 194
  • [3] FUJISAKI INTONATION MODEL IN TURKISH TEXT-TO-SPEECH SYNTHESIS
    Uslu, Baran
    Ilk, H. Goekhan
    [J]. 2009 IEEE 17TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2009, : 133 - 136
  • [4] AN ACCENT-UNIT MODEL OF INTONATION FOR TEXT-TO-SPEECH SYNTHESIS
    JOHNSON, M
    HOUSE, J
    [J]. PROCEEDINGS : INSTITUTE OF ACOUSTICS, VOL 8, PART 7: SPEECH & HEARING, 1986, 8 : 409 - 416
  • [5] INTONATION IN TEXT-TO-SPEECH SYNTHESIS - EVALUATION OF ALGORITHMS
    AKERS, G
    LENNIG, M
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1985, 77 (06): : 2157 - 2165
  • [6] A computational model of intonation for Yoruba text-to-speech synthesis:: Design and analysis
    Odéjobí, OA
    Beaumont, AJ
    Wong, SHS
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2004, 3206 : 409 - 416
  • [7] A Novel Text-to-Speech Synthesis System Using Syllable-Based HMM for Tamil Language
    Manoharan, J. Samuel
    [J]. PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON SUSTAINABLE EXPERT SYSTEMS (ICSES 2021), 2022, 351 : 305 - 314
  • [8] THE INTONATION OF TEXTUAL ANOMALIES IN TEXT-TO-SPEECH
    MONAGHAN, AIC
    [J]. SPEECH COMMUNICATION, 1993, 12 (04) : 371 - 382
  • [9] Paraphrase generation to improve Text-To-Speech Synthesis
    Putois, Ghislain
    Chevelu, Jonathan
    Boidin, Cedric
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 198 - 201
  • [10] SPEAKER INTONATION ADAPTATION FOR TRANSFORMING TEXT-TO-SPEECH SYNTHESIS SPEAKER IDENTITY
    Langarani, Mahsa Sadat Elyasi
    van Santen, Jan
    [J]. 2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 116 - 123