Evaluation of Prosody in Text-to-Speech Synthesis System of Bangla

Times cited: 0
Authors
Basu, Tulika [1]
Saha, Arup [1]
Affiliations
[1] Ctr Dev Adv Comp, Adv Speech Proc Grp, Kolkata, India
Keywords
Prosody; intonation; evaluation; perception
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Prosody plays a crucial role in speech synthesis. To make synthesized speech sound more natural and pleasant to the human ear, various prosody and intonation models, together with emotion models, have been tried over the last few decades. Apart from segmental quality and voice characteristics, the naturalness of any TTS system depends mostly on the quality of its prosody model. Because a prosody model is very hard to evaluate objectively, this work adopts a perceptual comparison method to evaluate it.
Pages: 6